Hoarder
Self-hosted comprehensive bookmark and content archival system
Hoarder is a self-hosted application for comprehensive web content archiving and organization, combining traditional bookmarking with advanced features like full-page archival, AI-powered tagging, and OCR capabilities. It serves as a personal knowledge management system that puts you in control of your data.
What we love ❤️
- Open-source and actively maintained
- Complete control through self-hosting
- Automatic metadata extraction from links
- Full-page archival protects against link rot
- AI-powered tagging (supports both ChatGPT and local models via Ollama)
- OCR functionality for image text extraction
- Full-text search across all stored content
- REST API available for custom integrations
Worth noting 💡
- Requires self-hosting setup and maintenance
- Storage requirements scale with archived content
- Export/import is currently limited: a full system backup means manually copying DATA_DIR while the containers are stopped, the basic per-user export covers only links and notes (not cached content), and there is no user-friendly backup interface
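Since a full backup comes down to copying DATA_DIR while the containers are stopped, that step can be scripted. The following is a minimal sketch, not an official Hoarder tool: the function name `backup_data_dir`, the archive naming scheme, and the example paths are assumptions made for illustration, and stopping and restarting the containers is left to whatever container tooling you use.

```python
import tarfile
import time
from pathlib import Path


def backup_data_dir(data_dir: str, dest_dir: str) -> Path:
    """Pack a data directory into a timestamped .tar.gz archive.

    Hypothetical helper: run it only after the containers have been
    stopped, so the files on disk are not changing during the copy.
    """
    src = Path(data_dir)
    if not src.is_dir():
        raise FileNotFoundError(f"data directory not found: {src}")

    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)

    # Archive name is an assumed convention, e.g. hoarder-data-20250101-120000.tar.gz
    stamp = time.strftime("%Y%m%d-%H%M%S")
    archive = dest / f"hoarder-data-{stamp}.tar.gz"

    # Store everything under the directory's own name inside the archive.
    with tarfile.open(archive, "w:gz") as tar:
        tar.add(src, arcname=src.name)
    return archive


if __name__ == "__main__":
    # Example paths are placeholders; point them at your own DATA_DIR
    # and a backup location outside of it.
    print(backup_data_dir("./data", "./backups"))
```

Restoring is the reverse: with the containers stopped, extract the archive back to the DATA_DIR location and start the containers again. Keep the backup destination outside DATA_DIR so archives are not included in later backups.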
Hoarder delivers a robust self-hosted solution for web content archival and organization, with notable AI-enhanced features. While it requires technical knowledge for setup and maintenance, the combination of automatic content processing, full-page archival, and cross-platform access makes it a powerful tool for anyone who collects and archives web content at scale. It's ideal for users who want complete control over their bookmarking system and value protection against link rot.