Hoarder

Self-hosted comprehensive bookmark and content archival system

Hoarder is a self-hosted application for comprehensive web content archiving and organization, combining traditional bookmarking with advanced features like full-page archival, AI-powered tagging, and OCR capabilities. It serves as a personal knowledge management system that puts you in control of your data.

What we love ❤️

  • Open-source and actively maintained
  • Complete control through self-hosting
  • Automatic metadata extraction from links
  • Full-page archival protects against link rot
  • AI-powered tagging (supports both ChatGPT and local models via Ollama)
  • OCR functionality for image text extraction
  • Full-text search across all stored content
  • REST API available for custom integrations

Worth noting 💡

  • Requires self-hosting setup and maintenance
  • Storage requirements scale with archived content
  • Export/import currently limited: system backups require manual DATA_DIR copying with stopped containers; basic per-user export only covers links and notes without cached content; lacks user-friendly backup interface

Hoarder delivers a robust self-hosted solution for comprehensive web content archival and organization with notable AI-enhanced features. While it requires technical knowledge for setup and maintenance, the combination of automatic content processing, full-page archival, and cross-platform access makes it a powerful tool for serious data collection. It's ideal for users who want complete control over their bookmarking system and value protection against link rot.