
Not quite self-hosting, but in the same spirit I've slowly been working on a simple local archival system for anything I don't want to lose. It's changed my life.

Even across years of content, it's required less storage space than I expected. The more I archive, the less I need to rely on online search engines or worry about linkrot. It's also helped me cut down on how many tabs I keep open for fear of losing information.

If I can't recall some piece of information, I can, for example, run a fuzzy search across the text of every article I've saved in a specific category. If I find some obscure fix deep in an old Reddit or HN thread, you bet I'm archiving it, so if I run into the same issue a year later I can fix it again without trawling through 50 Google results.



What do you use to organize all of this unstructured data in a way that is searchable and retrievable?


It's somewhat structured; I use both broad categories and a tag system. I can also add comment text to archived pages. It's all patched together with shell scripts and some Lua (since that's what I'm familiar with). I use `ripgrep` for searching. It's fast enough for me even when I don't use any kind of category filtering, but I have a beefy computer and use NVMe drives, so YMMV.
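
For anyone who wants to try something similar, here is a rough sketch of a category-filtered search wrapped around `ripgrep`. This is not my actual setup (that's shell and Lua); it's Python for illustration. It assumes a hypothetical layout with one directory per category under an archive root, with pages saved as text. The function names and paths are made up. Also note that `rg` does case-insensitive regex matching, not true fuzzy matching.

```python
import subprocess
from pathlib import Path
from typing import List, Optional


def build_rg_command(query: str, archive_root: str,
                     category: Optional[str] = None) -> List[str]:
    """Build an rg invocation that lists archived files matching `query`."""
    # Assumed (hypothetical) layout: <archive_root>/<category>/<saved page>
    root = Path(archive_root)
    target = root / category if category else root
    return [
        "rg",
        "--ignore-case",         # match regardless of letter case
        "--files-with-matches",  # print only the paths of matching files
        "--",                    # end of flags, so a query starting with "-" is safe
        query,
        str(target),
    ]


def search_archive(query: str, archive_root: str,
                   category: Optional[str] = None) -> List[str]:
    """Run rg and return the paths of matching files (requires rg on PATH)."""
    cmd = build_rg_command(query, archive_root, category)
    result = subprocess.run(cmd, capture_output=True, text=True)
    # rg exits with 0 on a match, 1 when nothing matches, 2 on errors
    if result.returncode not in (0, 1):
        raise RuntimeError(result.stderr.strip())
    return result.stdout.splitlines()
```

Calling `search_archive("nvme timeout", "/path/to/archive", category="linux")` would only scan the `linux` directory; leaving `category` out searches the whole archive. Tags and comments aren't covered here.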



