Always enjoyed scrolling though these posts, figured I’d give it a go here:
What are your must-have selfhosted services?
Some of mine:
- Adguard Home - Add blocker
- Adguard Home Sync - sync multiple adguard instances
- Bookstack - documentation
- BorgMatic - config driven backup
- Change Detection - monitor websites for changes, prices for example.
- FreshRSS - RSS reader
- Home Assistant - home automation
- KitchenOwl - groceries
- Rclone - sync backups to remote storage
- Traefik - reverse proxy
- Vikunja - todo list
- Wireguard Easy - VPN
Some features like a “tl,dr” bot would probably not even need high end hardware, because it does not matter if it takes ten minutes for a summary.
Features like a chat bot do not belong into paperless IMO.
True, that’s a good take. Tl;dr for the masses! Do you think an internal or external tl;dr bot would be embraced by the Paperless community?
It could either process the (entire or selected) collection, adding the new tl;dr entries to the files “behind the scenes”, just based on some general settings/prompt to optimize for the desired output – or it could do the work on-demand on a per-document basis, either based on the general settings or custom settings, though this could be a flow-breaking bottleneck in situations where the hardware isn’t powerful enough to keep up with you. However, that only seems like a temporary problem to me, since hardware, LLMs etc. will keep advancing and getting more powerful/efficient/cheap/noice.
Right – but, opposingly to that, Paperless definitely do belong into some chatbots!
I think more “intelligence” in parsing the documents would be well-received. Just as OCR is fundamental to paperless, AI features could be the next step forward. Automatically extract the relevant positions of e.g. a bill, understand the document (and select the correct date, not my birthday) and apply correct tags to new documents.
Definitely!
Yes, I think that’s the way to go. If the paperless-ngx team doesn’t believe in following that path, someone else will probably fork the project and do it, or build something with similar capabilities “from scratch”. Then, it’ll be interesting to see what’s coming forth of open-source models with capabilites similar to GPT-4Vision… . . . . 🤯