

ArchiveBox

@ArchiveBoxApp
Self-hosted open-source internet archiving solution https://t.co/2hZPgQsWat ⑊ Maintained by @theSquashSH ⑊ #webarchiving #internetarchiving #digipres #foss





We just open-sourced LiteParse 🎉 A lightweight, local document parser in the shape of an easy-to-use CLI.
No API calls, no external service, no cloud dependency. Just fast text extraction from common file formats, right from your terminal.
It's built for developers who want parsing that stays on their own infrastructure and gets out of their way. Clean PDFs, DOCX, HTML: run it, get your text, move on.
The output is designed to be fed straight into agents so they can read parsed text and reason over screenshots without any extra wrangling.
When you hit more complex territory like scanned docs, dense tables, or multi-column layouts, that's where LlamaParse picks up. Same philosophy, more horsepower for the hard stuff.
📖 Announcement post: llamaindex.ai/blog/liteparse…
🔗 GitHub: github.com/run-llama/lite…
🎬 Walkthrough: youtu.be/_gcqMGUWN-E

someone just open-sourced a tool that converts pdfs to markdown at 100 pages per second. 100% free. runs entirely on cpu. no expensive gpus needed.





This was a fun project. zdnet.com/article/how-to…

👋 Linkwarden 2.11 is out and we just announced it on Reddit, Lemmy, and our blog!
🚀 What's new in a nutshell:
- ✨ Customizable Readable View
- ✏️ Add Notes to Highlights
- ⚙️ Customizable Dashboard
- 📥 Import from Pocket
- 🌐 Crowdin translation





