
Jerry Chen
7.8K posts

Jerry Chen
@jerrychen
Restless. Irreverent. Greylock GP. @GreylockVC



LlamaParse Agentic Plus mode now delivers precise visual grounding with bounding boxes for the most challenging document elements. Our latest update brings major improvements to how we handle complex visual content: 📐 Complex LaTex formulas - accurately parse mathematical expressions with precise positioning ✍️ Handwriting recognition - extract handwritten text with location coordinates 📊 Complex layouts - navigate multi-column documents and intricate formatting 📈 Infographics and charts - identify and extract data visualizations with spatial context This means you can now build applications that not only extract text from documents but also understand exactly where that content appears on the page - perfect for creating more intelligent document analysis workflows. Try LlamaParse Agentic Plus mode and see how visual grounding transforms your document parsing capabilities: cloud.llamaindex.ai/?utm_source=so…








Ever wondered what we mean by 'agentic' OCR? It's parsing that reasons about documents instead of just reading them. Agentic OCR adapts to layout changes by treating document processing as a goal-oriented task rather than simple text extraction. 🧠 Uses multimodal language models to understand document structure and context, not just convert pixels to text 📍 Provides visual grounding with bounding boxes so every extracted field traces back to its source location 🔄 Runs self-correction loops to catch inconsistencies before they reach your downstream systems ⚡ Achieves 90-95%+ straight-through processing rates on new document formats without template setup This matters for legal teams processing M&A due diligence, healthcare admins handling medical forms, and finance teams reconciling reports across subsidiaries. The agent doesn't just extract data - it completes document workflows with built-in validation and business logic. LlamaParse is our implementation of agentic OCR. Get 10,000 free credits to test it against your actual documents: Read the full breakdown: llamaindex.ai/blog/agentic-o…


Most agents don’t fail on models… they fail on context: those ugly, messy, complex documents that trip up even the latest LLMs (PDFs, tables, messy scans). Don't worry. We got you. 🚀 VC-backed (seed+) startup? Join the LlamaParse Startup Program: ✅ free credits ✅ dedicated slack channel + priority support ✅ alignment call with our founder Jerry Liu ✅ community spotlight (millions of devs) ✅ production-ready ingestion pipelines Apply today spots are limited → llamaindex.ai/startups



If you’re working with lots of slide decks and need a better way to search through them, Surreal Slides makes it simple 🌀 Built around LlamaParse, it parses presentation files into clean, structured data, turning raw slides into something AI can truly understand. Each slide is extracted, summarized, and organized before being stored in @SurrealDB for flexible retrieval. From there, you can query your entire presentation library in natural language through an agentic interface: no need to manually scan files or remember where a specific slide lives. Take a look at the demo below👇 GitHub Repository: github.com/run-llama/surr…

















