ChatDOC

339 posts

ChatDOC banner
ChatDOC

ChatDOC

@chatdoc_ai

ChatGPT-based file-reading assistant🤖 Quickly extract, locate and summarize information from docs.🚀 Citations are presented for fact-checking 🔍

เข้าร่วม Mart 2023
201 กำลังติดตาม3.3K ผู้ติดตาม
ทวีตที่ปักหมุด
ChatDOC
ChatDOC@chatdoc_ai·
👏We just released OCRflux — a multimodel PDF-to-Markdown toolkit that handles multi-column layouts, table parsing & cross-page merging. 🔵Built on top of finetuning the multimodal LLM Qwen2.5-VL-3B-Instruct, it delivers 3× higher throughput on a GTX 3090 GPU than the olmOCR-7B baseline model. 🔵It achieves 0.967 Edit Distance Similarity vs olmOCR’s 0.872 vs Nanonets's 0.858. Try demo:2ly.link/28rzz GitHub:github.com/chatdoc-com/OC… Hugging Face:huggingface.co/ChatDOC/OCRFlu… 😃Key features in the threads #OCRFlux #OCR #PDF2Markdown #Qwen #olmocr #OpenSource #Benchmark #DocumentAI
ChatDOC tweet media
English
3
2
16
3.9K
ChatDOC
ChatDOC@chatdoc_ai·
We’ve officially upgraded ChatDOC'S default model to GPT-5 mini—bringing you faster reasoning and deeper document insights at zero extra cost. 🖼️Plus, we’ve leveled up our Multimodal Question Packages to GPT-5.3 for unmatched accuracy with charts and tables. 💎 Smarter 💎 Faster 💎 No extra costs Go to ChatDOC.com and feel the difference of #GPT5 #ChatPDF #TechUpdate #ChatDOC #Productivity
English
1
1
2
184
ChatDOC
ChatDOC@chatdoc_ai·
📚✨ NEW UPDATE ALERT! ✨📚 Now you can chat with scanned PDFs for FREE! Engage with photographed study materials like old textbooks, complex forms, scattered notes, or screenshots Easily extract any information you need with a fully searchable and interactive format. Simplify your learning by centralizing all resources in one place for cross-file Q&A, data extraction, or even translation. Try it now: 2ly.link/27e87
ChatDOC tweet mediaChatDOC tweet mediaChatDOC tweet mediaChatDOC tweet media
English
0
0
0
125
ChatDOC
ChatDOC@chatdoc_ai·
🚀 ChatPaper Update We’ve made two exciting upgrades! 1️⃣ Added daily sync of arXiv – Information Retrieval papers 📚 2️⃣ Introduced AI Summary – get a full-text understanding and key takeaways in seconds 🤖 Explore the latest IR research with instant summaries 👉 chatpaper.com
English
0
0
1
144
ChatDOC
ChatDOC@chatdoc_ai·
🚀 Launch your AI Help Desk in minutes with ChatDOC Studio ! Turn your business docs into a 24/7 AI assistant — with answer tracing, brand customization, no-code deployment, and API integration. 💡 Train with your files or website 🔍 Smarter search via Context Retrieval API 📄 Parse PDFs into structured, LLM-ready data 👥 Collaborate in one shared Knowledge Hub 👉 Try it now: chatdoc.studio #AI #Chatbot #SaaS #NoCode #Agents #ChatDOC #GPT
English
0
0
4
195
ChatDOC
ChatDOC@chatdoc_ai·
📷 Immersive Q&A Mode is here! Hide the document panel and focus only on the answers you need. A cleaner, distraction-free way to browse and interact with your documents. 👉Try it now → chatdoc.com
English
0
0
1
170
ChatDOC
ChatDOC@chatdoc_ai·
TGIF! 🎉 Just in time for the weekend, ChatDOC is rolling out GPT-5-Mini, now ready for your document Q&A! Plus, OpenAI and OpenRouter have expanded to support GPT-5 and GPT-5-Mini for a fully personalized experience. Why not give it a try this weekend? 📚 #AI #GPT5 #ChatDOC
ChatDOC tweet media
English
1
0
3
369
ChatDOC รีทวีตแล้ว
Tom Dörr
Tom Dörr@tom_doerr·
turns PDFs and images into clean Markdown with AI
Tom Dörr tweet media
English
1
4
48
4.7K
ChatDOC รีทวีตแล้ว
karminski-牙医
karminski-牙医@karminski3·
卧槽, 真的 SOTA 了, 正在准备做一个 PDF 转 Markdown 的领域大模型评测, 结果我发现 ChatDOC 的 OCRFlux-3B 在我测试的所有场景均达到了现阶段最理想的效果. 下面直接为大家带来实测: 场景A: 让我们直接来现阶段的 PDF 转化试金石——发票转 Markdown 可以看到, 重要的 购买方, 单价, 价税合计, 销售方均十分准确, 并且没有把印章错误的当成内容. 而且形成的 Markdown 表格行列准确, 甚至还有合并单元格的表格. 以往的测试别说表格, 连内容都不一定识别准确. 更别提生成正确的 Markdown 表格了. 当然也不是没有瑕疵. 如果 OCRFlux 可以把表头和页脚也能准确识别进去就完美了. 现在生成的内容用来识别普票还是专票还是有困难的. #OCRFlux #ChatDOC #Markdown #PDF
karminski-牙医 tweet mediakarminski-牙医 tweet mediakarminski-牙医 tweet media
中文
25
131
744
101.9K
ChatDOC
ChatDOC@chatdoc_ai·
@ProgramErgoSum @rohanpaul_ai Right now we can work with tables, but can't handle visual elements like diagrams or charts yet. If we add support for saving those locally in the future, we'll definitely let you know!
English
0
0
2
24
Rohan Paul
Rohan Paul@rohanpaul_ai·
PDF parsing is still painful because LLMs reorder text in complex layouts, break tables across pages, and fail on graphs or images. 💡Testing the new open-source OCRFlux model, and here the results are really good for a change. So OCRFlux is a multimodal, LLM based toolkit for converting PDFs and images into clean, readable, plain Markdown text. Because the underlying VLM is only 3B param, it runs even on a 3090 GPU. The model is available on @huggingface . The engine that powers the OCRFlux, teaches the model to rebuild every page and then stitch fragments across pages into one clean Markdown file. It bundles one vision language model with 3B parameters that was fine-tuned from Qwen 2.5-VL-3B-Instruct for both page parsing and cross-page merging. OCRFlux reads raw page images and, guided by task prompts, outputs Markdown for each page and merges split elements across pages. The evaluation shows Edit Distance Similarity (EDS) 0.967 and cross‑page table Tree Edit Distance 0.950, so the parser is both accurate and layout aware. How it works while parsing each page - Convert into text with a natural reading order, even in the presence of multi-column layouts, figures, and insets - Support for complicated tables and equations - Automatically removes headers and footers Cross-page table/paragraph merging - Cross-page table merging - Cross-page paragraph merging A compact vision‑language models can beat bigger models once cross‑page context is added. 🧵 1/n Read on 👇
English
60
172
1.3K
149.1K
Joseph Perry
Joseph Perry@infra_scale_5·
@pgillian____ been collecting terabytes of PDF images from research papers with Burla (infra for parallel jobs). would love to throw a few hundred GPUs at OCRFlux and finally decode academia’s layout hell
English
3
0
2
76
ChatDOC
ChatDOC@chatdoc_ai·
@venki_lfc Thank you for your feedback! Could you share some screenshots or examples of the issues you’re facing? That would help us understand better and figure out what’s going on
English
1
0
0
43
Venki LFC
Venki LFC@venki_lfc·
@chatdoc_ai Thanks for the great work. I have faced some inconsistencies with the model though. The model's response sometimes does not start as a dictionary, hence the following post processing steps fail. The model sometimes skips the table and just outputs the text in the page.
English
2
0
1
61
ChatDOC
ChatDOC@chatdoc_ai·
👏We just released OCRflux — a multimodel PDF-to-Markdown toolkit that handles multi-column layouts, table parsing & cross-page merging. 🔵Built on top of finetuning the multimodal LLM Qwen2.5-VL-3B-Instruct, it delivers 3× higher throughput on a GTX 3090 GPU than the olmOCR-7B baseline model. 🔵It achieves 0.967 Edit Distance Similarity vs olmOCR’s 0.872 vs Nanonets's 0.858. Try demo:2ly.link/28rzz GitHub:github.com/chatdoc-com/OC… Hugging Face:huggingface.co/ChatDOC/OCRFlu… 😃Key features in the threads #OCRFlux #OCR #PDF2Markdown #Qwen #olmocr #OpenSource #Benchmark #DocumentAI
ChatDOC tweet media
English
3
2
16
3.9K
ChatDOC
ChatDOC@chatdoc_ai·
@aivovaske @rohanpaul_ai Thanks for sharing ! OCRFlux currently works best with English & Chinese docs. For other languages, the parsing might not be perfect yet😅 Will let you know when we expand support!
English
1
0
1
104
ChatDOC
ChatDOC@chatdoc_ai·
🤩Seriously made our day seeing you dig our multi-lingual &column doc parsing! 👇Everyone come give it a spin and tell us what you think! 2ly.link/27e87 #PDFparsing #OCRFlux #ocr #opensourceai
Rohan Paul@rohanpaul_ai

PDF parsing is still painful because LLMs reorder text in complex layouts, break tables across pages, and fail on graphs or images. 💡Testing the new open-source OCRFlux model, and here the results are really good for a change. So OCRFlux is a multimodal, LLM based toolkit for converting PDFs and images into clean, readable, plain Markdown text. Because the underlying VLM is only 3B param, it runs even on a 3090 GPU. The model is available on @huggingface . The engine that powers the OCRFlux, teaches the model to rebuild every page and then stitch fragments across pages into one clean Markdown file. It bundles one vision language model with 3B parameters that was fine-tuned from Qwen 2.5-VL-3B-Instruct for both page parsing and cross-page merging. OCRFlux reads raw page images and, guided by task prompts, outputs Markdown for each page and merges split elements across pages. The evaluation shows Edit Distance Similarity (EDS) 0.967 and cross‑page table Tree Edit Distance 0.950, so the parser is both accurate and layout aware. How it works while parsing each page - Convert into text with a natural reading order, even in the presence of multi-column layouts, figures, and insets - Support for complicated tables and equations - Automatically removes headers and footers Cross-page table/paragraph merging - Cross-page table merging - Cross-page paragraph merging A compact vision‑language models can beat bigger models once cross‑page context is added. 🧵 1/n Read on 👇

English
2
0
5
870
ChatDOC
ChatDOC@chatdoc_ai·
Thanks for the shoutout! 🙌 Even the messiest multi-page tables get stitched together perfectly by our magic. Try it yourself at 2ly.link/27e87
ハカセ アイ(Ai-Hakase)🐾最新トレンドAIのためのX 🐾@ai_hakase_

PDFの常識を変える?!🤯✨ 新ツール「OCRFlux」が登場しましたよ! ✎. FYIG: x.com/aigclink/statu… PDFをMarkdownに変換する新しいツール「OCRFlux」が発表されたそうです!これがもう、すごい機能満載なんですって! 一番驚いたのは、PDFでページをまたぐ表や段落を、自動で繋げてMarkdownにしてくれること!😲 これって、今まで手作業で調整していた部分ですよね。論文や長い報告書を扱う方には、革命的な機能なんじゃないでしょうか?! なんと98.3%という驚異の正確率なんですって!✨ しかも、多列レイアウトや、中国語と英語が混ざったドキュメントまで、ちゃんと自然な読み順で変換してくれるそうですよ。グローバルな資料作成にも心強いですね! 他の有名なOCRツールと比べても、ベンチマークで圧倒的な性能を示しているとのこと!そして、GTX 3090などのGPUがあれば、3Bパラメータモデルでめちゃくちゃ高速に処理できちゃうそうです!🚀 作業効率がグンと上がりそうでワクワクしますね! 実際の変換の様子は動画で見られますが、まるで魔法みたいにサクサク動いていましたよ!これがあれば、PDFからのデータ抽出や再利用が格段にラクになりますよね。 例えば、古いPDF資料をMarkdown化して検索しやすくしたり、議事録をPDFでもらってもすぐに編集可能なテキストに変換したり…アイデア次第で色々な使い方ができそうです! 詳しい情報や、もしかしたら試せるデモもあるかもしれませんね!ぜひチェックしてみてください! PDFを扱う皆さんの救世主になるかもしれませんね!これからが本当に楽しみです!😊 #OCRFlux #PDF変換 #AIツール

English
0
1
2
579
ChatDOC รีทวีตแล้ว
Csaba Kissi
Csaba Kissi@csaba_kissi·
This PDF-to-Markdown toolkit can convert PDFs and images into clean Markdown, parse complex tables, and much more... Open this 🧵 ↓ Bookmark for later 🔖
English
25
6
60
10.5K
ChatDOC
ChatDOC@chatdoc_ai·
😩Extract tables from PDFs = pure pain? Not anymore! OCRFlux transforms cross-page tables into structured ones, perfectly preserving layouts and auto-removing duplicate headers. Check out the comparison between olmOCR and OCRFlux 👇 Or try it at: 2ly.link/28rzz #OCRFlux #ocr #OpenSource #PDFparser #olmOCR
English
2
0
5
455