ChatDOC (@chatdoc_ai) - โปรไฟล์ Twitter

ทวีตที่ปักหมุด

ChatDOC@chatdoc_ai·17 Haz

👏We just released OCRflux — a multimodel PDF-to-Markdown toolkit that handles multi-column layouts, table parsing & cross-page merging. 🔵Built on top of finetuning the multimodal LLM Qwen2.5-VL-3B-Instruct, it delivers 3× higher throughput on a GTX 3090 GPU than the olmOCR-7B baseline model. 🔵It achieves 0.967 Edit Distance Similarity vs olmOCR’s 0.872 vs Nanonets's 0.858. Try demo:2ly.link/28rzz GitHub:github.com/chatdoc-com/OC… Hugging Face:huggingface.co/ChatDOC/OCRFlu… 😃Key features in the threads #OCRFlux #OCR #PDF2Markdown #Qwen #olmocr #OpenSource #Benchmark #DocumentAI

English

3

2

16

3.9K

ChatDOC@chatdoc_ai·12 Mar

We’ve officially upgraded ChatDOC'S default model to GPT-5 mini—bringing you faster reasoning and deeper document insights at zero extra cost. 🖼️Plus, we’ve leveled up our Multimodal Question Packages to GPT-5.3 for unmatched accuracy with charts and tables. 💎 Smarter 💎 Faster 💎 No extra costs Go to ChatDOC.com and feel the difference of #GPT5 #ChatPDF #TechUpdate #ChatDOC #Productivity

English

1

2

184

ChatDOC@chatdoc_ai·22 Ara

📚✨ NEW UPDATE ALERT! ✨📚 Now you can chat with scanned PDFs for FREE! Engage with photographed study materials like old textbooks, complex forms, scattered notes, or screenshots Easily extract any information you need with a fully searchable and interactive format. Simplify your learning by centralizing all resources in one place for cross-file Q&A, data extraction, or even translation. Try it now: 2ly.link/27e87

English

0

125

ChatDOC@chatdoc_ai·12 Ara

Glad ChatDOC Studio is helping 🙌 really made our day!

Ryan Lazuka@lazukars

🚨 This is WILD. A new tool just dropped that might quietly replace half of the “UI/UX agent startups” you’ve been seeing. It’s called ChatDoc Studio and it turns a single screenshot of your landing page into a full AI design review + redesign. Here’s why this matters 👇

English

1

0

2

168

ChatDOC@chatdoc_ai·29 Eki

🚀 ChatPaper Update We’ve made two exciting upgrades! 1️⃣ Added daily sync of arXiv – Information Retrieval papers 📚 2️⃣ Introduced AI Summary – get a full-text understanding and key takeaways in seconds 🤖 Explore the latest IR research with instant summaries 👉 chatpaper.com

English

0

1

144

ChatDOC@chatdoc_ai·20 Eki

🚀 Launch your AI Help Desk in minutes with ChatDOC Studio ! Turn your business docs into a 24/7 AI assistant — with answer tracing, brand customization, no-code deployment, and API integration. 💡 Train with your files or website 🔍 Smarter search via Context Retrieval API 📄 Parse PDFs into structured, LLM-ready data 👥 Collaborate in one shared Knowledge Hub 👉 Try it now: chatdoc.studio #AI #Chatbot #SaaS #NoCode #Agents #ChatDOC #GPT

English

0

4

195

ChatDOC@chatdoc_ai·22 Eyl

We just launched ChatDOC Studio! 🚀 Transform any documents into an expert AI assistant, get highly accurate, traceable answers for every query. Start building today with our FREE plan! - 1,000 Credits per month - Create 3 AI apps - Add 3 Team members - 100MB of knowledge base storage linkly.link/2FEon #AI #Chatbot #NoCode #CustomerSupport #AIAgent #aiforbusinessleaders

English

0

2

207

ChatDOC@chatdoc_ai·12 Eyl

📷 Immersive Q&A Mode is here! Hide the document panel and focus only on the answers you need. A cleaner, distraction-free way to browse and interact with your documents. 👉Try it now → chatdoc.com

English

0

1

170

ChatDOC@chatdoc_ai·8 Ağu

TGIF! 🎉 Just in time for the weekend, ChatDOC is rolling out GPT-5-Mini, now ready for your document Q&A! Plus, OpenAI and OpenRouter have expanded to support GPT-5 and GPT-5-Mini for a fully personalized experience. Why not give it a try this weekend? 📚 #AI #GPT5 #ChatDOC

English

1

0

3

369

ChatDOC รีทวีตแล้ว

Tom Dörr@tom_doerr·10 Tem

turns PDFs and images into clean Markdown with AI

English

1

4

48

4.7K

ChatDOC รีทวีตแล้ว

karminski-牙医@karminski3·4 Tem

卧槽, 真的 SOTA 了, 正在准备做一个 PDF 转 Markdown 的领域大模型评测, 结果我发现 ChatDOC 的 OCRFlux-3B 在我测试的所有场景均达到了现阶段最理想的效果. 下面直接为大家带来实测: 场景A: 让我们直接来现阶段的 PDF 转化试金石——发票转 Markdown 可以看到, 重要的购买方, 单价, 价税合计, 销售方均十分准确, 并且没有把印章错误的当成内容. 而且形成的 Markdown 表格行列准确, 甚至还有合并单元格的表格. 以往的测试别说表格, 连内容都不一定识别准确. 更别提生成正确的 Markdown 表格了. 当然也不是没有瑕疵. 如果 OCRFlux 可以把表头和页脚也能准确识别进去就完美了. 现在生成的内容用来识别普票还是专票还是有困难的. #OCRFlux #ChatDOC #Markdown #PDF

中文

25

131

744

101.9K

ChatDOC@chatdoc_ai·3 Tem

@ProgramErgoSum @rohanpaul_ai Right now we can work with tables, but can't handle visual elements like diagrams or charts yet. If we add support for saving those locally in the future, we'll definitely let you know!

English

0

2

24

{"ನಾಗೇಶ್" : "Nagesh"}@ProgramErgoSum·2 Tem

@rohanpaul_ai Does it handle charts e.g , pie chart, histogram, etc.? How about content with color highlight?

English

1

0

2

210

Rohan Paul@rohanpaul_ai·1 Tem

PDF parsing is still painful because LLMs reorder text in complex layouts, break tables across pages, and fail on graphs or images. 💡Testing the new open-source OCRFlux model, and here the results are really good for a change. So OCRFlux is a multimodal, LLM based toolkit for converting PDFs and images into clean, readable, plain Markdown text. Because the underlying VLM is only 3B param, it runs even on a 3090 GPU. The model is available on @huggingface . The engine that powers the OCRFlux, teaches the model to rebuild every page and then stitch fragments across pages into one clean Markdown file. It bundles one vision language model with 3B parameters that was fine-tuned from Qwen 2.5-VL-3B-Instruct for both page parsing and cross-page merging. OCRFlux reads raw page images and, guided by task prompts, outputs Markdown for each page and merges split elements across pages. The evaluation shows Edit Distance Similarity (EDS) 0.967 and cross‑page table Tree Edit Distance 0.950, so the parser is both accurate and layout aware. How it works while parsing each page - Convert into text with a natural reading order, even in the presence of multi-column layouts, figures, and insets - Support for complicated tables and equations - Automatically removes headers and footers Cross-page table/paragraph merging - Cross-page table merging - Cross-page paragraph merging A compact vision‑language models can beat bigger models once cross‑page context is added. 🧵 1/n Read on 👇

English

60

172

1.3K

149.1K

ChatDOC@chatdoc_ai·3 Tem

@infra_scale_5 @pgillian____ thanks for the shoutout!

English

0

1

37

Joseph Perry@infra_scale_5·2 Tem

@pgillian____ been collecting terabytes of PDF images from research papers with Burla (infra for parallel jobs). would love to throw a few hundred GPUs at OCRFlux and finally decode academia’s layout hell

English

3

0

2

76

ChatDOC@chatdoc_ai·3 Tem

@venki_lfc Thank you for your feedback! Could you share some screenshots or examples of the issues you’re facing? That would help us understand better and figure out what’s going on

English

1

0

43

Venki LFC@venki_lfc·3 Tem

@chatdoc_ai Thanks for the great work. I have faced some inconsistencies with the model though. The model's response sometimes does not start as a dictionary, hence the following post processing steps fail. The model sometimes skips the table and just outputs the text in the page.

English

2

0

1

61

ChatDOC@chatdoc_ai·17 Haz

👏We just released OCRflux — a multimodel PDF-to-Markdown toolkit that handles multi-column layouts, table parsing & cross-page merging. 🔵Built on top of finetuning the multimodal LLM Qwen2.5-VL-3B-Instruct, it delivers 3× higher throughput on a GTX 3090 GPU than the olmOCR-7B baseline model. 🔵It achieves 0.967 Edit Distance Similarity vs olmOCR’s 0.872 vs Nanonets's 0.858. Try demo:2ly.link/28rzz GitHub:github.com/chatdoc-com/OC… Hugging Face:huggingface.co/ChatDOC/OCRFlu… 😃Key features in the threads #OCRFlux #OCR #PDF2Markdown #Qwen #olmocr #OpenSource #Benchmark #DocumentAI

English

3

2

16

3.9K

ChatDOC@chatdoc_ai·2 Tem

@aivovaske @rohanpaul_ai Thanks for sharing ! OCRFlux currently works best with English & Chinese docs. For other languages, the parsing might not be perfect yet😅 Will let you know when we expand support!

English

1

0

1

104

aivovaske@aivovaske·2 Tem

@rohanpaul_ai tried. no great success.

English

1

0

1

312

ChatDOC@chatdoc_ai·2 Tem

🤩Seriously made our day seeing you dig our multi-lingual &column doc parsing! 👇Everyone come give it a spin and tell us what you think! 2ly.link/27e87 #PDFparsing #OCRFlux #ocr #opensourceai

Rohan Paul@rohanpaul_ai

PDF parsing is still painful because LLMs reorder text in complex layouts, break tables across pages, and fail on graphs or images. 💡Testing the new open-source OCRFlux model, and here the results are really good for a change. So OCRFlux is a multimodal, LLM based toolkit for converting PDFs and images into clean, readable, plain Markdown text. Because the underlying VLM is only 3B param, it runs even on a 3090 GPU. The model is available on @huggingface . The engine that powers the OCRFlux, teaches the model to rebuild every page and then stitch fragments across pages into one clean Markdown file. It bundles one vision language model with 3B parameters that was fine-tuned from Qwen 2.5-VL-3B-Instruct for both page parsing and cross-page merging. OCRFlux reads raw page images and, guided by task prompts, outputs Markdown for each page and merges split elements across pages. The evaluation shows Edit Distance Similarity (EDS) 0.967 and cross‑page table Tree Edit Distance 0.950, so the parser is both accurate and layout aware. How it works while parsing each page - Convert into text with a natural reading order, even in the presence of multi-column layouts, figures, and insets - Support for complicated tables and equations - Automatically removes headers and footers Cross-page table/paragraph merging - Cross-page table merging - Cross-page paragraph merging A compact vision‑language models can beat bigger models once cross‑page context is added. 🧵 1/n Read on 👇

English

2

0

5

870

ChatDOC@chatdoc_ai·1 Tem

Thanks for the shoutout! 🙌 Even the messiest multi-page tables get stitched together perfectly by our magic. Try it yourself at 2ly.link/27e87

PDFの常識を変える？！🤯✨ 新ツール「OCRFlux」が登場しましたよ！ ✎. FYIG: x.com/aigclink/statu… PDFをMarkdownに変換する新しいツール「OCRFlux」が発表されたそうです！これがもう、すごい機能満載なんですって！一番驚いたのは、PDFでページをまたぐ表や段落を、自動で繋げてMarkdownにしてくれること！😲 これって、今まで手作業で調整していた部分ですよね。論文や長い報告書を扱う方には、革命的な機能なんじゃないでしょうか？！なんと98.3%という驚異の正確率なんですって！✨ しかも、多列レイアウトや、中国語と英語が混ざったドキュメントまで、ちゃんと自然な読み順で変換してくれるそうですよ。グローバルな資料作成にも心強いですね！他の有名なOCRツールと比べても、ベンチマークで圧倒的な性能を示しているとのこと！そして、GTX 3090などのGPUがあれば、3Bパラメータモデルでめちゃくちゃ高速に処理できちゃうそうです！🚀 作業効率がグンと上がりそうでワクワクしますね！実際の変換の様子は動画で見られますが、まるで魔法みたいにサクサク動いていましたよ！これがあれば、PDFからのデータ抽出や再利用が格段にラクになりますよね。例えば、古いPDF資料をMarkdown化して検索しやすくしたり、議事録をPDFでもらってもすぐに編集可能なテキストに変換したり…アイデア次第で色々な使い方ができそうです！詳しい情報や、もしかしたら試せるデモもあるかもしれませんね！ぜひチェックしてみてください！ PDFを扱う皆さんの救世主になるかもしれませんね！これからが本当に楽しみです！😊 #OCRFlux #PDF変換 #AIツール

English

0

1

2

579

ChatDOC รีทวีตแล้ว

Csaba Kissi@csaba_kissi·30 Haz

This PDF-to-Markdown toolkit can convert PDFs and images into clean Markdown, parse complex tables, and much more... Open this 🧵 ↓ Bookmark for later 🔖

English

25

6

60

10.5K

ChatDOC@chatdoc_ai·30 Haz

Horizontally split tables across pages? No problem for OCRflux! It recognizes and merges them seamlessly into one page. Check out👇 Try demo: 2ly.link/27e87 #OCR #pdfparser #pdftomarkdown #OCRFlux #opensourceai

English

0

3

390

ChatDOC@chatdoc_ai·27 Haz

😩Extract tables from PDFs ＝ pure pain? Not anymore! OCRFlux transforms cross-page tables into structured ones, perfectly preserving layouts and auto-removing duplicate headers. Check out the comparison between olmOCR and OCRFlux 👇 Or try it at: 2ly.link/28rzz #OCRFlux #ocr #OpenSource #PDFparser #olmOCR

English

2

0

5

455

ChatDOC

ค้นพบ