Nanonets


Nanonets@nanonets·3d

Nanonets OCR-3 is live. This is the most accurate OCR model in the world currently.
87.4 on OLM-OCR (Global #1)
85.9 on IDP Leaderboard (Global #1)
90.5 on OmniDocBench
OCR-3 also ships with two critical features that foundational models and VLMs miss today - confidence scores and bounding boxes.


Nanonets@nanonets·3d

Nanonets OCR-3 is the only OCR model you'll need in your agentic stack. The model API exposes five endpoints -
/parse - structured markdown
/extract - structured outputs in your schema
/split - classify or split outputs based on content
/chunk - context-aware chunks optimized for RAG
/vqa - grounded answers with bboxes over sources
We've specifically fine-tuned the model on edge cases where OCR repeatedly fails - complex tables, forms, non-trivial layouts.


Nanonets@nanonets·3d

With bounding boxes, you get exact coordinates for every extracted element. Use them for -
1. RAG citations
2. Feeding specific document regions to agents
3. Agent observability
With confidence scores, you can measure reliability of every extraction. Pass high-confidence outputs directly, route low-confidence outputs to human review or a larger model. Use them to push your net accuracy to near 100%.
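The routing idea in the post above can be sketched roughly as follows. This is a minimal illustration, not the Nanonets API: the `Extraction` record, its field names, the `[0, 1]` confidence range, the bounding-box layout, and the 0.9 threshold are all assumptions made for the example. The `net_accuracy` helper is simply a weighted average of the two paths.

```python
from dataclasses import dataclass


@dataclass
class Extraction:
    """Hypothetical shape of one extracted field (not the real API response)."""
    field: str
    value: str
    confidence: float  # assumed to lie in [0, 1]
    bbox: tuple[float, float, float, float]  # assumed (x0, y0, x1, y1)


def route(extractions, threshold=0.9):
    """Split extractions into auto-accepted and needs-review lists."""
    accepted, review = [], []
    for item in extractions:
        # High-confidence items pass straight through; the rest are queued
        # for a human reviewer or a larger model.
        (accepted if item.confidence >= threshold else review).append(item)
    return accepted, review


def net_accuracy(auto_share, auto_accuracy, review_accuracy):
    """Weighted accuracy when a share of items is auto-accepted
    and the remainder goes through review."""
    return auto_share * auto_accuracy + (1 - auto_share) * review_accuracy


if __name__ == "__main__":
    # Illustrative values only.
    items = [
        Extraction("total", "42.00", 0.98, (0.1, 0.8, 0.3, 0.85)),
        Extraction("date", "2024-01-01", 0.55, (0.1, 0.1, 0.3, 0.15)),
    ]
    accepted, review = route(items)
    print([e.field for e in accepted], [e.field for e in review])
    print(net_accuracy(auto_share=0.8, auto_accuracy=0.99, review_accuracy=1.0))
```

The threshold trades review workload against error rate: raising it sends more items to review, which helps net accuracy only to the extent that the review path is more accurate than the auto-accepted one.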


Nanonets retweeted

Vinit Mehta@winitmehta·3 Mar

x.com/i/article/2028…


Nanonets@nanonets·19 Feb

x.com/i/article/2024…


Nanonets@nanonets·17 Feb

#changelog Introducing AI Agent Guidelines. 👉 changelog.nanonets.com/introducing-ai…


Nanonets@nanonets·14 Oct

Introducing Nanonets-OCR2: a lightweight 3B VLM that transforms documents into clean, structured Markdown and is capable of VQA. We have trained the model on close to 3 million documents. It is multi-lingual and can handle handwritten documents. Live demo: docstrange.nanonets.com HF: huggingface.co/nanonets/Nanon… Blog: nanonets.com/research/nanon…


syoyo.eth 🌸 レイトラ ® 🐯 8 周年 🎉@syoyo·15 Jun

I want to run nanonets-ocr-s on a smartphone with vision-language.cpp, so could some talented young VLM folks please hurry up and come pouring out? 🥺👊


James M. Tucker, Ph.D.@James_M_Tucker·4 Aug

@mervenoyann AWS Textract and Nanonets were on par with each other.


merve@mervenoyann·3 Aug

a question for y'all: which PDF renderers do you use (other than Docling, SmolDocling and R/OlmOCR)? why do you prefer that over these ones? 👀


Joana Levtcheva@13_jo_jo_13·3 Aug

I just published Nanonets OCR: A Small Gem for Handwritten Notes. As always, the testing was done with mlx-vlm. medium.com/p/nanonets-ocr… #mlx #mlx_vlm #OCR


Siddharth Dwivedi@Naamhaisidu·18 Mar

Does anyone have any contacts in Nanonets - Bangalore ??


Paul Fadieiev@pavlo_fadieiev·17 Jun

Just tried the new Nanonets-OCR-s. Works really well!
- Small size: just 3.75B parameters, works on RTX 3060 without quantization.
- Recognizes equations and tables! And tables with equations (!)
- Is multilingual!
- Outputs descriptions of the images
- Outputs in Markdown format


Mi imamo knjigu za vas@kombib·21 Jun

Nanonets-OCR-s represents a major step forward compared with classic OCR (optical character recognition) tools. While most OCR systems only recognize and transcribe text from images, Nanonets-OCR-s structures documents in a way that is optimized for downstream processing by large language models (LLMs) such as GPT, Claude, and Gemini.


Nanonets@nanonets·12 Aug

@alexmcaulay We'd like to throw our hat in the ring


alexmcaulay@alexmcaulay·5 Jul

We are testing document parsing engines right now for a major project and going to report back on our findings. We are testing Docling, N8N, MarkITDown, LlamaParse, Mistral, Rossum, Veryfi, Google Document AI, Amazon Textract. Going to give a really good breakdown of everything for you. Anything else we should test?


ChatDOC@chatdoc_ai·19 Jun

🥳OCRFlux shines in accuracy! It achieves significantly higher Edit Distance Similarity (EDS) and Tree Edit Distance-based Similarity (TEDS) scores compared to #olmOCR and #Nanonets. Try our demo: 2ly.link/28rkt #OCRFlux #OCR #Benchmark


Hacker News 50@betterhn50·16 Jun

Nanonets-OCR-s – OCR model transforms documents into structured markdown huggingface.co/nanonets/Nanon… (news.ycombinator.com/item?id=442870…)


Sriram@srizzler·30 Jul

@karthikreddy95 @venky4a @miryalasrikanth Nanonets is the best IDP out there in terms of accuracy and pricing. PS: I've personally evaluated their APIs using Indian regional invoices (the toughest of all).


Srikanth Miryala@miryalasrikanth·30 Jul

Another advantage of being a doctor: writing out the pills that another doctor scribbled down, in a way the family at home can understand.


Antony Barroux@Blogalto·19 Jun

📄 Nanonets has just released Nanonets-OCR-s, a revolutionary AI model that turns your docs (images, PDFs) into well-structured Markdown. It handles LaTeX equations, complex tables, signatures, and more! Follow this thread to learn more! 👇 #NanonetsOCR #AI #Tech


Nick Levine@status_effects·31 Jul

@andersonbcdefg @hoanhle_ In my tests I found rolmocr (reducto’s version of olmocr) and nanonets worked best fwiw (much better than marker). Everything whiffs on math expressions despite the latex support


Ben (no treats)@andersonbcdefg·31 Jul

it's shocking how terrible the current state of OCR is given how many companies are working on doc intelligence AND that we are supposedly "almost at AGI"
