Rayan Garg

11 posts

Rayan Garg

Rayan Garg

@RayanGarg

co-founder @trytheta, prev @Cornell CS

Ithaca, NY Katılım Şubat 2023
115 Takip Edilen222 Takipçiler
Ritvik Pandey
Ritvik Pandey@ritvikpandey21·
the team at @Pulse__AI put bytedance's dolphin OCR to the test against complex documents that matter for real business use cases. while it shows improvements in reading order detection, we found critical limitations across key areas: - 7.7% structured data extraction from financial charts - 60% accuracy drop without grid lines, 31% hierarchy preservation rate, and boundary confusion across multi-table documents. credit to the bytedance team for the bounding box improvements, which genuinely outperform many open-source alternatives. specialized document processing systems still remain essential where accuracy and structure preservation are non-negotiable. we're excited to see continued innovation in open source document AI - this is an important problem to solve.
Ritvik Pandey tweet media
English
5
3
18
1.3K
Rayan Garg retweetledi
Theta
Theta@trytheta·
Browser agents use computers the same way humans do, unlocking powerful use cases for personal assistants, browsers, and enterprise workflows. After talking to 20+ founders in the space, we're excited to put out the definitive market map for browser agents.
Theta tweet media
English
28
86
586
102.8K
Aravind Srinivas
Aravind Srinivas@AravSrinivas·
expanding comet access to more folks as we near the finish line. reply here if interested. needs to be MacBook. Preferably Apple silicon but Intel is fine too. DM your email that you use for perplexity.
English
899
29
1.5K
175.1K
Y Combinator
Y Combinator@ycombinator·
Vesence is Cursor for Transactional Lawyers. They're building Agentic AI inside of Microsoft Office, turning Word into an IDE for contracts. Vesence is already live firm-wide with their first enterprise client, loved by both associates and partners. ycombinator.com/launches/NYo-v… Congrats on the launch, @HenrikTaro and @LudvigSwanstrom!
English
17
17
152
70K
Rayan Garg retweetledi
Theta
Theta@trytheta·
Introducing CUB: Humanity's Last Exam for Computer and Browser Use Agents
Theta tweet media
English
32
39
251
113.3K
Joseph Semrai
Joseph Semrai@josephsemrai·
Meet Context Autopilot It learns like you, thinks like you, and uses tools like you. With SoTA context understanding, it's capable of most information work today. Watch it beat a team of industry experts:
English
594
1K
4.5K
4.6M
neha 🔲
neha 🔲@nehadesaraju·
peak sf is a guy asking me if it was worth it talking to his grandparents because he had a more productive conversation with a vc partner today than he's ever had with his grandparents
English
24
14
749
73.9K