Ayush Jain

979 posts

Ayush Jain banner
Ayush Jain

Ayush Jain

@ayushaadijain

ceo + cofounder @SyntraSystems (YC S24) @dukeu

San Francisco, CA Katılım Nisan 2023
771 Takip Edilen634 Takipçiler
Ayush Jain retweetledi
sid
sid@sid_mnk·
We're open-sourcing PulseBench-Tab, a frontier benchmark for table extraction. Table parsing remains one of the hardest and most poorly measured problems in document intelligence. TEDS operates on DOM trees and conflates HTML formatting conventions with structural errors. Needleman-Wunsch linearizes a two-dimensional structure into a one-dimensional sequence, so column transpositions can still score well because values align with nearby cells. GriTS uses greedy grid matching rather than optimal assignment and does not distinguish edge directions. The upshot: existing metrics cannot reliably separate content errors from structural errors, which makes provider comparisons noisy and downstream reliability unknowable. Alongside the dataset, our research team developed T-LAG. It parses each table into a cell-position grid, emits directed RIGHT and BELOW adjacency edges (suppressed within spanning cells, deduplicated by source, target, and direction), weights each candidate edge pair by the product of Levenshtein-derived similarities on source and target text, and uses the Hungarian algorithm for globally optimal one-to-one assignment. The F1 over matched edge weight is the T-LAG score. Structure and content are evaluated in one unified pass. HTML formatting choices do not affect the result. Rankings are invariant to the similarity exponent across k ∈ {7, 8, 9, 11}. The dataset contains 1,820 human-annotated tables across 9 languages and 4 scripts (Latin, CJK, Arabic, Cyrillic), drawn from 380 real-world financial filings, government reports, and regulatory disclosures. Tables range from 2 to 1,183 cells; 48.1% contain merged or spanning cells. Ground truth was produced through 8 annotation rounds with native speakers per language, independent cross-lingual review, and adversarial cell-by-cell audits against source images. We evaluated 9 commercial and open-source systems independently across the full dataset under exclude-missing scoring. Selected findings: @Pulse__AI Ultra 2 scores 0.9347 T-LAG; the next closest system scores 0.8155. Pulse Ultra 2 is the only provider with a median of 1.0, corresponding to perfect extraction on 57.9% of samples. Non-Latin scripts produce the widest cross-provider variance. On Arabic, the spread between top and bottom systems exceeds 75 percentage points. Structural hallucinations are pervasive. The second-ranked system achieves a perfect-extraction rate of 28.6%, meaning structural or content errors on 71.4% of tables (fabricated rows, invented content, incorrect span attributes, shifted data). Coverage failure is underreported. Multiple evaluated systems return no output on 19% to 21% of samples. Raw accuracy numbers without coverage disclosure favor selection bias. Thank you to Dushyanth Sekhar and Mohammed Hadi of S&P Global's Enterprise Data Organization for their academic contributions to the benchmark methodology. Dataset: huggingface.co/datasets/pulse… Evaluation: github.com/Pulse-Software… Blog: runpulse.com/blog/pulsebenc… Research methodology: benchmark.runpulse.com/research-report Viewer: benchmark.runpulse.com
sid tweet media
English
7
7
32
6.6K
Philip Johnston
Philip Johnston@PhilipJohnston·
Most important launch of my life… she said yes!!! I secretly wrote my proposal to @Xinyi_Tong1 on our first satellite and then showed her as it passed above us at sunrise in Mexico 😍😍🤓🤓🌹🌹🥰🥰😘😘🤗🤗💎💎🎊🎊💘💘💋💋😻😻
Philip Johnston tweet mediaPhilip Johnston tweet mediaPhilip Johnston tweet media
English
302
248
4.6K
241.3K
Ayush Jain retweetledi
Andy Fang
Andy Fang@andyfang·
Today we are welcoming the Metis team to DoorDash as part of DoorDash AI Research. For the past six months, DoorDash has partnered with Metis to build AI agents together, and we have been consistently impressed by their team. By joining forces, we aim to accelerate our plans on building agentic commerce and pushing the frontier of physical intelligence. Excited to share more there soon. It’s still early innings with how AI will transform local commerce, and we’re looking forward to exploring those possibilities together with Aryan, Aayush, Marcus and the Metis team!
English
27
26
551
420.3K
Ayush Jain retweetledi
Haakam Aujla
Haakam Aujla@haakamaujla·
Announcing AgentMail's $6M Seed led by @generalcatalyst No pressure, right?
English
70
49
452
227.9K
Adi Singh
Adi Singh@adisingh·
We're #3 on Hacker News today! One of the biggest weeks yet for AgentMail
Adi Singh tweet media
English
9
4
77
4.7K
hamza mostafa
hamza mostafa@hamostaf04·
2025 was a special year for me. spent majority of it outside of home and in sf for the first time. lots of mistakes and learnings, but also never had so many highs looking forward to an even more special 2026! happy new year :)
English
3
0
44
1.9K
Vedant Nair
Vedant Nair@vedantnair__·
Our CTO, Ben, has tabbed on @cursor_ai 168,176 times this year. We know this because Cursor sent him an email (with a gift) saying he was one of their power users. If you're doing the math, that's 575 tabs per day! Ben's a pragmatist, so he doesn't like AI generating entire features for him. He thinks the code it generates can be poor and insecure. But he does believe that tabbing to autocomplete saves him SO MUCH time. He uses Tab so much that his pinky hurts, and he often switches to tabbing with his ring finger when he codes at night 😅 At Miru, we use AI to go fast, but never at the expense of writing good, reliable code. That's why we're top of the charts for Cursor Tab, but not full-stack codegen apps like Lovable ;) And check out a physical Cursor Tab they sent to the office!
Vedant Nair tweet mediaVedant Nair tweet media
English
31
15
857
124.5K
Ayush Jain retweetledi
Saathvik Boompelli
Saathvik Boompelli@SaathvikB02·
Excited to announce our $4.3M Seed Round, led by Andreessen Horowitz with participation from Alt Capital (Jack Altman) and First Harmonic (Ali Rowghani) and the public launch of Alleviate Health. Alleviate empowers research sites’ best recruiters to connect with more patients by making their days 10x more efficient. Our SMS and Voice agents engage patients at scale, pass them to recruiters if they pre-qualify via a transferred or scheduled call, and automatically update any CTMS or CRM. We’re already working with 7 of the top 15 research site networks in the country. Our agents have been battle-tested across 500k+ patient interactions, 190+ sites, and 300+ unique trials. They yield the highest conversion rates of any solution on the market due to optimization across every possible patient population, indication, and workflow in the recruitment process. We've tripled our patient volume in the past 3 months and are hiring across all roles. @mandrusko1 @JayRughani @JorgeCondeBio @a16z @a16zBioHealth @jaltma @ROWGHANI @TheManMikeTan @JdotJdotF
English
86
59
1K
284.4K
Ayush Jain
Ayush Jain@ayushaadijain·
Going to change 311 forever!
Y Combinator@ycombinator

🏛️@EffiGov is the Voice AI call center for local governments that answers resident questions, files service requests, and routes calls to the right department. Only 2% of cities can afford a traditional 311 center - and those that can are still overwhelmed. EffiGov fixes that.

English
1
0
10
698
Mayank Jain
Mayank Jain@myk_jain·
A (very) overdue personal update: Super excited to announce that I have joined the founding team at Thrive Holdings (@ThriveCapital) to help build exceptional businesses that we believe can compound in value over many decades.
GIF
English
27
9
617
96K
Ayush Jain retweetledi
sid
sid@sid_mnk·
just saw our @usepylon dashboard sub 1 sla finally @Pulse__AI
sid tweet media
English
3
2
15
777
Tanay Kothari
Tanay Kothari@tankots·
Super psyched to announce our $30M Series A led by @MenloVentures to build the voice interface for the AI era. Join us on the journey at @WisprFlow 🚀: as a user, or a teammate. It's going to be an incredible next few years.
Tanay Kothari tweet media
English
117
46
909
1.9M
Ayush Jain retweetledi
Y Combinator
Y Combinator@ycombinator·
Pulse (@Pulse__AI) has just launched Meridian, an AI-powered financial document processor that can automatically convert any PDF, Word doc, PowerPoint presentation, or image into a structured Excel export with charts and graphs. runpulse.com/blog/introduci… Congrats on the launch, @sid_mnk and @ritvikpandey21!
English
23
24
299
52.8K
Ayush Jain retweetledi
Y Combinator
Y Combinator@ycombinator·
After processing 400M+ pages for the world's largest investment firms, AI startups, and Fortune 500s, @Pulse__AI is launching Ultra: their new hybrid reasoning model. It's the most accurate document extraction model in the industry. Live for all customers today. runpulse.com/blog/introduci… Congrats on the launch, @sid_mnk and @ritvikpandey21!
English
17
31
254
34K
Ayush Jain retweetledi
Y Combinator
Y Combinator@ycombinator·
Congrats to the @octolane team on their $2.6M seed! Octolane's self-driving CRM updates itself and takes action, so sales reps can spend less time on admin and more time closing deals. 100s of teams have made the switch from HubSpot and Salesforce. forbes.com/sites/dariashu…
English
10
10
159
24.5K
Ayush Jain retweetledi
Moonshot
Moonshot@moonshot·
New domain, who dis?
Moonshot tweet media
English
159
22
313
33.1K