Vidit Agrawal
41 posts

@vidit_ag
eng @mercor_ai | prev @databricks @plaid @meta
Joined May 2015
138 Following · 164 Followers
Vidit Agrawal reposted

Kinect (@trykinect) turns every e-commerce store into an AI-powered storefront that actually sells.
As customers shop, online shopping assistants leverage what each customer is looking for in the moment, adapt to every visitor in real time, and capture buying intent data they’ve never had before.
Congrats on the launch, @Kratik_ag & @VarunKand!
ycombinator.com/launches/Q1Q-k…
Vidit Agrawal reposted

X has the best information on the internet and the worst incentives & culture.
meet noscroll — the AI that doomscrolls it for you and texts you just the things that matter.
no feed. no brainrot. no ragebait. just signal.
try it for free → noscroll.com 🙅🏼‍♂️

how it feels when me and my homies PRs are in the merge queue and that shit is not tested
Harry Gao@hrygao
How it feels when me and my homies PRs are in the merge queue

@adarsh_exe @cognition APEX-SWE is a much better proxy for real engineering work than toy coding benchmarks—testing whether models can actually build, ship, and debug working systems. Great to see GPT 5.3 Codex (High) leading at 41.5% Pass@1.

Traditional coding benchmarks do not reflect how software is actually built and maintained.
That's why we built a new benchmark, APEX-SWE, in partnership with @cognition. It measures whether AI models can perform complex, real-world software engineering work to ship systems that work and debug them when they don't.
@OpenAI GPT 5.3 Codex (High) tops the leaderboard at 41.5% on Pass@1.

@BrendanFoody Large language models are absolutely fascinating!

TL;DR OpenAI is now goaling explicitly on automating jobs

OpenAI@OpenAI
Today we’re introducing GDPval, a new evaluation that measures AI on real-world, economically valuable tasks. Evals ground progress in evidence instead of speculation and help track how AI improves at the kind of work that matters most. openai.com/index/gdpval-v0

@rkamalakantha are we aligned that we got 4 more months of greatness
