Dhruv Agarwal

261 posts

Dhruv Agarwal

@agdhruv

PhD @Cornell. Past: @MSFTResearch, @GoogleDeepMind, @ashokauniv. Sports fan!

Beigetreten Şubat 2013

215 Folgt749 Follower

Angehefteter Tweet

Dhruv Agarwal@agdhruv·16 Ağu

Grad school applications are stressful! I shared *practical* tips from my experience at Microsoft Research last year and my slides were greatly appreciated. To encourage inclusivity in academia, I am making my slides public: agdhruv.github.io/files/practica… #AcademicChatter @ThePhDPlace

English

161

25.1K

Dhruv Agarwal@agdhruv·24 Şub

@SmithaMilli will share once ready!

English

smitha @ iclr@SmithaMilli·24 Şub

@agdhruv ahah I'll be curious to see your experiments!

English

smitha @ iclr@SmithaMilli·23 Şub

Community Alignment v1.1 is out! ✅ +41k comparisons => 233k comparisons total ✅ Improved demographic representativeness => All 5 countries have subsets balanced on age, gender, ethnicity ✅ More natural lang explanations => in 44% of comparisons, users explain their choice huggingface.co/datasets/faceb…

smitha @ iclr@SmithaMilli

Today we're releasing Community Alignment - the largest open-source dataset of human preferences for LLMs, containing ~200k comparisons from >3000 annotators in 5 countries / languages! There was a lot of research that went into this... 🧵

English

2.2K

Dhruv Agarwal@agdhruv·5 Şub

@adityaag Hard relate. Writing code was always my creative expression and it's been disorienting to see it go away. Earlier the joy was in *how* you build stuff, now it's in *what* you build. Wrote more about this feeling as an ode to coding before AI: @agdhruv/an-ode-to-coding-before-ai-3acc35cf95da" target="_blank" rel="nofollow noopener">medium.com/@agdhruv/an-od…

English

464

Aditya Agarwal@adityaag·3 Şub

It's a weird time. I am filled with wonder and also a profound sadness. I spent a lot of time over the weekend writing code with Claude. And it was very clear that we will never ever write code by hand again. It doesn't make any sense to do so. Something I was very good at is now free and abundant. I am happy...but disoriented. At the same time, something I spent my early career building (social networks) was being created by lobster-agents. It's all a bit silly...but if you zoom out, it's kind of indistinguishable from humans on the larger internet. So both the form and function of my early career are now produced by AI. I am happy but also sad and confused. If anything, this whole period is showing me what it is like to be human again.

English

463

1.8K

15.8K

3.3M

Dhruv Agarwal@agdhruv·26 Oca

@TrungTPhan Also check out AI-generated commentary: x.com/agdhruv/status…

Dhruv Agarwal@agdhruv

I've been thinking about AI applications beyond chatbots and coding agents. Here's one we came up with: AI-generated sports commentary! Check it out:

English

512

Trung Phan@TrungTPhan·24 Oca

The Australian Open (AO) has a very underrated use of AI. It doesn’t have full broadcast rights for all matches, so the AO YouTube livestream uses AI to help render Nintendo Wii Tennis cartoon avatars that mimics the action on a 2-minute delay. As a result, this animated clip of Daniel Medvedev smashing his tennis racket on the net from last year’s tourney remains among the best AI-related video outputs to date.

English

2.8K

570.7K

Dhruv Agarwal@agdhruv·25 Oca

The NeurIPS hallucinated citations debacle is a good example of Goodhart's law: When a measure (NeurIPS acceptance) becomes a target (prestige), it ceases to be a good measure (people will do anything to achieve it)

Alex Cui@alexcdot

Okay so, we just found that over 50 papers published at @Neurips 2025 have AI hallucinations I don't think people realize how bad the slop is right now It's not just that researchers from @GoogleDeepMind, @Meta, @MIT, @Cambridge_Uni are using AI - they allowed LLMs to generate hallucinations in their papers and didn't notice at all. It's insane that these made it through peer review👇

English

336

Dhruv Agarwal@agdhruv·11 Ara

AGI will come when models stop using .get() to index into Python dictionaries

English

441

Dhruv Agarwal@agdhruv·10 Ara

@actuallysoham Haha yeah I built the precursor to messcat (which I gather was on WhatsApp instead of Messenger). Not sure if it's still around!

English

Soham De@actuallysoham·10 Ara

@agdhruv I had no idea you had made what we used to call MessCat haha 🤣 I remember maintaining it for a while and I wonder if it’s still in use?

English

Dhruv Agarwal@agdhruv·9 Ara

An Ode to Coding Before AI Like most others, I learned to code in 2014, long before AI coding agents, autocomplete, and such. My friend and I would exchange HTML/CSS/JS tutorials on 1TB hard drives. I’d flip between VLC, Sublime Text, and Chrome on my family’s old HP laptop, trying to center a div or fix a PHP bug I barely understood. We built websites at high school tech competitions without any internet access. In college, I built a lowkey viral Facebook Messenger chatbot that shared our campus food menu and bus timings. It crashed when someone sent it a cat GIF haha. I "vibe-coded" so many other projects (the vibe came from my inexperience) and encountered creative bugs on the way. Every bug was painful, but every bug taught me something: OS quirks, file systems, logs, APIs, threading, the GIL, tensor dimensions. These days, I use Cursor, Claude Code, and all the AI tools. They’re amazing! But I do feel a weird nostalgia. When AI writes so much of the code, you enjoy the product more than the process. As an engineer, I was wired to enjoy the process, but that's changing. AI coding makes me productive, but a part of me misses when coding was slow, messy, and deeply personal...

English

290

Dhruv Agarwal@agdhruv·9 Ara

I also wrote a longer version of this essay in case you're interested :) @agdhruv/an-ode-to-coding-before-ai-3acc35cf95da" target="_blank" rel="nofollow noopener">medium.com/@agdhruv/an-od…

English

393

Dhruv Agarwal@agdhruv·2 Ara

Excited to be at the Alignment Workshop at @NeurIPSConf 2025 to talk about AI safety and alignment. Unexpectedly some of the conversations were about pre-training, not just post-training! Looks like Gemini 3 Pro is making pre-training cool again!!!

English

462

Dhruv Agarwal@agdhruv·8 Kas

AI commentary unlocks new possibilities - Personalized commentary: beginner-friendly, numbers-focused, drama-heavy, biased to your team...your choice! - Interactive: e.g. discuss strategy with the commentator as the game unfolds - Democratizing sports: commentary can drive viewership for smaller/local games Built this with @justachetan and Sushrut

English

144

Dhruv Agarwal@agdhruv·8 Kas

I've been thinking about AI applications beyond chatbots and coding agents. Here's one we came up with: AI-generated sports commentary! Check it out:

English

879

Dhruv Agarwal@agdhruv·5 Kas

@shaily99 Thanks for the thread...exciting times ahead!

English

118

Shaily@shaily99·5 Kas

Happily surprised to see OpenAI curating cultural benchmarks, especially focused on India. BUT, cultural knowledge != culturally aligned generations. My work for 2+ years focuses on cultural competence in generative tasks, like creative writing. Sharing some papers in LONG 🧵

OpenAI@OpenAI

Introducing IndQA — a new benchmark that evaluates how well AI systems understand Indian languages and everyday cultural context. openai.com/index/introduc…

English

6.4K

Dhruv Agarwal@agdhruv·27 Eki

@vinodgansan @perplexity_ai @OpenAI My cognitive investment at this point is for Chrome, not Chromium. Investment = muscle memory, knowing exactly what it where, down to ~pixel level. If I switch to Comet or Atlas, I lose that flow, and I gain very little (and Chrome will cover up even those small gains soon)

English

Vinod Ganesan@vinodgansan·27 Eki

@agdhruv @perplexity_ai @OpenAI If it’s base + delta, where base being chromium, what cognitive investment are you exactly leaving behind. Resistance to delta I can understand, but the base part I’m unclear.

English

Dhruv Agarwal@agdhruv·26 Eki

My hot take on AI browsers: if it's Chromium-based (@perplexity_ai @OpenAI), I'll try it and move back to Chrome. Too much cognitive investment to leave behind (and Chrome will ship the same features anyway). The only way to dethrone Chrome is to rethink browsing – from scratch!

English

366

Dhruv Agarwal@agdhruv·27 Eki

@anupamsobti @perplexity_ai @OpenAI Doesn't it slow you down for less than proportionate productivity gains? My browsing on Chrome is muscle memory. Minor design changes inhibit that flow without much gains

English

Anupam Sobti@anupamsobti·27 Eki

@agdhruv @perplexity_ai @OpenAI I actually like to leave browsers behind, just to start from scratch. Sure, you have a few extensions to set up, but that’s usually all I need to do. Liking my move to atlas so far. Good suggestions and lower barrier to ask GPT.

English

Dhruv Agarwal@agdhruv·9 Ağu

@cleoabram Great question @cleoabram Our paper earlier this year tested this question exactly! arxiv.org/abs/2409.11360

English

Cleo Abram@cleoabram·9 Ağu

In prep for my interview with Sam Altman, I reached out NVIDIA CEO Jensen Huang for a question. Here's what he asked: "Fact is what is. Truth is what it means. Facts are objective. Truths are personal – i.e., depends on perspective, culture, values, beliefs, context. One AI can learn and know the facts. How does one AI know the truth for everyone, in every country, and every background?" Here's the start of Sam's answer (more in the full interview):

English

909

181.1K

Dhruv Agarwal@agdhruv·7 Ağu

If you code in python and aren't using uv yet, this is your sign to jump ship! It's so fast and fun...I look forward to installing dependencies now lol @astral_sh

English

447

Dhruv Agarwal retweetet

Manoel@manoelribeiro·1 Haz

This @acm_chi paper finds that AI assistance homogenized writing in a controlled experiment with American and Indian participants (n=118). With a classifier, it was harder to predict writer origin without AI (83.5% vs. 90.6%). dl.acm.org/doi/abs/10.114…

English

3.1K

Dhruv Agarwal@agdhruv·7 Nis

@ishaan_jaff @yash_347 litellm's focus is models, not tasks. eg for a simple translation task, one could abstract out the message list setup & make it ready to go. Also it has too many features & the docs are a bit distracting. Wanted to make something beginner-friendly. litellm is great overall tho!

English

Ishaan@ishaan_jaff·7 Nis

@agdhruv @yash_347 Curious what were the challenges setting up litellm @agdhruv ?

English

Dhruv Agarwal@agdhruv·2 Nis

🚀 Introducing MiniAI - AI made ridiculously simple! Sick of OpenAI boilerplate? Tired of writing complex chains to do simple AI tasks? MiniAI is a minimal, flexible solution for byte-sized AI tasks. pip install miniai Examples ⬇️

English

Entdecken

@SmithaMilli @adityaag @TrungTPhan @actuallysoham @NeurIPSConf @justachetan @shaily99 @vinodgansan