Darshan Singh

47 posts

Darshan Singh banner
Darshan Singh

Darshan Singh

@thought2vec

Research @GoogleDeepMind | @iiit_hyderabad

Inscrit le Ekim 2021
534 Abonnements152 Abonnés
Tweet épinglé
Darshan Singh
Darshan Singh@thought2vec·
Super excited to announce that our work got accepted to CVPR'25! Here is a teaser for now, stay tuned for a detailed thread 😉
Darshan Singh tweet media
Makarand Tapaswi@MakarandTapaswi

Happy to share that we have 1/1 paper accepted at #CVPR2025! Details soon 😌🎉 After a rocky 2024 with multiple close rejections, it has been a good start to 2025 with 1 WACV, 1 TMLR, 1 ICASSP, 1 NAACL (short), 1 ISBI, and now 1 CVPR. Moving on to other things for #ICCV2025 ...

English
1
2
26
2.3K
Darshan Singh retweeté
Harman Singh
Harman Singh@Harman26Singh·
Can LLMs Self-Verify? Much better than you'd expect. LLMs are increasingly used as parallel reasoners, sampling many solutions at once. Choosing the right answer is the real bottleneck. We show that pairwise self-verification is a powerful primitive. Introducing V1, a framework that unifies generation and self-verification: 💡 Pairwise self-verification beats pointwise scoring, improving test-time scaling 💡 V1-Infer: Efficient tournament-style ranking that improves self-verification 💡 V1-PairRL: RL training where generation and verification co-evolve for developing better self-verifiers 🧵👇
English
13
62
369
77.7K
Darshan Singh retweeté
Makarand Tapaswi
Makarand Tapaswi@MakarandTapaswi·
Our paper was desk rejected @NeurIPSConf! Even before the main deadline! "Non-academic title and abstract" 🙈 Thankfully, @SIGGRAPHAsia was around the corner and a perfect fit for our work on improving robustness of multi-subject multi-attribute layout-guided T2I models! 🧵1/9
English
5
11
92
27.3K
Darshan Singh retweeté
Kiana Ehsani
Kiana Ehsani@ehsanik·
Researchers consider themselves very successful if they win one test-of-time award (and one is more than enough). Ross @inkynumbers has been winning them nonstop over the past year: CVPR 2024, ICCV 2025, and now NeurIPS 2025, because winning just one was too easy for him! Having known him for many years (first as a climbing partner and then as a colleague), I can’t say I’m surprised. When he sets his mind to something, he perfects it, whether it is making the best vision model, climbing a 5.12d, or continuing the sally-up sally-down push-up challenge until the rest of the team gives up. And to all his collaborators who only worked with him remotely and didn’t get to see him in person every day: you missed out. He is fun to work with but he is even more fun in person. I'm attaching the proof below. I have some true gem videos of his goofy side that I won’t share (saving them for when I need to blackmail him), but here is a photo of Ross pretending to be a lizard under our office sun lamp. Congratulations to Ross and all his co-authors. #NeurIPS2025
Kiana Ehsani tweet mediaKiana Ehsani tweet mediaKiana Ehsani tweet mediaKiana Ehsani tweet media
English
1
12
164
28.8K
Darshan Singh retweeté
Harman Singh
Harman Singh@Harman26Singh·
Late life update 🚀 I started my PhD at @UCBerkeley after an incredible time at @GoogleDeepMind. It was exciting to work on Gemini over the past couple of years. These days I am interested in reasoning/improving RL, agents, and diffusion language models. Looking forward to contributing to open science. Also thrilled to be back in the Bay Area. Grateful to mentors, collaborators, and folks who supported me, @partha_p_t @PengchuanZ @nitish_gup @trevorcohn @xiangrenNLP @divy93t @ManishGuptaMG1, Parag Singla, friends, and family. Excited to be at @NeurIPSConf #NeurIPS2025 this week. Looking forward to meeting folks. Feel free to DM if you'd like to chat!
Harman Singh tweet mediaHarman Singh tweet media
English
23
11
615
45.9K
Nithish Kannen
Nithish Kannen@NithishKannen·
At 15, it was my dream to become a professional athlete. Nano Banana Pro 🍌 brought it to life - the character consistency and finer details are just insane!! "generate a visual story of this person <insert image> becoming an Olympic 100m champion"
Nithish Kannen tweet media
Mountain View, CA 🇺🇸 English
1
0
6
480
Darshan Singh retweeté
JB Alayrac
JB Alayrac@jalayrac·
Really proud of what we have achieved with Gemini 3 🚀! The Gemini MM team has worked relentlessly across image 🖼️ and video 🎥 from pre-training to post-training to simply deliver the best multimodal in the world 👏! Looking forward to what you will build🫡!
JB Alayrac tweet media
English
8
16
217
32.8K
Darshan Singh retweeté
Nithish Kannen
Nithish Kannen@NithishKannen·
The calm after the storm!
Nithish Kannen tweet media
Mountain View, CA 🇺🇸 English
0
1
4
475
Darshan Singh retweeté
Bonnie Li
Bonnie Li@bonniesjli·
the calm before the storm @googledeepmind hq ✨✨✨
Bonnie Li tweet media
English
31
55
2.8K
199.6K
Darshan Singh retweeté
Google DeepMind
Google DeepMind@GoogleDeepMind·
This is Gemini 3: our most intelligent model that helps you learn, build and plan anything. It comes with state-of-the-art reasoning capabilities, world-leading multimodal understanding, and enables new agentic coding experiences. 🧵
English
215
1.1K
6.5K
1.7M
Darshan Singh retweeté
stepfanie tyler
stepfanie tyler@stepfanie·
imagine having a sunday this good
stepfanie tyler tweet media
English
86
6.7K
144.5K
1.8M
Darshan Singh retweeté
Makarand Tapaswi
Makarand Tapaswi@MakarandTapaswi·
Happy that @gaur_manu and @thought2vec have this opportunity for our work on revisiting self-retrieval for fine-grained image captioning! Thread 🧵: x.com/gaur_manu/stat… arXiv: arxiv.org/abs/2409.03025 Project page: katha-ai.github.io/projects/no-de…
Hugo Larochelle@hugo_larochelle

We at TMLR are proud to announce that selected papers will now be eligible for an opportunity to present at the joint NeurIPS/ICML/ICLR Journal-to-Conference (J2C) Track: @TmlrOrg/tmlr-joins-neurips-icml-iclr-journal-to-conference-track-937a898eab3d" target="_blank" rel="nofollow noopener">medium.com/@TmlrOrg/tmlr-…

English
0
1
11
791
Darshan Singh retweeté
Yi Ma
Yi Ma@YiMaTweets·
The goal of any good theory or new knowledge is to reduce uncertainty or entropy of a field. Given how many papers published each year related to machine/artificial intelligence and how many different opinions about this subject, I hope our new book is a denoiser, not a diffuser.
English
11
15
176
13.5K
Darshan Singh retweeté
Manu Gaur
Manu Gaur@gaur_manu·
Excited to share I have joined CMU’s Robotics Institute as a Master’s student! The past few months were tough with visa uncertainties, but I’m happy to be finally here 😇 I look forward to working on native visual reasoning + other cool RL and VLM stuff 🚀 If you’re around campus, let’s hang out!
Manu Gaur tweet media
English
24
6
334
24.3K
Darshan Singh retweeté
Nithish Kannen
Nithish Kannen@NithishKannen·
👀
Nithish Kannen tweet media
QME
1
2
13
1.2K
Darshan Singh retweeté
IIIT Hyderabad
IIIT Hyderabad@iiit_hyderabad·
A large contingent from IIITH’s Computer Vision Lab participated at the Conference on Vision and Pattern Recognition (CVPR) last month in Nashville. Read on about the cutting edge research that was presented and why it’s a big deal in the vision circles. blogs.iiit.ac.in/cvpr-2025/
IIIT Hyderabad tweet media
English
0
11
58
4.1K
Darshan Singh
Darshan Singh@thought2vec·
Checkout this amazing work by my colleagues @Harman26Singh, @Pragya2k et al, on building robust LLM reward models by disentangling true attributes from spurious ones!!
Harman Singh@Harman26Singh

🚨 New @GoogleDeepMind paper 𝐑𝐨𝐛𝐮𝐬𝐭 𝐑𝐞𝐰𝐚𝐫𝐝 𝐌𝐨𝐝𝐞𝐥𝐢𝐧𝐠 𝐯𝐢𝐚 𝐂𝐚𝐮𝐬𝐚𝐥 𝐑𝐮𝐛𝐫𝐢𝐜𝐬 📑 👉 arxiv.org/abs/2506.16507 We tackle reward hacking—when RMs latch onto spurious cues (e.g. length, style) instead of true quality. #RLAIF #CausalInference 🧵⬇️

English
0
0
5
283
Darshan Singh retweeté
Zeeshan khan
Zeeshan khan@zeeshank95·
Can text-to-image Diffusion models handle surreal compositions beyond their training distribution? 🚨 Introducing ComposeAnything — Composite object priors for diffusion models 📸 More faithful, controllable generations — no retraining required. 🔗zeeshank95.github.io/composeanythin… 1/9
Zeeshan khan tweet media
English
2
9
24
1.8K
Darshan Singh retweeté
Makarand Tapaswi
Makarand Tapaswi@MakarandTapaswi·
🔔New @CVPR paper evaluating compositional reasoning of Video-LLMs on 10s, action-packed clips! 🥁 VELOCITI features 7 tests to disentangle and assess the comprehension of people, actions, and their associations across multiple events. katha-ai.github.io/projects/veloc… 🧵 1/9 #CVPR2025
Makarand Tapaswi tweet media
English
1
10
58
3.4K