Tanmay Parekh

275 posts

Tanmay Parekh

@tparekh97

Katılım Nisan 2020

510 Takip Edilen884 Takipçiler

Sabitlenmiş Tweet

Tanmay Parekh@tparekh97·1 May

Life Update: Defended my PhD thesis titled “Towards Universal Event Extractions”, where I explored agentic workflows and synthetic data generation for IE. 👨‍🎓🎉 Sincere thanks to my advisors @kaiwei_chang and @VioletNPeng, and my committee @WeiWang1973 and @adityagrover_.

English

4.4K

Tanmay Parekh retweetledi

Mustafa Suleyman@mustafasuleyman·9 Tem

Introducing Ode Poetry. Ode is a wonderful poetry pharmacy that reads you a poem for the moment you’re in. Just tell Ode what you're feeling, and it uses Microsoft AI audio models to connect you with the same work that poetry expert William Sieghart would recommend. The best technology doesn't replace human creativity, it helps more people experience it. Super proud of the team for making this truly humanist tool. More in the blog: microsoft.ai/news/ode-poetr…

English

169

439

525.2K

Tanmay Parekh@tparekh97·6 Tem

While I’m not able to attend #ACL26, please meet Sachith and learn about our text-to-SQL PExA framework!

Tech At Bloomberg@TechAtBloomberg

@aclmeeting @iSchoolUI @HaohanWang @FIUSCIS @BoiseState @junzhuang_ In Session 14 (6 PM PT) Sachith Sri Ram Kothur delivers a talk about "#PExA: Parallel Exploration Agent for Complex Text-to-SQL," work done with Bloomberg Data Science Ph.D. Fellow @tparekh97 of @UCLAnlp, @hc_ella, Shuyi Wang & @YunmoChen bloom.bg/4eIfO7c #ACL2026NLP (5/7)

English

494

Tanmay Parekh retweetledi

Nanyun (Violet) Peng @ ACL26@VioletNPeng·5 Haz

My first paper at Google is out! Thank you @rohanpaul_ai for highlighting LEAP. To share more thoughts on this direction: I strongly believe that as models generate longer and more complex proofs, automatic formal verification will be the key to the future of AI for math, and I'm bullish on using general LLMs + agentic framework for this task. As we started with competition math in LEAP for rigorous benchmarking purposes, we've already started to venture into research math. - Solved Erdős problem 527 (zero web search). - Partially formalized Knuth's cycle problem even case which resulted in ~4000 lines of Lean code. Please check out all of our solutions here: github.com/google-deepmin… I'm incredibly proud of this work, and we are just getting started. More to come!

Rohan Paul@rohanpaul_ai

Another great paper from Google. Shows general LLMs can solve formal math by planning proofs and checking each step. Raised general LLM performance from under 10% to 70%. A general LLM failed badly when asked to write full formal proofs in 1 try, but became much stronger when it planned, split the work into smaller claims, reused past claims, and learned from Lean’s feedback. The paper shows the weakness was not just the model’s math ability, but the way it was being used - the absence of structured interaction with a verifier. The key idea is that the model does not try to write one giant perfect proof at once, because that usually fails on long and tricky problems. Instead, LEAP stores the proof as a graph of goals and subgoals, so useful lemmas can be reused instead of rediscovered every time. The authors tested LEAP on Putnam 2025 and a new Lean benchmark built from 60 IMO-style problems, where ordinary one-shot proof writing did very poorly. LEAP solved all 12 Putnam 2025 problems and raised general LLM performance on the Lean IMO benchmark from under 10% to 70%. ---- Link – arxiv. org/abs/2606.03303 Title: "LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks"

English

213

72.3K

Tanmay Parekh retweetledi

Mustafa Suleyman@mustafasuleyman·2 Haz

Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier. First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks. - It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities. - It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks. - And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end. Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing. Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI. - Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost. All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat. Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost. Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare. Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: microsoft.ai/news/building-…

English

192

533

3.8K

1.3M

Tanmay Parekh retweetledi

Denis Savenkov@DenXX·12 May

Really excited the share one new Perceptron Mk1 model, significantly improved perception and now with video understanding.

Perceptron AI@perceptroninc

Today we're releasing Perceptron Mk1: frontier video and embodied reasoning.

English

11.4K

Tanmay Parekh retweetledi

Kai-Wei Chang@kaiwei_chang·9 May

I’ve recently been promoted to Full Professor at UCLA 🎉 It’s been a long journey, with many tears, laughs, and surprises along the way. When I was working on linear models 20 years ago, I couldn’t have imagined we’d be building trustworthy AI agents today. I feel incredibly fortunate and deeply grateful to my research group, mentors, collaborators, and students who have made this journey so meaningful. I still remember the moment of hooding each of my PhD students. Those are the happiest moments in my career. Many thanks as well to my family, colleagues, and friends for their support. Looking forward to the next chapter. For those interested, check out our recent work: web.cs.ucla.edu/~kwchang/ Photo: a decade after graduation

English

669

40.6K

Tanmay Parekh@tparekh97·2 May

@BrihiJ @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Thanks a lot! 😇

English

Brihi Joshi@BrihiJ·2 May

@tparekh97 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Congrats Tanmay!!!

English

Tanmay Parekh@tparekh97·1 May

English

4.4K

Tanmay Parekh@tparekh97·2 May

@lupantech @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Thank you so much Pan!

English

Pan Lu@lupantech·2 May

@tparekh97 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Congrats, Tanmay!! 👏

English

200

Tanmay Parekh@tparekh97·1 May

@IHung_Hsu @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Thanks a lot @IHung_Hsu!

English

I-Hung Hsu@IHung_Hsu·1 May

@tparekh97 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Congrats!

English

Tanmay Parekh@tparekh97·1 May

@hgznnn___ @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Thank you so much 😇

English

125

Youze "Hargen" Zheng@hgznnn___·1 May

@tparekh97 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Congrats Tanmay!!!

English

Tanmay Parekh@tparekh97·1 May

@jieyuzhao11 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Thanks a lot Jieyu! 😇

English

Jieyu Zhao@jieyuzhao11·1 May

@tparekh97 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ congrats!

English

204

Tanmay Parekh@tparekh97·1 May

@TuhinChakr @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Thank you Tuhin! 😇

Indonesia

Tuhin Chakrabarty@TuhinChakr·1 May

@tparekh97 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Great picture :-) Congrats Tanmay

English

157

Tanmay Parekh@tparekh97·1 May

@danish037 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Thank you so much! 😇

English

Danish Pruthi@danish037·1 May

@tparekh97 @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ Congratulations @tparekh97!!

English

298

Tanmay Parekh@tparekh97·1 May

@VioletNPeng @kaiwei_chang @WeiWang1973 @adityagrover_ Thank you so much for all your mentorship and guidance throughout my PhD! 🙏🏻

English

121

Nanyun (Violet) Peng @ ACL26@VioletNPeng·1 May

@tparekh97 @kaiwei_chang @WeiWang1973 @adityagrover_ Congratulations! I’m so proud!!

English

266

Tanmay Parekh@tparekh97·1 May

@adityagrover_ @kaiwei_chang @VioletNPeng @WeiWang1973 Thank you so much 😇

English

Aditya Grover@adityagrover_·1 May

@tparekh97 @kaiwei_chang @VioletNPeng @WeiWang1973 Congratulations @tparekh97!

English

335

Tanmay Parekh retweetledi

Kuan-Hao Huang@kuanhaoh_·5 Nis

The first-ever Texas NLP Symposium wrapped up yesterday! 🎉🎉🎉 Huge thanks to all the speakers and attendees for making it a huge success. I hope everyone had a great time. Stay tuned for info on next year! #TexasNLP Check photos and highlights here: photos.app.goo.gl/AhuqKfKDXHyUQw…

English

5.5K

Tanmay Parekh retweetledi

Lucas Bandarkar@LucasBandarkar·31 Mar

Large Reasoning Models Struggle to Transfer Parametric Knowledge Across *Scripts* ! We study very large thinking models (100B+) and find above all that for rare, local knowledge, the perceived language transfer barrier is actually a script barrier 🧵1/3 arxiv.org/abs/2603.17070

English

Tanmay Parekh@tparekh97·18 Şub

Check out this amazing 5 min read blog by @hbXNov ! Simple and elegant!

Hritik Bansal@hbXNov

New blog 📢 Can we extract dense advantages without new annotations or models in GRPO? The answer is YES! 💡Answer correctness splits rollouts into positives and negatives. Just upweight positive tokens which differ significantly from the negative tokens! 🧵👇

English

391

Tanmay Parekh@tparekh97·22 Oca

Join us to hear from @ChrisGPotts about his discoveries in memorization!

uclanlp@uclanlp

For the UCLA NLP seminar talk this Friday, we are thrilled to host Prof. Christopher Potts @ChrisGPotts from Stanford @stanfordnlp ! Title: “The Archai of Palimpsestic Memorization” When: 2–3 PM (PST), Friday, Jan 23 Registration: ucla.zoom.us/meeting/regist…

English

316

Keşfet

@rohanpaul_ai @BrihiJ @kaiwei_chang @VioletNPeng @WeiWang1973 @adityagrover_ @lupantech @IHung_Hsu