Niket Patel

22 posts

@niketnpatel

Deep Learning & Math, phding @ NYU (Prev. UCLA)

Joined November 2023
189 Following, 29 Followers
Niket Patel retweeted
vixhaℓ (@TheVixhal)
Most people with “AI/ML” in their bios don’t even know a real symmetric matrix always has real eigenvalues.
92
75
1.5K
104.1K
Niket Patel retweeted
Ariel (@redtachyon)
Can we train LLMs with RL using the same next token prediction loss as pre-training? (yes) We conduct a study on (log)prob rewards and show they give a simple way to bridge verifiable and non-verifiable settings with a single reward, broadly applicable for fine-tuning LLMs.
5
22
163
9.3K
Niket Patel retweeted
Shobhita Sundaram (@shobsund)
Can a model learn to break its own reasoning plateau? In our new paper, we show that LLMs can be taught with meta-RL to generate their own "stepping stones" that kickstart learning on hard math problems (0/128 success rate) where direct RL fails. Paper 📝: arxiv.org/abs/2601.18778 Blog post 🌐: ssundaram21.github.io/soar/ (1/n)
20
108
663
91.3K
Niket Patel retweeted
Mark Ibrahim (@marksibrahim)
Want to teach AI agents to use apps like humans? Get started with digital agents research using OpenApps, our new Python-based environment.
1
10
28
9.6K
Niket Patel retweeted
Julia Kempe (@KempeLab)
I will be recruiting 1-2 PhD students at @NYUDataScience or @NYUCourant CS to work on Machine Learning & applications in NYU's vibrant top ML ecosystem. Check Google Scholar to see our latest research interests. Interested? Please mention my name in your application. Deadl. 12/12
5
79
333
27.2K
Niket Patel retweeted
Dhrupad Bhardwaj (@dhrupadbhardwaj)
New paper alert! When LLMs write long responses to open-ended questions, how can you tell if they're hallucinating - without checking every single claim? We found something interesting hidden *inside embeddings*. arxiv.org/pdf/2510.21891 With @timrudner @KempeLab @NYUDataScience Thread 🧵 1/5
1
4
10
657
Niket Patel retweeted
Yunzhen Feng (@feeelix_feng)
Current GRPO wastes compute on negative groups — when all K samples are wrong, you get zero gradient despite full generation cost. We propose a principled fix by bridging reward modeling and policy optimization: 👉 Penalize highly confident wrong answers more to create signal.🧵
7
41
341
39.9K
Niket Patel retweeted
Nikos Tsilivis (@nikostsilivis)
RL has led to amazing advances in reasoning domains with LLMs. But why has it been so successful, and why does the length of the response increase during RL? In new work, we introduce a framework to provide conceptual and theoretical answers to these questions.
2
14
62
5K
Niket Patel retweeted
Sander Dieleman (@sedielem)
Measuring information content in bits is very useful. Information theory made digital communication, cryptography and machine learning possible. But information is not just a quantity: it also has a shape. (1/6)
20
116
1.3K
139.9K
Niket Patel (@niketnpatel)
@ziv_ravid A podcast where you only interview graduate students called "grad student descent"
1
0
5
470
Ravid Shwartz Ziv (@ziv_ravid)
We want to start a podcast about cutting-edge AI research and technical breakthroughs. Need a catchy name! What would you call it? The one who suggests the best name will be our guest 🥳
119
7
236
31.5K
Niket Patel retweeted
Damek (@damekdavis)
anyway, none of this is possible because the data processing inequality
8
5
68
10.7K
Niket Patel retweeted
Alan Jeffares (@Jeffaresalan)
Our new ICML 2025 oral paper proposes a new unified theory of both Double Descent and Grokking, revealing that both of these deep learning phenomena can be understood as being caused by prime numbers in the network parameters 🤯🤯 🧵[1/8]
13
75
941
130.5K
Niket Patel retweeted
Ravid Shwartz Ziv (@ziv_ravid)
Hi, happy to share that our paper "Layer by Layer: Uncovering Hidden Representations in Language Models" was accepted to ICML as a spotlight! 🥳🥳🥳 Oscar Skean, @rarefin15, DanZhao, Jalal Naghiyev and @ylecun
18
48
407
44.2K
Mathieu (@miniapeur)
I'm looking for a mathematics book full of problems to solve and, ideally, their solutions. I want to start solving at least one problem a day and it's easier if I can just browse through a book. It should be at least undergrad level. I am open to a book covering several topics or specific topics such as linear algebra, probability, analysis, statistics, etc.
108
125
1.6K
201.2K
Spencer Frei (@sfrei_)
Job update: I've joined @GoogleDeepMind as a research scientist! I'll be working from the SF office. Super excited!
73
12
1.4K
102.8K
Niket Patel (@niketnpatel)
@ziv_ravid Not sure, I think the booth is closed, but I saw someone with a brick at some point
0
0
0
24
Ravid Shwartz Ziv (@ziv_ravid)
People at NeurIPS, does anyone have extra swag from the conference? I went to the booths, but apparently they were already gone. My kids will kill me 😱
1
0
1
530