Haresh Karnan (@KarnanHaresh)
RL post-training @ Amazon AGI | PhD-UTAustin
San Francisco, CA · Joined February 2023
114 posts · 1.1K Following · 264 Followers
hallerite (@hallerite)
@emaadmanzoor verl is a pain to work with and verifiers is an environments framework first and foremost
hallerite (@hallerite)
I see the need for 2 different kinds of LLM-RL frameworks. One should be optimized for post-training of LLMs by labs that want to build frontier models. It doesn't have to be feature-complete, but should follow best practices and consolidate what we learn over time.
Haresh Karnan (@KarnanHaresh)
@ThereseMaggie @_arohan_ I meant the IdlyExpress / Mylapore chain of restaurants (you’ll find them in the South Bay). I remember reading somewhere that they recently opened an IdlyExpress branch in SF.
Therese Maggie (@ThereseMaggie)
@_arohan_ The white space for delicious South Indian food in SF is massive. It’s a shame that the whole of South Indian cuisine gets reduced to just the humble dosa. The only other restaurant that serves lesser-known South Indian food is Copra, but that’s mostly experimental, although delicious.
Haresh Karnan reposted
Vana AI (@Vana_Meeting_AI)
The Cost of Ambiguity: there’s a massive, painful gap between knowing a mental model and applying it exactly when the conversation needs it most. Every founder, PM, and exec we spoke to knows this pain. The result: meetings dissolve into opinion, or worse, end with a vague "let’s revisit this next week." 😩 Introducing 🥁 Vana AI. It's not a note-taker. It's not an action-item generator. It’s your proactive AI strategic thinker for real-time insights in meetings.
Devvrit (@Devvrit_Khatri)
Wish to build scaling laws for RL but not sure how to scale? Or what scales? Or whether RL would even scale predictably? We introduce: The Art of Scaling Reinforcement Learning Compute for LLMs
[image]
Jim Bohnslav (@jbohnslav)
just wasted an hour of my time interviewing some obvious cluely-using motherfucker. do I have to conduct all interviews blindfolded? mirrored glasses?
Gowthami (@gowthami_s)
Why is there a weird distinction in post-training, like SFT first and then RL? Why can't they be done together? IMO, I see merit in interleaving these paradigms. Is there any research pointing to the contrary or supporting this sequential process?
Armen Aghajanyan (@ArmenAgha)
We've internally built a streaming solution to stream 1PB of multi-modal data over hundreds of GPUs for weeks without ever touching NFS, with no training performance regression. Enough interest for a blog post?
Quoting Orr Zohar (@orr_zohar):
🚨 Huge for multimodal/vision AI: datasets hit 100s of TB, making on-prem storage a nightmare. 🤗 Now stream them directly from Hugging Face to GPUs, unlocking scalable training of everything from VLMs to world models. 🚀 I've battled storage limits for years; thrilled to move on.
Tanishq Mathew Abraham, Ph.D. (@iScienceLuvr)
Did they beat Tesla to the Tesla strategy? Lol. Basically, they are deploying "self-driving" humanoids to consumers. It might not work that well initially and needs a lot of human supervision, but the data collected will continue to improve the robot and get it closer to "full self-driving".

I would note that a home is a much more challenging environment for humanoids than roads are for cars: everyone lives differently, in a variety of floorplans and environments, while roads have more consistency. So I do wonder how long it's gonna take for humanoids to reach near-full autonomy.

That said, like Teslas, Neo could be an appreciating asset if it continues to get better and more autonomous. Anyway, that would be the hope, right? Let's see what happens in practice. I'm cautiously optimistic! I wonder when Tesla Optimus will be available 🤔
Quoting 1X (@1x_tech):
NEO, the home robot. Order today.
Jared Palmer (@jaredpalmer)
Your @GitHub Universe badge comes with a hackable Raspberry Pi built into it with full color display, 5 buttons, USB-C, Bluetooth, and WiFi. Built by @pimoroni
Nathan Lambert (@natolambert)
Life update, she said yes. 🤩👩‍❤️‍👨🐕‍🦺
[image]
Nathan Lambert (@natolambert)
Retiring my original AirPods Pro after over 5 years of use. A ridiculous amount of value, really, when we're used to 2-year product cycles.
Siva Reddy (@sivareddyg)
Lots of insights in @YejinChoinka's talk on RL training. RIP next-token prediction (NTP) training, and welcome Reinforcement Learning Pretraining (RLP). #COLM2025 No place to even stand in the room.
[image]
will brown (@willccbb)
the sampling is fine. the biggest red flag is that they need 12 GPUs for backprop and don’t do any microbatch packing / grad accum to accommodate for async/server. on 16 GPUs i’d want 2-4 for training + 12-14 for inference; you can do a loooot of backward passes in the time it takes to generate 512 rollouts
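The grad-accum idea in the tweet above — splitting a big batch into microbatches and applying one optimizer update, so few training GPUs can keep pace with many inference GPUs — can be sketched in plain Python. This is a minimal illustrative toy, not code from any RL framework: `grad_of` is a hypothetical gradient for a 1-D least-squares loss, standing in for a real backward pass.

```python
def grad_of(w, x, y):
    # gradient of 0.5 * (w*x - y)^2 with respect to the scalar weight w
    return (w * x - y) * x

def train_step(w, batch, lr=0.01, accum_steps=4):
    """Split one large batch into `accum_steps` microbatches, accumulate
    gradients, and apply a single optimizer update at the end — numerically
    identical to one full-batch step, but each backward fits in less memory."""
    micro = len(batch) // accum_steps
    g = 0.0
    for i in range(accum_steps):
        chunk = batch[i * micro:(i + 1) * micro]
        # mean gradient over this microbatch, scaled so the accumulated
        # sum equals the mean gradient over the whole batch
        g += sum(grad_of(w, x, y) for x, y in chunk) / len(chunk) / accum_steps
    return w - lr * g  # one update per accumulated batch

data = [(x, 2.0 * x) for x in range(1, 17)]  # toy data with target w = 2
w = 0.0
for _ in range(200):
    w = train_step(w, data)
print(round(w, 3))  # converges toward 2.0
```

Because the update is applied only after all microbatches, the trainer can hold far fewer GPUs than the rollout workers while seeing the same effective batch size.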
will brown (@willccbb)
"veRL is the best RL framework it's super efficient" really. are you sure about that. are you sure that you need 16 GPUs to tune a 7B model at 8k context. do you think that it's reasonable each step takes 19 minutes for this
[image]
Jing Yu Koh (@kohjingyu)
My toxic trait is thinking I can train a frontier model with 8 H100s
Qwen (@Alibaba_Qwen)
Ready to meet the biggest, brainiest guy in the Qwen3 family?
will brown (@willccbb)
wow. thanks
[image]
Kyle🤖🚀🦭 (@KyleMorgenstein)
Despite living here 5 years, this is somehow my first summer in Austin. I was warned it would be brutal, but honestly it hasn’t been that bad. The key, imo, is to spend as much time outside as possible. I run or row most days. Of course, if you spend all day with the AC blasting, you’ll melt!
Jim Bohnslav (@jbohnslav)
> be me, training vlms
> use hf transformers because who wants to reimplement models if they don't have to
> low MFU
> `pytorch_profiler.py`
> ViT takes 40X longer than the LLM
> sus
> .item() in the forward pass makes cudaStreamSynchronize every attention layer
> MFW
[GIF]
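The `.item()` pitfall in the greentext above can be sketched without real GPUs. The `FakeTensor` class below is a made-up stand-in: its `.item()` merely counts the blocking host reads that, on a real CUDA tensor, would each force a `cudaStreamSynchronize` and stall the stream once per layer.

```python
class FakeTensor:
    """Made-up stand-in for a GPU tensor: .item() models the blocking
    host<->device sync a real CUDA .item() triggers."""
    sync_count = 0  # how many times the "stream" was stalled

    def __init__(self, value):
        self.value = value

    def item(self):
        FakeTensor.sync_count += 1  # each host read stalls the device
        return self.value

def attention_layer_bad(x):
    scale = FakeTensor(0.125)
    return x * scale.item()  # host read inside the forward pass -> sync

def attention_layer_good(x, scale=0.125):
    return x * scale  # keep scalars as plain constants; no device read

x = 1.0
for _ in range(32):  # e.g. a forward pass through 32 attention layers
    x = attention_layer_bad(x)
print(FakeTensor.sync_count)  # prints 32: one forced sync per layer
```

In real PyTorch code the fix has the same shape: hoist `.item()`, `float(tensor)`, and tensor prints out of the per-layer forward path so the CUDA stream is never forced to drain mid-model.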