
Indraneil Paul
169 posts

Indraneil Paul
@androneil54
@ELLISForEurope PhD Candidate @UKPLab @TUDarmstadt ex. Research Intern @awscloud Working on RL + Code-Gen + Function Calling


This is counter-intuitively the correct take. All the Ameribros with their hur-dur Europoor takes are so provincial they’ve never been to Southern Europe or any developing country where there’s an AC compressor bolted to every balcony. It’s not about money, it’s a pure cultural artifact that stymies AC adoption, which is why it’s so fascinating.






Ever wished we had fewer X-training hyphenates? Pre, mid, post etc. Why not just Training? Trying to bridge the divides (and get all our friends into one team again), we intro *Introspective X Training*, an offline RL inspired method that scales effectively across any LLM stage by annotating your data with a thinking reward generated language critique! Up to 2.8x FLOP efficiency + 5-10 point score gains (esp with math and code) at any stage from scratch to 24T tokens on 8b (active) sized models!! We burned much compute ablating so you wouldn't have to Moral of the story is‼️don't throw out any data via filtering, just feedback condition it‼️ You can spend FLOPs up front on inference to *classify* data quality and then train so that tokens aren't all treated equally based on the feedback starting early in training itself. Right now they're really only separated out much later during mid/post training This improves overall compute efficiency and gives us benchmark perf not possible with just baseline methods! Paper here: arxiv.org/abs/2605.20285 Thanks to @BrandoCui and @GXiming for leading this w/ @__SyedaAkter @davidjesusacu @hyunw_kim @jaehunjung_com Yuxiao Qu @shrimai_ @YejinChoinka

AI is killing All About Berlin. When you Google something, you used to get a link to my website, but now you get an AI-generated answer trained on my work. This has a devastating impact on traffic.






Time to upgrade your pretraining dataset. Instead of FineWeb-EDU / DCLM / X, try ClimbMix-400B. 📄 Paper: arxiv.org/pdf/2504.13161 📦 Data: huggingface.co/datasets/nvidi… CLIMBMix uses clustering-based iterative data mixture to improve pretraining efficiency and data quality. Would love to see the community experiment with it and push it further 🚀



Under Mayor Anne Hidalgo, Paris has created nearly 300 “rues aux écoles in an effort to; improve air quality, reduce crashes, and give kids more safe spaces in their neighborhoods to walk, bike, play and just be kids. 🇫🇷


Convert your embeddings to spherical coordinates before compression - this trick cuts embedding storage from 240 GB to 160 GB, and 25% better than the best lossless baseline. Reconstruction is near-lossless as the error stays below float32 machine epsilon - so retrieval quality is preserved perfectly. Works across text, image, and multi-vector embeddings. No training, no codebooks.





Anyone else have this? @huggingface has been shrinking my storage at the end of each month and charging me $100+ I then delete data to meet the limit but the next month it lowers/charges me again. When I emailed they didn’t give a straight answer on the storage lowering.. @Thom_Wolf @ClementDelangue would be nice to get an email just telling us straight up the max we're allowed and the period we have to get to that number. Right now it seems like you’re shrinking our storage at the last minute to “catch” and charge us








