Abhinav
171 posts


@alpercanbe yeah that's why i decided to write this, would love your thoughts
abhinavjha.xyz
English

@_rajanagarwal whether the data used provides enough signal to differentiate between different plausible solutions to uniquely identify the desired one, is a question i don't think i've been asking before, really good read!
English

i don’t usually share when things go wrong
but recently i was researching world models and it went sideways, but i learned a lot
i wrote a new essay about how not to do research rajan.sh/multiplayer
English

@sighjith @MaxDiffusionRL rlly appreciate it, love the stuff you've done!
English

@ak_cozmo the demos from @physical_int seem to show that they hold pretty well, do check out their blogs they're written so so well
English

@_AbhinavJ flow matching for VLA models is a game changer - converting noise to robot actions is such a clean way to think about policy learning. reminds me of why diffusion took off for image gen. how's the inference latency holding up for real-time robot control?
English

thanks! you're completely spot on, it's a mistake from my end. In the paper for π0.5 they denote t=1 as noise and t=0 as being inside of the target distribution. I personally think it's easier to interpret it going from t=0 (noise) to t=1 (target distribution) instead, which is why the equation in the first section was also x_t = (1-t)x0 + t*x1. But when the graph was plotted it was indeed from t=1 to t=0 since it's hooked up to the actual implementation of pi0.5 and I forgot to reverse it. It's been fixed now, great catch, and thank you for reading attentively!
English

@_AbhinavJ Great post! I’m a bit confused by the cosine similarity and L2 distance plots, they look reversed along the x axis, eg shouldn’t cosine similarity start at 0 then increase to 1? Or am I interpreting it incorrectly
English

@pham_blnh thank you, loved the stuff you guys did with the data collection pipeline on the humanoid
English

@JieWang_ZJUI @yacinelearning @nrehiew_ @KyleVedder @chris_j_paxton thanks! yeah I did, just wanted to get the first version of the blog out, will be adding that soon as well!
English

@_AbhinavJ @yacinelearning @nrehiew_ @KyleVedder @chris_j_paxton Congrats! Have you considered visualizing the 3D trajectory process with pi0 in the simulator? Could be a cool proof of concept.
English

@yacinelearning @nrehiew_ @KyleVedder @chris_j_paxton @JieWang_ZJUI it has to be these visuals that show how noise transforms into actions that seem to have some local structure
GIF
GIF
GIF
English

@_AbhinavJ @nrehiew_ @KyleVedder @chris_j_paxton @JieWang_ZJUI nice man congrats what’s the main highlight?
English

@nrehiew_, @KyleVedder, , @chris_j_paxton, @JieWang_ZJUI, @yacinelearning
would love your thoughts on this
English

Just noticed that my blog post on Long Context is cited here 🫡
Prime Intellect@PrimeIntellect
We believe the next breakthrough in long-horizon agents is training models to manage their own context. Introducing our new research direction on Recursive Language Models. We are sharing our initial experiments showing the promise of RLMs. primeintellect.ai/blog/rlm
English

@_advaitpatel has to be cli, it just works better, though it does sound robotic asf
English

@elliotarledge It was such a big promise, completely forgot it was even made
English

The one thing we couldn't seem to get was 10 million context windows.
Alex Albert@alexalbert__
What are your predictions for AI in 2025? Could be benchmark scores, models, industry dynamics, etc - I want to hear it all
English








