Saurav Jha

103 posts

Saurav Jha banner
Saurav Jha

Saurav Jha

@saurav_j_

@IVADO_Qc postdoc @Mila_Quebec; interned @TencentGlobal, @SonyAI_global, @Inria_Nancy; ex-MLE @FactSet; PhD @UNSWComputing

Montréal, Québec Katılım Ağustos 2018
931 Takip Edilen198 Takipçiler
Sabitlenmiş Tweet
Saurav Jha
Saurav Jha@saurav_j_·
🎉 Happy to share that our paper “Mining your own secrets: Diffusion Classifier scores for Continual Personalization of Text-to-Image Diffusion Models” has been accepted to #ICLR2025! 👉 The work results from my #Sony internship in the stunning #Tokyo 🗼city w/ @shiqi_yang_147
Saurav Jha tweet media
English
2
2
21
2.3K
Saurav Jha retweetledi
Basile Terver
Basile Terver@BasileTerv987·
Latest great work from @artemZholus, they study in depth what has been a long-time assumption for the community working on JEPA-style world models. Semantic encoders have better captured the physics of the world. Hence, they are better-suited for decision making in robotics. We had compared some of these semantic encoders in our JEPA-WMs paper arxiv.org/abs/2512.24497, showing that, especially for real-world manipulation, better dense features was key (which is why DINO was > V-JEPA 1/2). Since V-JEPA-2.1 matches DINO on such dense tasks, I am not surprised it is better suited for robotic manipulation ! I really like the systematic set of evaluations, bridging reconstruction and success rate metrics ! Congrats for this contribution 👏
Artem Zholus@artemZholus

Extremely excited to share our recent work on diffusion world models. We ask a simple question - what space supports diffusion world modeling the most and how do we evaluate that?Turns out representation is the answer with JEPA space yielding the strongest diffusion world models!

English
1
3
24
2.8K
Saurav Jha retweetledi
Nilaksh
Nilaksh@nilaksh404·
Takeaway: For robotic diffusion world models, don’t choose the latent space only by visual realism. Start from strong semantic encoders, make them diffusion-friendly, and evaluate with policy-facing metrics. Project page: hskalin.github.io/semantic-wm/ Paper: arxiv.org/abs/2605.06388
English
1
2
8
654
Saurav Jha retweetledi
Nilaksh
Nilaksh@nilaksh404·
Main result: semantic latents are usually better for control-facing metrics. V-JEPA 2.1, Web-DINO, and SigLIP 2 improve action recovery, task-success prediction, CEM planning, policy rollouts, and robustness to distractors.
English
1
3
26
4.8K
Saurav Jha retweetledi
Chandar Lab
Chandar Lab@ChandarLab·
We're thrilled to see the Workshop on Weight-Space Symmetries coming to #ICML2026! Huge shoutout to our postdoc @KateLobacheva for co-organizing it. We're excited for the ideas and discussions this workshop will bring to the community!
Weight Space Symmetries @ ICML 2026@weightsymmetry

📢Excited to announce the Workshop on Weight-Space Symmetries @icmlconf! We welcome 4-page submissions analysing symmetries, their effects on training and model structure, and practical methods to utilize them. Submission Deadline: April 24 (23:59 AoE) #ICML2026

English
0
9
8
491
Saurav Jha retweetledi
Chandar Lab
Chandar Lab@ChandarLab·
NeoBERT: A Next-Generation BERT (TMLR Journal-to-Conference Track) We modernized BERT (RoPE, SwiGLU, 4k context). At just 250M params, it outperforms RoBERTa and ModernBERT on the MTEB benchmark. 📄 arxiv.org/abs/2502.19587
English
0
8
18
1.2K
Saurav Jha
Saurav Jha@saurav_j_·
sayonara sydney and summer, see you soon @NeurIPSConf 👀
Saurav Jha tweet media
Sydney, New South Wales 🇦🇺 English
0
0
0
54
Saurav Jha retweetledi
Andrew Gordon Wilson
Andrew Gordon Wilson@andrewgwils·
Continual learning as a discipline seems to have catastrophic forgetting that it has been focused on catastrophic forgetting for a decade with virtually no progress. Time for some radically new ideas in that area.
English
25
23
363
29.1K
Saurav Jha retweetledi
Saurav Jha retweetledi
Jehanzeb Mirza
Jehanzeb Mirza@jmie_mirza·
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models: arxiv.org/pdf/2410.06154 we did something similar, long before it was 'cool' ;-)
Jackson Atkins@JacksonAtkinsX

LLMs can now self-optimize. A new method allows an AI to rewrite its own prompts to achieve up to 35x greater efficiency, outperforming both Reinforcement Learning and Fine-Tuning for complex reasoning. UC Berkeley, Stanford, and Databricks introduce a new method called GEPA (Genetic-Pareto), an autonomous system for prompt optimization. The researchers tested this across diverse tasks like multi-hop Q&A and instruction following. They demonstrated gains using proprietary models like GPT-4.1 Mini and open-source models like Qwen3 8B. Here's a look at how it works: GEPA treats prompt optimization as a genetic evolution problem. It starts with a diverse "pool" of prompt candidates. It uses Pareto optimization to select the "fittest" prompts. It finds the ones that offer the best tradeoff between high performance on a task and low computational cost (measured in "rollouts"). It "evolves" new, better prompts using two key mechanisms: Crossover: Intelligently combining the best parts of two successful "parent" prompts to create a new "child" prompt. Reflective Mutation: This is the self-optimization engine. The system tasks an LLM to analyze its own detailed execution trace (its successes and failures) and then intelligently rewrite its own instructions to fix the flaws. How GEPA fits into your AI strategy: This method provides a powerful new tool without replacing existing ones. Here’s the distinction: GEPA works on its own. You can apply it directly to any base LLM to achieve significant performance gains just by optimizing the prompt. Fine-Tuning teaches the model what (domain knowledge), while GEPA optimizes how the model uses that knowledge (its reasoning process). This makes them powerful complements. You can use GEPA to supercharge a base model, OR you can apply it to an already fine-tuned model to get the absolute best performance from your expert AI. It's a new, flexible layer in the optimization toolkit that allows AI to optimize itself.

English
1
2
15
1.5K