Yuval Atzmon

216 posts

@AtzmonYuval

Research Scientist @NVIDIA, Generative AI, reasoning across different data modalities, and compositionality with few or zero examples. Opinions are my own.

London, United Kingdom · Joined December 2016
194 Following · 546 Followers
Roei Herzig@roeiherzig·
Excited to be at @iclr_conf ICLR 2026 in Rio next week 🌴🇧🇷✨ I’ll be presenting two of our papers:
🤖 Learning to Grasp Anything by Playing with Random Toys (lego-grasp.github.io) -> We show that training robots on random toys enables zero-shot grasping of real-world objects.
📄🌐 DAVE: A VLM Vision Encoder for Document Understanding and Web Agents -> We developed a vision encoder purpose-built for VLMs and tailored to document understanding and web agents.
If you’ll be there, let’s connect! 🚀 #iclr2026 #PhysicalAI #agents
Yuval Atzmon retweeted
Gal Dalal@DalalGal·
1/ More test-time compute can actually hurt LLM reasoning. ⚠️ Beam search is often treated as a free lunch: wider beam, more candidates, better answers. In our new paper, we show that after a certain point, the opposite can happen.
Yuval Atzmon@AtzmonYuval·
A surprising finding: instead of learning spatial patterns, classifiers cheat by locking onto linguistic traces from the prompt that leak into the attention maps. To prevent this, we inverted each training image and paired it with both the correct and the incorrect relation, so classifiers can't take shortcuts.
Yuval Atzmon@AtzmonYuval·
The cool thing: despite training on single spatial relations, Learn-to-Steer generalizes to multiple relations in a single image, even 5 objects with 3 relations. And it works across different diffusion architectures, including MMDiT.
Yuval Atzmon@AtzmonYuval·
This was a fun work, led by the skilled @sapiryiflach. T2I models struggle with spatial reasoning: "a dog to the right of a cat" often comes out wrong. Instead of handcrafting a test-time loss function, we trained a classifier on attention maps and used it as a differentiable loss.
Sapir Yiflach@sapiryiflach

🚀Excited to present our new paper that has been accepted to #WACV2026! Text-to-image models often fail at simple spatial tasks, like placing a dog to the right of a teddy bear. Our solution: Learn-to-Steer. We learn a loss function directly from attention maps and apply it during inference. This work was done together with @AtzmonYuval and @GalChechik 📰arXiv: arxiv.org/abs/2509.02295 🌐Project page: learn-to-steer-paper.github.io 📽️Video: youtu.be/KaxRwlE-UFg

Yuval Atzmon retweeted
Sapir Yiflach@sapiryiflach·
🚀Excited to present our new paper that has been accepted to #WACV2026! Text-to-image models often fail at simple spatial tasks, like placing a dog to the right of a teddy bear. Our solution: Learn-to-Steer. We learn a loss function directly from attention maps and apply it during inference. This work was done together with @AtzmonYuval and @GalChechik 📰arXiv: arxiv.org/abs/2509.02295 🌐Project page: learn-to-steer-paper.github.io 📽️Video: youtu.be/KaxRwlE-UFg
Yuval Atzmon retweeted
Dvir Samuel@dvir_samuel·
🚀 Excited to share our new paper: “Fast Autoregressive Video Diffusion & World Models with Temporal Cache Compression & Sparse Attention.” We address attention bottlenecks in auto-regressive video diffusion, enabling ×5–×10 speedup and constant memory over long rollouts.
Yuval Atzmon retweeted
Chen Tessler@ChenTessler·
At @nvidia, we built ProtoMotions to help us, and researchers world-wide, innovate quickly without compromising on applicability. We're proud to announce ProtoMotions3 -- our biggest release yet! 🧵👇
Yuval Atzmon retweeted
Assaf Shocher@AssafShocher·
They tell you neural nets are non-linear. What does "linear" even mean?! Linearity is only defined given two vector spaces, X → Y. What if we could find a different pair of spaces where NNs ARE linear? 🤯 We do it and use it for many apps, such as one-step diffusion! 🧵
Or Patashnik@OPatashnik·
📢 Today I begin my first semester as faculty in Computer Science at @TelAvivUni! Excited to start this new journey, and grateful to teach & research where my own journey began 🩵
Yuval Atzmon retweeted
Bryan Catanzaro@ctnzr·
Today we're releasing NVIDIA Nemotron Nano v2 - a 9B hybrid SSM that is 6X faster than similarly sized models, while also being more accurate. Along with this model, we are also releasing most of the data we used to create it, including the pretraining corpus. Links to the models, datasets, and tech report are here: research.nvidia.com/labs/adlr/NVID…
Yuval Atzmon@AtzmonYuval·
@ziv_ravid Did you try with markdown? There's an option to code slides using markdown syntax. This would give you instant iteration if you're using Cursor and a VS Code plugin. I was planning to try it next time I prepare a presentation.
Ravid Shwartz Ziv@ziv_ravid·
I ended up with ~70 slides, which meant the context window (and my token limit) filled up soooo fast all the time (I had to paste relevant papers each time). Lots of back-and-forth with the model (asking it to generate LaTeX code, copying to Overleaf, checking results).
Mehul Damani @ICLR@MehulDamani2·
🚨New Paper!🚨 We trained reasoning LLMs to reason about what they don't know. o1-style reasoning training improves accuracy but produces overconfident models that hallucinate more. Meet RLCR: a simple RL method that trains LLMs to reason and reflect on their uncertainty -- improving both accuracy ✅ and calibration 🎯. [1/N]
Yuval Atzmon retweeted
UriG@uri_gadot·
Tired of manual #ComfyUI workflow design? While recent methods predict them, our new paper, FlowRL, introduces a Reinforcement Learning framework that learns to generate complex, novel workflows for you! paper [arxiv.org/abs/2505.21478]
Yuval Atzmon retweeted
Gal Dalal@DalalGal·
1/4 🚨 1st of 3 ICML 2025 papers! We bring gradient boosting trees (like XGBoost) to RL — live on real datacenters. Our GBRL framework is robust, efficient, and deployable on lightweight hardware — even RISC-V CPUs 💻 🧵👇
Yuval Atzmon retweeted
Yftah Ziser@YftahZ·
1/6🚀 New #ACL2025Findings: We show you can predict if Chain-of-Thought (CoT) reasoning will succeed — before any tokens are generated! This works with LLMs not specifically trained for reasoning—meaning powerful signals emerge naturally in early processing.
Yuval Atzmon@AtzmonYuval·
Tomorrow, 3pm, #ICLR2025, super creative work by @YoadTewel. Adding objects to images, in natural ways, just from text prompts. It's completely zero-shot, and also resonates with "affordance" in vision, robotics and CogSci. I'll be there too. Come say 👋!
Yoad Tewel@YoadTewel

I'm going to present Add-it at #ICLR2025 tomorrow (Thursday) @ 3pm - poster #163! Project page: research.nvidia.com/labs/par/addit/ If you're around this week, feel free to DM me - happy to chat! Details below ⬇️🧵
