Fuxiao Liu

121 posts

Fuxiao Liu

Fuxiao Liu

@FuxiaoL

Research Scientist @Nvidia | CS PhD @UMDCSI, working on LLM, Multimodal Stuff

Washington, DC Katılım Ekim 2021
746 Takip Edilen861 Takipçiler
Fuxiao Liu retweetledi
AK
AK@_akhaliq·
Nvidia released Nemotron 3 Nano Omni made a gradio app for it on Hugging Face
AK tweet media
English
8
7
47
17.5K
Fuxiao Liu
Fuxiao Liu@FuxiaoL·
Stop choosing between efficiency and accuracy for your AI agents. 🛠️ NVIDIA Nemotron 3 Nano Omni is here: ✅ Unified Reasoning: One model for video, audio, and text. ✅ Up To 9x Throughput: Massive efficiency gains for video workflows. ✅ Fully Open: Weights + Data + Recipes. Optimized for Blackwell and available as a NIM. Let’s build. 🚀 Hugging Face: nvda.ws/4u79ue9 Tech Report: nvda.ws/4dbBYxO #nvidia #nemotron #omni
NVIDIA@nvidia

x.com/i/article/2049…

English
0
2
3
282
Fuxiao Liu retweetledi
Bryan Catanzaro
Bryan Catanzaro@ctnzr·
Thank you to everyone in the community who is testing and using Nemotron models. It's great to see Nemotron-Cascade-2, Nemotron-3-Super and Nemotron-3-Nano trending on HF. The Nemotron team is working hard to incorporate all your feedback into Nemotron 4. And yes, Nemotron 3 Ultra is still on track for release. huggingface.co/models?pipelin…
Bryan Catanzaro tweet media
English
20
39
225
54.8K
Fuxiao Liu retweetledi
Bryan Catanzaro
Bryan Catanzaro@ctnzr·
Announcing NVIDIA Nemotron 3 Super! 💚120B-12A Hybrid SSM Latent MoE, designed for Blackwell 💚36 on AAIndex v4 💚up to 2.2X faster than GPT-OSS-120B in FP4 💚Open data, open recipe, open weights Models, Tech report, etc. here: research.nvidia.com/labs/nemotron/… And yes, Ultra is coming!
Bryan Catanzaro tweet media
English
62
205
1.2K
206.6K
Fuxiao Liu retweetledi
Guilherme Favaron
Guilherme Favaron@guifav·
What if a vision model could learn to reason about images without ever seeing one? MM Zero, from Zongxia Li, @FuxiaoL, and researchers at University of Maryland, Brown, Adobe, and NVIDIA, introduces a three role framework where VLMs bootstrap visual reasoning from literally zero data. Three agents (Proposer, Coder, Solver) all start from the same base model. The Proposer invents visual concepts and questions. The Coder renders them as SVG code. The Solver reasons over the results. All trained via GRPO reinforcement learning, no human annotation. On visual reasoning benchmarks, Qwen3 VL 8B improves from 50.7% to 56.6% accuracy. Mimo VL 7B goes from 50.9% to 56.0%. Performance keeps climbing through 5 iterations with no sign of plateau. The data bottleneck for VLM training may be less about collecting more images and more about letting models generate their own.
Guilherme Favaron tweet media
English
1
1
3
115
Simon Zhai
Simon Zhai@simon_zhai·
Today is my last day at xAI, feeling very fortunate about the opportunity. It has been an amazing journey 🫡🫡🫡
English
93
37
1.1K
171.9K
Fuxiao Liu
Fuxiao Liu@FuxiaoL·
I’ll be attending #neurips2025 in San Diego next week! If you’re interested in unified models, multimodal agents, or video generation, feel free to reach out. And if you’re worried that NVIDIA GPUs might not sell out… come talk to me — I promise I can change your mind 😂 I’ll be at the @nvidia booth on Dec 5th #NVIDIA
NVIDIA Newsroom@nvidianewsroom

We’re delighted by Google’s success — they’ve made great advances in AI and we continue to supply to Google. NVIDIA is a generation ahead of the industry — it’s the only platform that runs every AI model and does it everywhere computing is done. NVIDIA offers greater performance, versatility, and fungibility than ASICs, which are designed for specific AI frameworks or functions.

English
0
0
5
2.2K
Fuxiao Liu retweetledi
NVIDIA Newsroom
NVIDIA Newsroom@nvidianewsroom·
We’re delighted by Google’s success — they’ve made great advances in AI and we continue to supply to Google. NVIDIA is a generation ahead of the industry — it’s the only platform that runs every AI model and does it everywhere computing is done. NVIDIA offers greater performance, versatility, and fungibility than ASICs, which are designed for specific AI frameworks or functions.
English
1.2K
875
13.3K
11.8M