Vishal Ganeriwala retweetledi
Vishal Ganeriwala
5.1K posts

Vishal Ganeriwala
@VishalG
AI Product & Product Marketing leader @NVIDIA, Previously VP of Product Marketing @Citrix. These views and opinions are my own and don’t represent my employer
FL Katılım Mayıs 2008
236 Takip Edilen2.4K Takipçiler
Vishal Ganeriwala retweetledi

NVIDIA has released Nemotron Nano 9B V2, a small 9B reasoning model that scores 43 on the Artificial Analysis Intelligence Index, the highest yet for <10B models
Nemotron 9B V2 is the first Nemotron model pre-trained by @NVIDIA. Previous Nemotron models have been developed by post-training on Meta Llama models.
Architecture & Training: The model uses a hybrid Mamba-Transformer architecture. NVIDIA pre-trained a 12B parameter base model and applied post-training with a range of techniques including RLHF and GRPO. The final 9B size was pruned from this model and re-trained with the base model as a teacher.
Small-model frontier: with only 9B parameters, Nemotron Nano 9B V2 is placed ahead of Llama 4 Maverick on our leaderboard, equal to Solar Pro 2 with reasoning and trails just behind gpt-oss-20B (high).
Along with this model, NVIDIA released a 6.6-trillion token subset of their pre-training data for public use on @huggingface
Key model details:
➤ 128k token context window
➤ Supports reasoning and non-reasoning modes (with ‘/no_think’ settings in the system prompt)
➤ Released under the NVIDIA Open Model License, and not additionally covered by Meta’s Llama license like prior Nemotron models - this means that there is no limitation on use by large companies or requirement to keep ‘Nemotron’ in the name of derivative models
➤ No serverless inference providers are yet serving the model, but it is available now on Hugging Face for local inference or self-deployment
See below for our full analysis and key announcement links from NVIDIA 👇

English
Vishal Ganeriwala retweetledi

👀 @OpenAI's GPT-5 was trained on NVIDIA H100 and H200s GPUs and served on systems like NVIDIA GB200 NVL72 featuring 72 #NVIDIABlackwell GPUs and 36 Grace CPUs, connected using advanced NVIDIA NVLink and NVLink Switch computing fabrics designed for state-of-the-art AI at scale.
Congratulations to OpenAI on launching GPT-5, packed with enhanced reasoning and coding power. 🎉
OpenAI@OpenAI
GPT-5 is here. Rolling out to everyone starting today. openai.com/gpt-5/
English
Vishal Ganeriwala retweetledi

CoreWeave is planning on being part of DGX Cloud Lepton marketplace to meet AI demand.
Learn more & join Early Access - hubs.la/Q03rN9D60

English

@jeffboudier It has been great to partner with you and your team. HuggingFace totally rocks and we think we can help developers connect with cloud providers
Jeff Boudier 🤗@jeffboudier
Proud to connect the global community of AI researchers with the global network of @nvidia GPUs with Training Cluster as a Service announced today at #GTCParis! 🤗🌎 Thanks @AlexisBjorlin for building DGX Cloud Lepton, and thanks Jensen for the shoutout!
English

Vishal Ganeriwala retweetledi

The open source DeepSeek-R1 model is now available as an NVIDIA NIM microservice preview on build.nvidia.com to help developers securely experiment with its advanced AI reasoning capabilities.
NVIDIA AI Developer@NVIDIAAIDev
Securely experiment and build your own specialized agents, as the 671-billion-parameter DeepSeek-R1 model is now available as an NVIDIA NIM microservice in preview on build.nvidia.com. Learn more ➡️ nvda.ws/4grQaBq
English
Vishal Ganeriwala retweetledi

HiPerGator AI 2.0 has arrived ✨
UF is the first university to receive the latest @nvidia technologies — a $24 million upgrade to one of the world's most powerful supercomputers that will propel #GatorNation to new heights.
news.ufl.edu/2025/01/fastes…
English

It took some time, but got some good photos from #solareclipe2024. Totality was 2x as long as 2017. And nearing solar maximum, was more corona and flare activity. Amazing to see. Plus, great location in Greers Ferry, AR
Full blog: chaoticnebula.com/2024/04/17/the…
#Astrophotography



English

@reillyusa @Dr_JCF Hope everything is ok my friend, sending prayers
English

Unequivocally, today has been the hardest day of our lives. Jane and I will be forever indebted to the consummate skill of @Dr_JCF and his wonderful team. From the bottom of our hearts, thank you. 🙏❤️
English

I spoke with an ISV working on some cutting edge AI today on why they are building on top of NVIDIA GPUs. Their answer. - When you buy and build on an NVIDIA GPU. You are not just buying a GPU, you are getting 1000s of framework and the full ecosystem around it. #GTC24
English

Today is the beginning of our moonshot to solve embodied AGI in the physical world. I’m so excited to announce Project GR00T, our new initiative to create a general-purpose foundation model for humanoid robot learning.
The GR00T model will enable a robot to understand multimodal instructions, such as language, video, and demonstration, and perform a variety of useful tasks. We are collaborating with many leading humanoid companies around the world, so that GR00T may transfer across embodiments and help the ecosystem thrive.
GR00T is born on NVIDIA’s deep technology stack. We simulate in Isaac Lab (new app on Omniverse Isaac Sim for humanoid learning), train on OSMO (new compute orchestration system to scale up models), and deploy to Jetson Thor (new edge GPU chip designed to power GR00T).
Announced in Jensen's keynote, Project GR00T is a cornerstone for the “Foundation Agent” roadmap of the newly founded GEAR Lab. At GEAR, we are building generally capable agents that learn to act skillfully in many worlds, virtual and real. See if you can spot "GEAR" in the video ;)
Join us on the journey to land on the moon.
English

Vishal Ganeriwala retweetledi

@VishalG @NVIDIAGTC I am connected with Mel. I don't know if she has any more room at the inn, but I will ask. Hope all is well Vishal.
English

I'm feeling like I should really try and get to @NVIDIAGTC next month!
English

@chrisfleck I was waiting for your post tell us that there is Citrix receiver for Vision Pro 🤣
English

VisionPro first impressions:
Pros: Amazing new experience, high resolution, great large screen and multi display viewing. 3D photo viewing. No bothersome VR type claustrophobia.
Cons: Not comfortable enough for extended wear time. Can’t be shared easily. Price..
It’s a V1 but a great taste of the future.

English






