NVIDIA AI

12.6K posts

NVIDIA AI

@NVIDIAAI

Teaching your AI new tricks.

Santa Clara, CA Katılım Haziran 2016

853 Takip Edilen293.6K Takipçiler

Sabitlenmiş Tweet

NVIDIA AI@NVIDIAAI·28 Nis

Meet Nemotron 3 Nano Omni 👋 Our latest addition to the Nemotron family is the highest efficiency, open multimodal model with leading accuracy. 30B parameters. 256K context length. 🧵👇

English

188

1.3K

444.6K

NVIDIA AI@NVIDIAAI·28m

@adcock_brett Amazing - congrats to the team 👏

English

1.8K

Brett Adcock@adcock_brett·32m

This is crazy - 2 hours away from 24 hours of continuous humanoid work! The robots have sorted over 28,000 packages so far Bob, Frank, and Gary are all healthy

English

733

24.4K

NVIDIA AI@NVIDIAAI·2h

@HermesAgentTips 💯

QME

Hermes Agent Tips@HermesAgentTips·2h

@NVIDIAAI They are all unique in their own way!

English

NVIDIA AI@NVIDIAAI·2h

💚 seeing these builds

Sam Wasserman🦞@SamJWasserman

Just finished building the @NVIDIAAI Spark rig. Just need to add the Quad Fan. Even built a live custom dual sparks monitoring dashboard for the screen.

English

5.5K

NVIDIA AI@NVIDIAAI·11h

@0xSero Let's gooo!!

English

528

0xSero@0xSero·14h

My next big thing.

English

4.5K

NVIDIA AI retweetledi

NVIDIA Data Center@NVIDIADC·17h

💡 Why did @togethercompute choose NVIDIA Blackwell to serve DeepSeek-V4? Because NVIDIA Blackwell is built for the bottlenecks that matter most in long-context inference: → KV-cache pressure during decode → MoE weight bandwidth during prefill A single NVIDIA HGX B200 system can keep DeepSeek-V4’s compressed CSA/HCA/SWA cache layouts resident across many concurrent long-context requests, while native MXFP4 support enables efficient end-to-end quantized inference for V4’s MoE weights. The result? Higher throughput, lower overhead, and optimized serving efficiency at scale.

Together AI@togethercompute

x.com/i/article/2053…

English

14.2K

NVIDIA AI@NVIDIAAI·15h

Thanks for building with us @CrusoeAI 💚 As context windows explode, tokenization is becoming a major hidden bottleneck in inference pipelines. fastokens is open source, already integrated with Dynamo & @lmsysorg, and designed for the next generation of 100K-token agent systems.

English

843

Crusoe@CrusoeAI·1d

The tokenizer inside @NVIDIA Dynamo? That's fastokens — built by Crusoe. On NVIDIA GB200 NVL72 it delivers 9.1x average speedup over Hugging Face and up to 40% faster TTFT. We don't just run open-source. We build it. Read the full breakdown: crusoe.ai/resources/blog…

English

1.5K

NVIDIA AI@NVIDIAAI·18h

@xqybyt 🙌

QME

xqybyt (p=np)@xqybyt·18h

@NVIDIAAI nemotron 3 super. it's insanely good!

English

NVIDIA AI@NVIDIAAI·1d

Curious what people are running locally these days 👀

Sudo su@sudoingX

what a time to be alive. i asked my dgx spark for updates from my phone. hermes agent came back with all 8 tests passing across 3 test suites. all green. all done autonomously on a 121B model running locally. i didn't write a single test. i just asked. this is the future and it's already here.

English

10.6K

NVIDIA AI@NVIDIAAI·18h

@PrimeIntellect @nvidia @vllm_project @sgl_project Congrats! 👏

English

Prime Intellect@PrimeIntellect·1d

We use renderers across Lab, verifiers, and prime-rl. We are collaborating with leading open-source partners, including @NVIDIA @vllm_project @sgl_project, to ensure it can become a useful standard across models, inference engines, and RL infra stacks throughout the ecosystem. primeintellect.ai/blog/renderers

English

6.5K

Prime Intellect@PrimeIntellect·1d

Introducing Renderers RL trainers work in tokens. Environments work in messages. Going back and forth corrupts sampled tokens, wasting compute on every agentic turn. With Renderers, we fix this mismatch. This unlocks >3x throughput on popular open models.

English

688

182.1K

NVIDIA AI@NVIDIAAI·19h

@Recursive_SI Congrats!

English

1.1K

Recursive@Recursive_SI·1d

x.com/i/article/2054…

ZXX

358

346.5K

NVIDIA AI@NVIDIAAI·20h

@LangChain @nvidia we see you!

English

677

LangChain@LangChain·20h

Special guests at the Ask me Anything booth. @NVIDIA! Stop by during lunch!

English

3.3K

NVIDIA AI@NVIDIAAI·21h

@mouser58907 🙌

QME

276

Craig Mouser@mouser58907·21h

Since I got it stable I've burned 1B tokens on Qwen3.6 that would have cost over $500! This thing will definitely pay for itself in under 1 year. No regrets so far. Running Total (22 days): 1,046,312,172 tokens | 15,630 messages | $525.62 @NVIDIAAI

Craig Mouser@mouser58907

Uh oh, look what daddy did! (Hopefully not a huge regret)

English

536

NVIDIA AI@NVIDIAAI·21h

@0xSero Congrats!

English

2.8K

0xSero@0xSero·22h

I just received a 100,000$ grant from the Human Rights Foundation. In total I received: - 100K USD through HRF - 25.8K USD through donations site - 25K Brev credits through Nvidia - 4x B200s for a month - 5K from lambda - 4x RTX PRO 6000 private donor Open source must win

0xSero@0xSero

x.com/i/article/2034…

English

216

167

3.4K

154.1K

NVIDIA AI@NVIDIAAI·21h

Read more 👇 nvda.ws/4eKCZhz

English

2.7K

NVIDIA AI@NVIDIAAI·21h

Delivering agentic inference at scale requires balancing efficiency across: 1) Models and algorithms 2) Software 3) Compute Our full-stack platform continuously optimizes for these inputs using extreme co-design across compute, networking, storage, and memory. Plus, software with broad ecosystem support across millions of developers. The result: lower cost per token, higher throughput, and more scalable AI systems.

English

107

7.6K

NVIDIA AI@NVIDIAAI·22h

OpenShell v0.0.40 🔀 local-domain service routing in the gateway ☸️ k8s node scheduling + tolerations 🔒 CLI TLS now uses the OS trust store 🛡️ SecretResolver debug no longer leaks secrets Two security fixes ship alongside new routing and K8s scheduling control. github.com/NVIDIA/OpenShe…

English

143

9.4K

NVIDIA AI@NVIDIAAI·23h

@JoeGuglielmucci 🙌

QME

Joe Guglielmucci@JoeGuglielmucci·23h

@NVIDIAAI Nemoclaw obviously. It's insane to even be testing anything else.

English

NVIDIA AI@NVIDIAAI·23h

@sambommakanti awesome!

English

Sampath Bommakanti@sambommakanti·23h

@NVIDIAAI I just built my home robot to use VSLAM and navigate the environment autonomously. Based on NVIDIA Jetson Orin Nano super dev kit and RGB-D camera. Next to pure voice interaction capability through LLM and NemoClaw to get it to do things and interact with users 💪🏽

English

124

NVIDIA AI@NVIDIAAI·23h

Build Video Analytics AI Agents with Skills x.com/i/broadcasts/1…

English

3.4K

NVIDIA AI@NVIDIAAI·1d

@sudoingX great to see you using it 💚

English

792

Sudo su@sudoingX·1d

nobody is talking about how good nemotron 3 nano omni 30b-a3b actually is on local. very underrated. multimodal, reasoning, video understanding, image vision, all shipped in one open source release by nvidia. moe architecture 30b total params, 3b active per token, q8 is near lossless and fits comfortably on a single dgx spark with room to breathe. i have been running it for weeks now and the gap between what this model can do and what the conversation says is wide. nvidia is pushing hard on the open-source front. most builders haven't noticed yet because the discourse is locked on closed-source frontier benchmarks and the next viral chart. meanwhile this thing handles agentic loops, processes video inputs, reasons across image context, and stays responsive on consumer tier unified memory hardware. on dgx spark it flies. more content coming, showing all the modalities in action. if you have used it, what is your experience. drop your stack and your findings, curious what other builders are seeing across hardware tiers.

English

236

13.3K

NVIDIA AI@NVIDIAAI·1d

@poolsideai @nvidia @PrimeIntellect @huggingface This will be great! Can't wait to see what people build

English

728

poolside@poolsideai·1d

Poolside is hosting a 2-day model research hackathon in London. Join us to push an open-weight agent model as far as you can. RL and fine-tune Laguna XS.2, our latest-generation model, on Prime Intellect Lab. Dates: May 29–30 Partners: @nvidia + @PrimeIntellect + @huggingface Prize: NVIDIA DGX Spark Agents need better models. Better models need cracked researchers. Link below.

English

210

77.6K

Keşfet

@adcock_brett @HermesAgentTips @0xSero @togethercompute @CrusoeAI @lmsysorg @nvidia @xqybyt