NVIDIA AI

12.8K posts

@NVIDIAAI

Teaching your AI new tricks.

Santa Clara, CA · Joined June 2016

873 Following · 298K Followers

NVIDIA AI@NVIDIAAI·8m

@MichaelGannotti @Shaughnessy119 ⏳️

Mike Gannotti@MichaelGannotti·16m

@Shaughnessy119 I am really hoping we see some killer open source LLMs come out from @NVIDIAAI AI soon that are frontier level challenging models

Tommy@Shaughnessy119·8h

As Chinese AI models go closed source, and all U.S. frontier models are closed, there is a massive opportunity for a western open source AI Lab. A future lack of competitive open source AI models is a net negative for humanity, low cost intelligence and sovereignty

NVIDIA AI reposted

Xuanchi Ren@xuanchi13·11h

The latent-vs-pixel debate misses the point. GPT Image 2 shows what users notice: pixel-level fidelity. Latent models show what scales: compact semantic structure. We connect them by replacing VAE/RAE decoders with a Pixel Diffusion Decoder. Code and Model available: research.nvidia.com/labs/sil/proje… 🧵(1/N)

NVIDIA AI@NVIDIAAI·8h

From the Lab: Text Diffusion and Elastic Reasoning | Nemotron Labs x.com/i/broadcasts/1…

NVIDIA AI@NVIDIAAI·11h

@tmophoto 🔥

tmo@tmophoto·11h

A month ago i discovered the true power of the DGX spark by @NVIDIAAI Its nice to see everyone is getting on the concurrency train now. This is old but it still applies to almost every model i have tested on the spark. when you get above 8-16 requests things really start slowing down from prompt processing but these are lightning fast around 8 concurrent requests.

tmo@tmophoto

Blackwell's NVFP4 is cooking 🔥 I have been trying to figure out ways to maximize the @NVIDIAAI Spark when using LLMs so i tried some concurrency tests. Now i need to figure out how to get an agent to take advantage of this.

Nemotron-3-Nano-Omni (30B-A3B-Reasoning) on DGX Spark GB10 @ 50k context, post-warmup: FP8 vs NVFP4 throughput (tok/s)
user/request at a time= 1: 39 → 47 (+21%)
user/request at a time= 4: 96 → 132 (+38%)
user/request at a time= 8: 178 → 198 (+11%) ← peak
user/request at a time=16: 171 → 199 (+16%)

Same concurrency curve (sweet spot at n=8, dip at 9-10, secondary peak at 16), but NVFP4 wins at every single level. Biggest gains in the moderate concurrency range (n=4-7: +31-38%). At peak, +11% throughput and noticeably snappier latency. Full benchmarks 👇
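The percentage gains quoted in the post can be re-derived from the raw throughput pairs. The following is a minimal sketch, not the author's benchmark script: the numbers are copied from the post, while the names `THROUGHPUT`, `percent_gain`, and `gains` are hypothetical helpers introduced only for illustration.

```python
# Throughput figures (tok/s) copied from the post: concurrency -> (FP8, NVFP4).
# The dictionary and function names are illustrative, not from the original benchmark.
THROUGHPUT = {1: (39, 47), 4: (96, 132), 8: (178, 198), 16: (171, 199)}


def percent_gain(baseline: float, improved: float) -> float:
    """Relative gain of `improved` over `baseline`, in percent."""
    return (improved - baseline) / baseline * 100.0


def gains(table: dict) -> dict:
    """Map each concurrency level to the NVFP4-over-FP8 gain in percent."""
    return {n: percent_gain(fp8, nvfp4) for n, (fp8, nvfp4) in table.items()}


if __name__ == "__main__":
    for n, gain in gains(THROUGHPUT).items():
        fp8, nvfp4 = THROUGHPUT[n]
        print(f"n={n:>2}: {fp8} -> {nvfp4} tok/s ({gain:+.1f}%)")
```

Rounded to whole percentages, the computed gains should agree with the figures in the post; the n=4-7 range is not checked here because only the n=4 data point is given.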

NVIDIA AI@NVIDIAAI·12h

@aijoey @nvidia @NVIDIAAIDev Well done @aijoey! 🙌

Joey@aijoey·12h

15 concurrent terminal workloads on a local DGX Spark, all served by nvidia/nemotron-3-super 120B A12B NVFP4 through vLLM. 15/15 completed, 0 errors, 30.2s wall time: no fake dashboard, just local inference under concurrent load. fineprint: live local concurrency demo

NVIDIA AI@NVIDIAAI·12h

Good explainer on world models - well done @juliarturc

Julia Turc@juliarturc

"World models" is one of the buzziest yet ambiguous terms in AI right now. I started this video with many questions:
- How are they different from video generation?
- Can they do more than AI slop?
- Can LeCun be trusted given that he wears knee-high white socks?

Many thanks to @tjgalda and @NVIDIAAI for helping me answer (most) of these questions!

NVIDIA AI@NVIDIAAI·1d

@IntelFactorAI @flir 🙌

Intelfactor AI@IntelFactorAI·1d

@NVIDIAAI @flir wiko.com.hk 😁

Intelfactor AI@IntelFactorAI·1d

3 million units of kitchen knives, fast lead times, low defect rates? This is how factory AI actually works. @NVIDIAAI + @flir spotting defects on knife blades in real time with our API - you can have a fleet.

Intelfactor AI@IntelFactorAI

1/ We just installed IntelFactor at a knife factory in an enterprise factory.
2/ Real-time pass/fail on every blade.
3/ Workers love the dashboard on their phones.
Video in next post ↓

NVIDIA AI@NVIDIAAI·1d

@AlicanKiraz0 🔥

Alican Kiraz@AlicanKiraz0·1d

@NVIDIAAI 🦾🔥

Alican Kiraz@AlicanKiraz0

2x DGX Spark - Minimax M2.7-MXFP4, vllm 42k context length 🔥 By the way, generation normally runs at 28-32 tok/s, but once the interleaved thinking overhead and RAM pressure stack on top of TTFT, total tok/sec drops to 15. With a 16k context length you get around 25-30 🔥
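One way to see how a steady generation rate near 30 tok/s can turn into a total of about 15 tok/s is a back-of-the-envelope model in which all overhead (TTFT, thinking, memory pressure) is treated as a fixed delay before tokens stream. This is a rough sketch under that assumption, not a measurement from the post: the function name, the 300-token response, and the 10-second delay are hypothetical values chosen only to illustrate the arithmetic.

```python
def effective_tok_per_s(n_tokens: int, gen_tok_per_s: float, overhead_s: float) -> float:
    """End-to-end throughput when a fixed overhead precedes steady generation.

    Simplifying assumption: total time = overhead + n_tokens / generation rate.
    """
    if n_tokens <= 0 or gen_tok_per_s <= 0 or overhead_s < 0:
        raise ValueError("expected positive token count and rate, non-negative overhead")
    total_time_s = overhead_s + n_tokens / gen_tok_per_s
    return n_tokens / total_time_s


if __name__ == "__main__":
    # Hypothetical request: 300 output tokens at 30 tok/s with 10 s of overhead.
    print(effective_tok_per_s(300, 30.0, 10.0))
```

Under this model the end-to-end rate halves whenever the overhead equals the pure generation time, which is one plausible reading of the drop described above.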

NVIDIA AI@NVIDIAAI·1d

(2x DGX Sparks) + MiniMax M2.7 NVFP4 = 16 local AI agents running simultaneously 👀

mr-r0b0t@mr_r0b0t

16 local AI agents streaming at once! MiniMax M2.7 NVFP4 — 2x GB10, no cloud APIs.

NVIDIA AI@NVIDIAAI·1d

@cindy_x_wu 🙌

Xindi Wu@cindy_x_wu·1d

(Mini) career update: Just wrapped up an amazing year interning at NVIDIA Spatial Intelligence Lab, it's been a wonderful journey exploring the frontier of video generation and world models with so many brilliant people. Super grateful for the amazing mentors, colleagues, and friends who supported, inspired and believed in me throughout this chapter!

NVIDIA AI@NVIDIAAI·1d

@EtienneTRGC Nice! 🙌

Etienne.Uncensored.Heretic.127B.gguf@EtienneTRGC·1d

Welcome buddy... DGX

NVIDIA AI@NVIDIAAI·1d

@dgarieck 🙌

NVIDIA AI@NVIDIAAI·1d

@MattNiessner Face2Face was a legendary paper! You forgot to include that it got you in the NVIDIA booth at GTC 😉

Matthias Niessner@MattNiessner·2d

Always a pleasure to be back at Stanford! 🌲

It's been almost a decade since my four years here as a Visiting Professor. I was fortunate to work with so many world-class researchers who helped shape my career. Many of those same colleagues are now leading frontier labs all over the world.

Research-wise, it was a wild time. Deep learning was just starting to take over traditional computer vision, and generative methods were barely working (remember the early VAEs and GANs?). We were one of the groups pushing early 3D deep learning. It was exciting, but chaotic. There was no PyTorch, and even simple operators had to be hand-written in CUDA. Lots of fun!

Our main focus was 3D scene reconstruction and semantic scene understanding. For scanning, we used Microsoft Kinects with methods like Voxel Hashing or BundleFusion, which led to scene understanding works like ScanNet and Matterport3D. And, of course, 3D face reconstruction—which led to the legendary Face2Face paper by @JustusThies, got me on Jimmy Kimmel Live, and ultimately sparked the foundation of @synthesiaIO.

Coming back brings up so much nostalgia for those days before massive transformers took over. But beyond the research, a lot has evolved. On campus, new dorms have transformed the landscape, there's a shiny new data science building, and an incredible lineup of new CS faculty.

Palo Alto has changed, too. There’s a new bikeway on El Camino, and Cal Ave is now a pedestrian zone with many shops having changed. (Un)surprisingly, the Nuthouse didn't survive. The building sits empty like one of its many peanut shells. Still, it's a vibrant area with fantastic food, great to hang out after an intense day of research.

Die Luft der Freiheit weht!

NVIDIA AI@NVIDIAAI·1d

@DataChaz Thanks for sharing Charly - super cool work 🙌

Charly Wargnier@DataChaz·3d

NVIDIA JUST DROPPED AN OPEN-SOURCE WORLD MODEL THAT RUNS ON YOUR GAMING PC 🤯

Instead of melting your GPU, @NVIDIAAI’s new 2.6B-parameter model, SANA-WM, brings 60 seconds of controllable video straight to your laptop.

> Just feed it an image
> a prompt
> a camera path

.. and you can literally fly through the scene 🔥

It works on less than 8GB of VRAM and plugs right into tools you already use like ComfyUI and Diffusers.

The core advantages:
→ full 6-axis camera control
→ 36x faster than older open models
→ uses extreme 32x compression to stay tiny
→ upscale to 2K using the LTX2 refiner

Best part? It's 100% free and open-source (Apache-2.0 licensed)

repo link in 🧵↓

NVIDIA AI@NVIDIAAI·1d

@AiXsatoshi 💚

AI✖️Satoshi⏩️@AiXsatoshi·1d

Installed three GPUs at 600W each

NVIDIA AI@NVIDIAAI·2d

@firstadopter 💚

tae kim@firstadopter·3d

Imagine graduating college with a comp sci degree, announcing on LinkedIn that you got a job at Nvidia and having Dwight Diercks!?! leave a congratulations comment on your post.

NVIDIA AI@NVIDIAAI·2d

@GregEstes @twominutepapers @liu_mingyu 👀

Greg Estes@GregEstes·3d

@twominutepapers @liu_mingyu Two of my favorite people. Lots of brains and general awesomeness in one photo.

Two Minute Papers@twominutepapers·3d

Spent a bit of time learning from my friend and legendary researcher @liu_mingyu. Huge honor, thank you! Btw, he is hiring - more info below.

NVIDIA AI@NVIDIAAI·3d

@LyalinDotCom @NVIDIAAIDev 🙌

Dmitry Lyalin@LyalinDotCom·3d

Setting up a DGX Spark! Unboxing time. Setup is stupid easy. Plugin. Connect to its WiFi hotspot and use web UI to configure account, connect it to your home network and it updates the device for you. ❤️🔥

NVIDIA AI@NVIDIAAI·3d

@TheAhmadOsman ⏳️

Ahmad@TheAhmadOsman·3d

@NVIDIAAI When 👀👀👀

Ahmad@TheAhmadOsman·Apr 4

Models that I can not wait to run on my AI cluster at home in 2026
> MiniMax M3 (Multimodal)
> NVIDIA Nemotron 3 Ultra (~500B)
> Kimi K3
> DeepSeek V4

This is going to be the year of Local LLMs
