Oleksii Kuchaiev

904 posts

Oleksii Kuchaiev

@kuchaev

Sr. Director of AI model post-training @NVIDIA

in the cloud Katılım Şubat 2010

1.2K Takip Edilen3.8K Takipçiler

Sabitlenmiş Tweet

Oleksii Kuchaiev@kuchaev·11 Mar

1/4 We see no wall in post-training. Scaling RL software, infra, and data keeps yielding major capability gains. We trained across 30 RL environments with up to 4,000 instances per batch — math, code, STEM, agentic tool use, SWE, terminal, safety — all in a unified multi-environment RLVR setup.

English

289

78.2K

Oleksii Kuchaiev retweetledi

NVIDIA AI@NVIDIAAI·22h

@xeophon @arcee_ai Open > closed

English

137

1.4K

93.6K

Oleksii Kuchaiev retweetledi

Kate from Kharkiv@BohuslavskaKate·1d

Classmates and friends lay flowers at a makeshift memorial outside the collapsed apartment building. A Russian attack killed 24 people here, including 3 children.

Kate from Kharkiv@BohuslavskaKate

In Kyiv, the search-and-rescue operation at the site of the Russian strike on a residential building has been completed. The attack killed 24 people, including 3 children‼️ 24! Bastards. Bloody terrorists.

English

300

5.1K

Oleksii Kuchaiev retweetledi

Vincent Weisser@vincentweisser·3d

We are open sourcing renderers For RL, the inference server should be simple Tokens in, tokens out renderers is the token-level chat templating layer to >render messages to tokens >parse completions to structure >bridge rollouts byte-for-byte > >3x throughput on openmodels

Prime Intellect@PrimeIntellect

Introducing Renderers RL trainers work in tokens. Environments work in messages. Going back and forth corrupts sampled tokens, wasting compute on every agentic turn. With Renderers, we fix this mismatch. This unlocks >3x throughput on popular open models.

English

120

10K

Oleksii Kuchaiev@kuchaev·7 May

This is what modern russia is

BBC News (World)@BBCWorld

Russia ignores Ukraine's unilateral ceasefire and attacks kindergarten bbc.in/4tVsXiw

English

375

Oleksii Kuchaiev retweetledi

Prime Intellect@PrimeIntellect·7 May

Lab is launching with self-serve support for models from Nvidia, OpenAI, Meta, Qwen, with more coming soon. Models range from 1B to 400B parameters covering both dense and MoE architectures, reasoning and non-reasoning modes, and text and image modalities.

English

157

33K

Oleksii Kuchaiev@kuchaev·3 May

@Scobleizer Build in AI Applications layer, on top of open models.

English

208

Robert Scoble@Scobleizer·3 May

Open Source's big problem. Last night I went to a Y Combinator party in San Francisco and met an entrepreneur who is making a top Open Source AI model. He told me it is very hard to make money in open source. Yeah, it is cool being popular, he told me, but figuring out how to make a business out of it is proving to be very difficult. The Chinese are pounding the price into the ground with their open source models. Which makes it tough. In the old world of Open Source you could make money with them by consulting, service, etc, like RedHat did. But in this new world, he told me, it's much harder to make a good business out of it. Is anyone making a good business out of open source? What would your advice be to the businesses that are trying to support Open Source?

English

211

539

101.8K

Oleksii Kuchaiev retweetledi

Alex Ziskind@digitalix·28 Nis

New model dropped today. This is a fast one!

LM Studio@lmstudio

Nemotron 3 Nano Omni is now in LM Studio! A new 30B multi-modal MoE from @nvidia Supports Image input, reasoning, and tool use Requires ~25GB to run locally 🔥🚀 lmstudio.ai/models/nemotro…

English

135

20.2K

Oleksii Kuchaiev retweetledi

NVIDIA AI@NVIDIAAI·28 Nis

Meet Nemotron 3 Nano Omni 👋 Our latest addition to the Nemotron family is the highest efficiency, open multimodal model with leading accuracy. 30B parameters. 256K context length. 🧵👇

English

188

1.3K

447.7K

Oleksii Kuchaiev@kuchaev·28 Nis

HF Article huggingface.co/blog/nvidia/ne… Tech Report research.nvidia.com/labs/nemotron/…

195

Oleksii Kuchaiev@kuchaev·28 Nis

Nemotron 3 Nano Omni model is now available. Text, Image, Video and Audio as input with Text as output. As all models from Nemotron 3 family, fully open (weight, base models, data and code) and extremely efficient.

English

862

Oleksii Kuchaiev retweetledi

Bryan Catanzaro@ctnzr·28 Nis

Today we're releasing Nemotron 3 Nano Omni. Audio, Video, Image, Text ➡️ Text Ask questions about all your data. Amazing efficiency powered by the Nemotron Hybrid SSM MoE architecture. State of the art multimodal intelligence.

English

352

25.8K

Oleksii Kuchaiev retweetledi

Governor Newsom Press Office@GovPressOffice·27 Nis

Trump is setting America back — not putting America First, as he promised. We will lose the future because of his reckless, harmful actions.

News from Science@NewsfromScience

U.S. President Donald Trump has fired all 24 members of the National Science Board, the body that oversees the National Science Foundation. Many science advocates see it as the latest step by his administration to erode—some would say destroy—the independence of the 76-year-old research agency. scim.ag/4eGM0YS

English

436

1.3K

5.9K

122.5K

Oleksii Kuchaiev@kuchaev·24 Nis

@WSJ I hope Apple gets their AI strategy together and creates cohesive AI agent a experience across their device ecosystem. A much better experience is possible than talking to your AI agent running on Mac via Telegram messenger on iPhone.

English

121

The Wall Street Journal@WSJ·23 Nis

Apple’s new CEO John Ternus is a hardware expert who must help the company catch up in the AI race as it looks for its next big hit. 🔗 on.wsj.com/4tvNf1Q

English

422

45K

Oleksii Kuchaiev@kuchaev·24 Nis

@tedlieu You are correct. There is nothing "conscious" about current AI. And there is no universaly agreed definition of consciousness.

English

Ted Lieu@tedlieu·24 Nis

My take: linear algebra equations will never be conscious. Random number generators will never be conscious. At a very basic level, AI is math. AI can act like it is conscious, but it will never be conscious. And adding more math to AI models doesn’t make it any more conscious.

Antonio Lupetti@antoniolupetti

AI and Consciousness. There’s a lot of debate around AI and whether consciousness could emerge from systems like LLMs. It’s a natural question, given how well these models simulate language and reasoning. This Google paper challenges the idea that consciousness could arise from computation alone. The key point is that computation is a description, a map we assign to physical states, not something that exists intrinsically in matter, and a map (no matter how precise) is never the territory in any real sense. So increasing complexity isn’t enough to generate consciousness. We may get more and more convincing simulations, but that doesn’t imply the emergence of actual conscious experience. deepmind.google/research/publi…

English

331

525

74.9K

Oleksii Kuchaiev@kuchaev·24 Nis

@polynoamial No one needs to die on that hill because this is obviously true.

English

515

Noam Brown@polynoamial·23 Nis

A hill that I will die on: with today's AI models, intelligence is a function of inference compute. Comparing models by a single number hasn't made sense since 2024. What matters is intelligence per token or per $. This is especially true when using it in a product like Codex.

Lisan al Gaib@scaling01

The GPT-5.5 model family completely dominates the cost-performance frontier on the Artificial Analysis Index

English

1.3K

126.1K

Oleksii Kuchaiev retweetledi

will brown@willccbb·11 Nis

gonna be chatting w @danielhanchen + the @nvidia nemotron team next week + talking RL come hang out :)

NVIDIA AI Developer@NVIDIAAIDev

✨ New livestream: How to implement reinforcement learning effectively? @PrimeIntellect & @UnslothAI join us for a deep-dive into RL with Nemotron — covering GRPO, RLVR, and building memory-efficient RL pipelines for domain-specific training. 🗓️ Apr 14 | 11am PT → nvda.ws/3Q1GWEh Bring your questions for the live Q&A, and share in comments.

English

153

13.4K

Oleksii Kuchaiev@kuchaev·7 Nis

@tedlieu @thejointstaff Thank you, @tedlieu, for speaking up

English

215

Ted Lieu@tedlieu·7 Nis

Dear @thejointstaff: The UCMJ and federal law prohibit war crimes. Obviously eradicating a whole civilization constitutes a war crime. You must disobey that order. If you commit war crimes, the next Administration will prosecute you.

The New York Times@nytimes

Breaking News: President Trump warned that a “whole civilization will die tonight” if Iran does not meet his deadline to open the Strait of Hormuz. nyti.ms/3OlVQES

English

401

2.3K

6.7K

220K

Oleksii Kuchaiev@kuchaev·4 Nis

@hwchase17 Continual learning is a model-harness co-design problem.

English

141

Harrison Chase@hwchase17·4 Nis

most people thinking of continual learning as happening at the model level but with agents - there's actually three different levels you could "learn" at: - model - harness - context

Harrison Chase@hwchase17

x.com/i/article/2040…

English

475

58.5K

Oleksii Kuchaiev retweetledi

Chris 🇨🇦@llm_wizard·4 Nis

While the whole timeline laments about Claude sub being reserved for Claude things - let me remind you that Nemotron 3 Super is a fine model and you can do whatever you'd like with it because you own the weights! And if you want a guide on how to hook it into OpenClaw (plus safety bells and whistles) check out NemoClaw! github.com/NVIDIA/NemoClaw

English

3.5K

Oleksii Kuchaiev retweetledi

NASA@NASA·2 Nis

Liftoff. The Artemis II mission launched from @NASAKennedy at 6:35pm ET (2235 UTC), propelling four astronauts on a journey around the Moon. Artemis II will pave the way for future Moon landings, as well as the next giant leap — astronauts on Mars.

English

3.8K

55.5K

178.7K

14.2M

Keşfet

@xeophon @arcee_ai @Scobleizer @WSJ @tedlieu @polynoamial @danielhanchen @nvidia