Ivan Vitiaev

104 posts

Ivan Vitiaev

@ivanvitiaev

Hands-on CTO | Troubleshooter R&D Lab: https://t.co/JwtRzS0NMF

انضم Haziran 2013

10 يتبع6 المتابعون

تغريدة مثبتة

Ivan Vitiaev@ivanvitiaev·2d

I'm still surprised that @grok doesn't have a built-in canvas preview. It would be really convenient.

English

Ivan Vitiaev@ivanvitiaev·10h

Hey guys! Your main problem is the cloud and dependency on LLM providers. The model might get lucky and you start making money, but then the provider drops an update — and everything breaks. It's pure roulette. Try this: - Move to local models - Strictly version-control the models you deploy - Run A/B tests and keep the best version Right now there's no rational reason to fully rely on cloud frontier models for long-term autonomous operations. Great experiment though!

English

151

Andon Labs@andonlabs·13h

Gemini 3.1 Pro lost $6k running Andon Café. 2 months ago, our AI agent opened a café in Stockholm. It over-ordered and was easy to fool, spending $15k with suppliers while making just $9k in sales. We’ve now switched to GPT-5.5. Here’s what Gemini did wrong.

English

753

174.2K

Ivan Vitiaev@ivanvitiaev·11h

@LyalinDotCom Interesting statement, of course. If you have all the smartest people, why are you firing folks with 14 years of experience? Must be corporate generosity.

English

Dmitry Lyalin@LyalinDotCom·14h

The team working on Gemini are some of the smartest folks in the world. This is a long game for us.

English

839

41.6K

Ivan Vitiaev@ivanvitiaev·11h

It's funny that only with Nori I finally saw a truly affordable, non-anthropomorphic robot that actually does real household work right now. When I look at all these humanoid projects, I always wonder — why copy the biological form with all its limitations? We have silicon, actuators and clever engineering that allow us to make robots cheaper, smaller, more reliable and energy-efficient. Nori feels like the right direction for practical home robotics. Great job!

English

Antonio Li@AntonioSitongLi·1d

Introducing Nori L2 The most capable robot under $1288 Made in America, shipping right now.

English

100

954

147.4K

Ivan Vitiaev@ivanvitiaev·12h

@NotebookLM And another one x.com/ivanvitiaev/st…

Ivan Vitiaev@ivanvitiaev

How WebGPU makes AI fully private — your data never leaves your device. NotebookLM dropped a fire breakdown on Church AI: you can finally confess to creating a fake "ghost job" and no one in the cloud will ever know Everything runs locally in the browser on your GPU. Zero servers, zero logs, zero leaks. Video attached — must watch: #WebGPU #ChurchAI #PrivateAI #LocalAI

English

Ivan Vitiaev@ivanvitiaev·12h

@NotebookLM Here's my video! youtube.com/shorts/PTzHmgq…

YouTube

English

NotebookLM@NotebookLM·1d

There seems to be a *lot* of discourse about our new Short Video Overviews. Want to join in on the fun? Short VOs have officially rolled out to ALL users on Web in English. Share your examples below! Here's one of our faves about this year's World Cup ⚽️:

NotebookLM@NotebookLM

Doom scrolling but make it educational 🤓 Introducing Short Video Overviews in NotebookLM! Turn your most complex sources into 60-second, vertical videos that deep dive into any concept. Rolling out now to Google AI Ultra and Pro subscribers on mobile & web (free users soon!)

English

725

74.4K

Ivan Vitiaev@ivanvitiaev·12h

@theinformation @steph_palazzolo Secret sauce - Made in China 🤫

English

The Information@theinformation·1d

OpenAI engineers just halved its inference costs. @steph_palazzolo reports: “This is a very important secret sauce for them that they don’t even want to tell other OpenAI employees about...” “Because if these things leak, it can very quickly be picked up by other labs, which can also then use that to lower their costs.”

English

143

24K

Ivan Vitiaev@ivanvitiaev·15h

Google just dropped ADK for Go 2.0 — multi-agent apps in plain Go. Classic Google: instead of building strong agent tools for Rust, they push their own language. When serious systems work has already moved to Rust/C++, promoting Go as the main language for AI agents feels like holding the whole industry back for marketing points. Is Go still relevant for agentic workflows in 2026, or is Rust the way forward?

Google for Developers@googledevs

Build production-ready, multi-agent applications with @golang 🤖 The Agent Development Kit for Go 2.0 runs single agents and complex graphs on the same execution model. ✅ Dynamic orchestration written in plain Go ✅ Native human-in-the-loop primitives ✅ Built-in retry policies ✅ Unified telemetry spans Learn more: goo.gle/444xsMk

English

Ivan Vitiaev@ivanvitiaev·15h

@googledevs @golang Why do we need Golang? We should just use Rust right away.

English

115

Google for Developers@googledevs·1d

English

195

19.5K

Ivan Vitiaev@ivanvitiaev·16h

@nvidia What, can't other countries build? China is a prime example.

English

NVIDIA@nvidia·16h

America is a nation of builders. For 250 years, America has built railroads, power grids, factories, semiconductors, and the internet. Now, America is building again.

English

359

40.2K

Ivan Vitiaev@ivanvitiaev·22h

Building my own custom attention engines gave me a profound new appreciation for entropy and why it matters so deeply. That said, I’d push back a bit on the ‘compression = intelligence’ framing — modern models end up massively larger than their training data, so we’re not really compressing the data. We’re modeling and navigating its entropy. The video still nailed the core intuition though.

English

212

Rohan Pandey@khoomeik·1d

been in ML research for 7 years, wrote a paper on compression & scaling laws, and passed openai's information theory interview yet the latest 3b1b *still* gave me fresh intuition on entropy either i'm an impostor or 3b1b is the greatest teacher of all time

English

119

3.2K

108.8K

Ivan Vitiaev@ivanvitiaev·23h

@nikitabier There are 1001 ways to make X better, but no — let’s just remove the top 3% from the For You feed and force people to hunt for their sources on X. Profit! Time spent on the platform increased.

English

Nikita Bier@nikitabier·23h

In a 3% experiment, removing the Top-30 highest paid revenue share accounts from the For You timeline increased both time spent and daily active users on X.

English

2.5K

805

23.1K

2.4M

Ivan Vitiaev@ivanvitiaev·23h

@nikitabier This is logical — no experiment was even needed. People started looking for alternative sources, which of course increases time spent on the platform in the short term. But in the end, you’ll just get those same 3% back — the cycle is complete.

English

Ivan Vitiaev@ivanvitiaev·23h

LLM directly generating binary code sounds like complete nonsense. It's not just that you'd have no practical way to validate or debug it — different processor architectures have entirely different binary instruction encodings. Factor in the error rate of LLMs and... yeah, I have no idea what you'd actually get. Maybe don't believe every bold claim out there.

English

John Carmack@ID_AA_Carmack·1d

AI may move to directly generating binary code, but I suspect there are still advantages to reasoning in a different representation. Textual code is a flattening of an abstract syntax tree, and while LLMs produce tokens linearly, the prior context is only linearly connected by the relationship of the position embeddings, so I wonder if they could work more effectively if the position embeddings directly represented tree structures. Code could be “parsed” into the context instead of directly entered into it.

English

274

130

2.4K

260.5K

Ivan Vitiaev@ivanvitiaev·1d

One might think that USDT, USDC and other USD stablecoins aren't enough already. Moreover, when such initiatives are rolled out under the 'Open' flag and under the wing of corporate giants, serious doubts creep in about who will ultimately pay for all this. Something tells me it'll be the ordinary person footing the bill — which is exactly why it's being so aggressively hyped to the masses right now.

English

Matt Huang@matthuang·1d

Very excited to see this new effort from Stripe, Visa, Coinbase, Mastercard, Amex, Blackrock, and many others to build a new open stablecoin that shares economics back to users and distributors. OpenUSD will be natively issued on Tempo on day 1!

Open Standard@openstandard

Introducing Open USD: a stablecoin built for the internet economy, designed by the businesses growing it. joinopenstandard.com/blog/introduci…

English

519

89.1K

Ivan Vitiaev@ivanvitiaev·1d

@firt Actually, this isn't new. llama.cpp has included a server binary for a long time that runs a local web interface for interacting with LLMs. It's been there in the repo for years.

English

1.1K

Maximiliano Firtman@firt·1d

Claude Science opens a web server in your computer and run as a local web app. That's new for AI tools.

Claude@claudeai

Introducing Claude Science, a new app designed with every stage of research in mind. Artifacts traced to their code, environments managed on demand, and 60+ optional scientific databases that you can connect. Available now in beta.

English

462

46.6K

Ivan Vitiaev@ivanvitiaev·1d

@googlegemma Cerebras is impressive, of course, but what about 1.58-bit quantization?

English

203

Google Gemma@googlegemma·1d

Gemma 4 31B at over 1,800 tokens per second! Gemma 4 is now in Public Preview on Cerebras.

English

143

2.7K

166.9K

Ivan Vitiaev@ivanvitiaev·1d

English

Ivan Vitiaev@ivanvitiaev·1d

@AndrewCurran_ OpenAI finally started using Chinese models😂

English

Andrew Curran@AndrewCurran_·1d

OpenAI has found a way to cut inference costs in half.

Stephanie Palazzolo ✈️ ICML@steph_palazzolo

OpenAI engineers earlier this month developed an optimization that cut inference costs in half for models it was applied to. After the optimization was applied to logged-out ChatGPT traffic, it reduced the number of GPUs needed to power that traffic to a couple hundred.

English

1.6K

217.4K

Ivan Vitiaev@ivanvitiaev·1d

@adcock_brett I still don’t understand why we’re trying to make robots copy human functionality when they aren’t limited by biology. Let’s go further — design hands that are much more capable than human ones. They shouldn’t look or work like human hands at all.

English

Brett Adcock@adcock_brett·2d

We’re on generation 7 of our hand design, they’re so hard! Our latest design is so human-like it looks like a person wearing a glove

The Humanoid Hub@TheHumanoidHub

- Figure started from absolute scratch just four years ago today - Rapidly iterated vertically integrated platforms for hardware and AI - This wouldn't happen without a high level of grit and relentless engineering Next four years are gonna be crazy interesting to watch.

English

1.3K

86K

Ivan Vitiaev@ivanvitiaev·1d

@NewsFromGoogle I don’t often praise Google, but this is genuinely impressive.

English

News from Google@NewsFromGoogle·2d

When Hangar 3 at Moffett Federal Airfield needed to be removed, it gave us an opportunity to breathe new life into the salvageable materials from the historic WWII era structure. Instead of sending 119,000 board feet of old-growth Douglas fir to a landfill, our teams systematically dismantled the 1,000-foot-long hangar and salvaged 178 tons of material to give it a second life across Google campuses in California, Oregon and Washington. This historic timber will be returning to the regions which it likely originated from over eight decades ago, giving Hangar 3 a full circle moment.

English

5.7K

اكتشف

@LyalinDotCom @NotebookLM @theinformation @steph_palazzolo @googledevs @golang @nvidia @nikitabier