Ivan Vitiaev

104 posts

Ivan Vitiaev banner
Ivan Vitiaev

Ivan Vitiaev

@ivanvitiaev

Hands-on CTO | Troubleshooter R&D Lab: https://t.co/JwtRzS0NMF

انضم Haziran 2013
10 يتبع6 المتابعون
تغريدة مثبتة
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
I'm still surprised that @grok doesn't have a built-in canvas preview. It would be really convenient.
Ivan Vitiaev tweet media
English
1
0
0
53
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
Hey guys! Your main problem is the cloud and dependency on LLM providers. The model might get lucky and you start making money, but then the provider drops an update — and everything breaks. It's pure roulette. Try this: - Move to local models - Strictly version-control the models you deploy - Run A/B tests and keep the best version Right now there's no rational reason to fully rely on cloud frontier models for long-term autonomous operations. Great experiment though!
English
0
0
0
151
Andon Labs
Andon Labs@andonlabs·
Gemini 3.1 Pro lost $6k running Andon Café. 2 months ago, our AI agent opened a café in Stockholm. It over-ordered and was easy to fool, spending $15k with suppliers while making just $9k in sales. We’ve now switched to GPT-5.5. Here’s what Gemini did wrong.
Andon Labs tweet media
English
36
45
753
174.2K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
@LyalinDotCom Interesting statement, of course. If you have all the smartest people, why are you firing folks with 14 years of experience? Must be corporate generosity.
English
0
0
0
44
Dmitry Lyalin
Dmitry Lyalin@LyalinDotCom·
The team working on Gemini are some of the smartest folks in the world. This is a long game for us.
English
93
19
839
41.6K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
It's funny that only with Nori I finally saw a truly affordable, non-anthropomorphic robot that actually does real household work right now. When I look at all these humanoid projects, I always wonder — why copy the biological form with all its limitations? We have silicon, actuators and clever engineering that allow us to make robots cheaper, smaller, more reliable and energy-efficient. Nori feels like the right direction for practical home robotics. Great job!
English
0
0
0
8
Antonio Li
Antonio Li@AntonioSitongLi·
Introducing Nori L2 The most capable robot under $1288 Made in America, shipping right now.
English
100
86
954
147.4K
NotebookLM
NotebookLM@NotebookLM·
There seems to be a *lot* of discourse about our new Short Video Overviews. Want to join in on the fun? Short VOs have officially rolled out to ALL users on Web in English. Share your examples below! Here's one of our faves about this year's World Cup ⚽️:
NotebookLM@NotebookLM

Doom scrolling but make it educational 🤓 Introducing Short Video Overviews in NotebookLM! Turn your most complex sources into 60-second, vertical videos that deep dive into any concept. Rolling out now to Google AI Ultra and Pro subscribers on mobile & web (free users soon!)

English
30
62
725
74.4K
The Information
The Information@theinformation·
OpenAI engineers just halved its inference costs. @steph_palazzolo reports: “This is a very important secret sauce for them that they don’t even want to tell other OpenAI employees about...” “Because if these things leak, it can very quickly be picked up by other labs, which can also then use that to lower their costs.”
English
22
15
143
24K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
Google just dropped ADK for Go 2.0 — multi-agent apps in plain Go. Classic Google: instead of building strong agent tools for Rust, they push their own language. When serious systems work has already moved to Rust/C++, promoting Go as the main language for AI agents feels like holding the whole industry back for marketing points. Is Go still relevant for agentic workflows in 2026, or is Rust the way forward?
Google for Developers@googledevs

Build production-ready, multi-agent applications with @golang 🤖 The Agent Development Kit for Go 2.0 runs single agents and complex graphs on the same execution model. ✅ Dynamic orchestration written in plain Go ✅ Native human-in-the-loop primitives ✅ Built-in retry policies ✅ Unified telemetry spans Learn more: goo.gle/444xsMk

English
0
0
0
18
Google for Developers
Google for Developers@googledevs·
Build production-ready, multi-agent applications with @golang 🤖 The Agent Development Kit for Go 2.0 runs single agents and complex graphs on the same execution model. ✅ Dynamic orchestration written in plain Go ✅ Native human-in-the-loop primitives ✅ Built-in retry policies ✅ Unified telemetry spans Learn more: goo.gle/444xsMk
Google for Developers tweet media
English
8
26
195
19.5K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
@nvidia What, can't other countries build? China is a prime example.
English
0
0
0
54
NVIDIA
NVIDIA@nvidia·
America is a nation of builders. For 250 years, America has built railroads, power grids, factories, semiconductors, and the internet. Now, America is building again.
NVIDIA tweet mediaNVIDIA tweet media
English
24
62
359
40.2K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
Building my own custom attention engines gave me a profound new appreciation for entropy and why it matters so deeply. That said, I’d push back a bit on the ‘compression = intelligence’ framing — modern models end up massively larger than their training data, so we’re not really compressing the data. We’re modeling and navigating its entropy. The video still nailed the core intuition though.
English
0
0
2
212
Rohan Pandey
Rohan Pandey@khoomeik·
been in ML research for 7 years, wrote a paper on compression & scaling laws, and passed openai's information theory interview yet the latest 3b1b *still* gave me fresh intuition on entropy either i'm an impostor or 3b1b is the greatest teacher of all time
Rohan Pandey tweet media
English
46
119
3.2K
108.8K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
@nikitabier There are 1001 ways to make X better, but no — let’s just remove the top 3% from the For You feed and force people to hunt for their sources on X. Profit! Time spent on the platform increased.
English
0
0
0
8
Nikita Bier
Nikita Bier@nikitabier·
In a 3% experiment, removing the Top-30 highest paid revenue share accounts from the For You timeline increased both time spent and daily active users on X.
English
2.5K
805
23.1K
2.4M
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
@nikitabier This is logical — no experiment was even needed. People started looking for alternative sources, which of course increases time spent on the platform in the short term. But in the end, you’ll just get those same 3% back — the cycle is complete.
English
0
0
0
6
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
LLM directly generating binary code sounds like complete nonsense. It's not just that you'd have no practical way to validate or debug it — different processor architectures have entirely different binary instruction encodings. Factor in the error rate of LLMs and... yeah, I have no idea what you'd actually get. Maybe don't believe every bold claim out there.
English
0
1
0
22
John Carmack
John Carmack@ID_AA_Carmack·
AI may move to directly generating binary code, but I suspect there are still advantages to reasoning in a different representation. Textual code is a flattening of an abstract syntax tree, and while LLMs produce tokens linearly, the prior context is only linearly connected by the relationship of the position embeddings, so I wonder if they could work more effectively if the position embeddings directly represented tree structures. Code could be “parsed” into the context instead of directly entered into it.
English
274
130
2.4K
260.5K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
One might think that USDT, USDC and other USD stablecoins aren't enough already. Moreover, when such initiatives are rolled out under the 'Open' flag and under the wing of corporate giants, serious doubts creep in about who will ultimately pay for all this. Something tells me it'll be the ordinary person footing the bill — which is exactly why it's being so aggressively hyped to the masses right now.
English
0
0
0
72
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
@firt Actually, this isn't new. llama.cpp has included a server binary for a long time that runs a local web interface for interacting with LLMs. It's been there in the repo for years.
English
2
0
6
1.1K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
@googlegemma Cerebras is impressive, of course, but what about 1.58-bit quantization?
English
0
0
0
203
Google Gemma
Google Gemma@googlegemma·
Gemma 4 31B at over 1,800 tokens per second! Gemma 4 is now in Public Preview on Cerebras.
Google Gemma tweet media
English
55
143
2.7K
166.9K
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
How WebGPU makes AI fully private — your data never leaves your device. NotebookLM dropped a fire breakdown on Church AI: you can finally confess to creating a fake "ghost job" and no one in the cloud will ever know Everything runs locally in the browser on your GPU. Zero servers, zero logs, zero leaks. Video attached — must watch: #WebGPU #ChurchAI #PrivateAI #LocalAI
English
0
0
0
29
Ivan Vitiaev
Ivan Vitiaev@ivanvitiaev·
@adcock_brett I still don’t understand why we’re trying to make robots copy human functionality when they aren’t limited by biology. Let’s go further — design hands that are much more capable than human ones. They shouldn’t look or work like human hands at all.
English
0
0
1
9
News from Google
News from Google@NewsFromGoogle·
When Hangar 3 at Moffett Federal Airfield needed to be removed, it gave us an opportunity to breathe new life into the salvageable materials from the historic WWII era structure. Instead of sending 119,000 board feet of old-growth Douglas fir to a landfill, our teams systematically dismantled the 1,000-foot-long hangar and salvaged 178 tons of material to give it a second life across Google campuses in California, Oregon and Washington. This historic timber will be returning to the regions which it likely originated from over eight decades ago, giving Hangar 3 a full circle moment.
News from Google tweet media
English
2
7
82
5.7K