Max Sapo

803 posts

Max Sapo

@MaxHappyverse

Co-Founder & CEO @ Happyverse AI. Ex-Google TPU team & Etched. Stanford MBA/MS grad. Georgia Tech drop out. Dad.

San Francisco, CA Katılım Ocak 2023

765 Takip Edilen533 Takipçiler

Sabitlenmiş Tweet

Max Sapo@MaxHappyverse·12 Kas

We launched Happyverse 2.0 - a platform for building lifelike, real‑time Confidants (AI avatars + agents) that hold genuine conversations, stay tied to real data, and deliver measurable results for individuals and businesses. producthunt.com/products/happy… Please vote for us on Producthunt and share any feedback you might have! Also, huge thanks to our business partners for your support - @googlecloud (amazing Gemini team in particular - @DynamicWebPaige @triswarkentin @OfficialLoganK and others!), @pipecat_ai (@kwindla and others!), @elevenlabs (congrats with your own launch & summit!) and many others! P.S. Here is a Google Meet conversation with my real-time app.happyverse.ai/CEO Confidant and our CTO, Nicholas 🙂

San Francisco, CA 🇺🇸 English

30.8K

Max Sapo retweetledi

FlyMy.AI@FlyMy_AI·4 May

🚀Today we ship @FlyMy_AI Agents. The world's first all-in-one agentic cloud. The modern way to build, integrate, and scale production AI agents. 3 steps to a production agent: 1. Connect your work tools to FlyMy 2. Describe what the agent should do - in text or 5 lines of code 3. Set execution rules: manual, scheduled, or integrated into your backend 4. Done! Agent works on scale! Everything in one place: 800+ MCPs, hundreds of AI models, brain, memory, sandboxes. Stop building from scratch. Stop waiting for infra. Compress 6 months into a day. #1 on @ArtificialAnlys benchmarks. Stable, secure, scalable from day one. Try FlyMy.AI →

English

5.2K

Max Sapo@MaxHappyverse·24 Nis

Amin Vahdat is one of the most underrated leaders of our generation. His ability to zoom in / zoom out is just second-to-none.

Shanu Mathew@ShanuMathew93

Amin Vahdat live at Transition-AI 2026 with @CatalystPod. Google's Chief Technologist for AI Infrastructure - the man in charge of Google's $175–185B 2026 CapEx and the one who's said compute capacity needs to 2x every 6 months! Key takeaways: Training → Inference cascade: -Frontier GW training clusters have a ~1–2 year useful life; then capacity cycles to serving -Inference doesn't need GW scale — <100 MW can do useful work on large models -"Entering the age of inference" is now real as agents explode DC footprint -Speed-of-light latency starts to matter as models get faster — geography becomes UX + reliability -"a medium number of medium-sized data centers, augmented with a small number of large ones" (few large training clusters, inference can be mix of 10s to 100s of MW) Reliability reframe -4-nines isn't intrinsic: "we should be thinking about lower reliability power delivery overall" -Do you make the this trade: 4-nines at half capacity, or 2-nines (3.65 days downtime/yr) at 2x capacity? Customers "very often" pick 2x Behind the meter -Google actually prefers grid-connected capacity = BTM is a bridge, not a destination -BTM is about a different latency: time-to-delivery of capacity -Bridge sources: turbines, gas, mobile generation. Permanent: solar, wind, nuclear -Stranded BTM? "I'd love to have that problem." Energy is the limiter Co-design = capability -Google co-designs across Gemini ↔ software ↔ TPU ↔ rack ↔ DC ↔ power ↔ building -A few percent at each interface compounds into real advantage Bottlenecks -Won't force-rank chips/power/labor/EPC: "10am it's labor, noon it's power, 2pm it's chips — every day" -YoY efficiency is real: this year's capacity would've cost ~1.2x last year Building fungibility is dead, purpose-build is back -25-yr buildings vs. 5–6 chip generations. Disk rack vs. GPU rack = ~100x watts/sq ft and widening -Old world: build for fungibility (compute wasn't dominant cost). New world: "this is a GPU building, that's a TPU building" -Density wasn't maxed before because flexibility was worth more. That's flipped One of the most important pods of the year from an AI leader at one of the most important companies at the center of it all! Great job @shaylekann

English

162

Max Sapo@MaxHappyverse·24 Nis

@sundeep 1M+ TPU 8t though, slightly more than 960K A5Xs :)

English

sunny madra@sundeep·23 Nis

You want scale: 960,000 NVIDIA Rubin GPUs in a multisite cluster “At Google Cloud Next, Google announced A5X powered by NVIDIA Vera Rubin NVL72 rack-scale systems, which — through extreme codesign across chips, systems and software — deliver up to 10x lower inference cost per token and 10x higher token throughput per megawatt than the prior generation. A5X will use NVIDIA ConnectX-9 SuperNICs, combined with next-generation Google Virgo networking, scaling to up to 80,000 NVIDIA Rubin GPUs within a single site cluster and up to 960,000 NVIDIA Rubin GPUs in a multisite cluster, enabling customers to run their largest AI workloads on NVIDIA‑optimized infrastructure” blogs.nvidia.com/blog/google-cl…

English

108

8.7K

Max Sapo@MaxHappyverse·24 Nis

@gdb Cool, just sent an email :)

English

Greg Brockman@gdb·23 Nis

we're rolling codex out to whole companies/enterprises. ping me gdb@openai.com if of interest!

Sam Altman@sama

We tried a new thing with NVIDIA to roll out Codex across a whole company and it was awesome to see it work. Let us know if you'd like to do it at your company!

English

1.1K

128.2K

Max Sapo@MaxHappyverse·24 Nis

@sama Is Codex on Cerebras and your own ASICs coming soon as well?

English

785

Sam Altman@sama·23 Nis

We tried a new thing with NVIDIA to roll out Codex across a whole company and it was awesome to see it work. Let us know if you'd like to do it at your company!

English

487

421

8.2K

Max Sapo@MaxHappyverse·24 Nis

@JeffDean @AcquiredFM @gilbert @djrosent It was such a great session @JeffDean - you and Amin are an amazing duo - thanks for doing so many great things for the world! excited to try out TPU v8 when it becomes available!

English

455

Jeff Dean@JeffDean·23 Nis

I had a good time discussing yesterday's Google TPU v8t and v8i announcement at Cloud Next with Amin Vahdat along with @AcquiredFM hosts @gilbert and @djrosent. The blog post announcement has lots of details about these new chips: blog.google/innovation-and… Here's a thread of some particular things I'm excited about:

English

519

158.2K

Max Sapo@MaxHappyverse·23 Nis

@DynamicWebPaige @GoogleDeepMind @GoogleStartups @EnchantedTools @GoogleAIStudio @livekit We like PipeCat a little more :)

English

👩‍💻 Paige Bailey@DynamicWebPaige·23 Nis

The expo floor at #googlecloudnext is crazy right now, make sure to catch all of the @googledeepmind and @GoogleStartups booths! Shown below: robotics (with @EnchantedTools running Gemini Live from @GoogleAIStudio), TPUs, and our @livekit partners.

Jaana Dogan ヤナドガン@rakyll

If you are at Google Cloud Next, come and join us at the Gemini Playspace to talk about Gemini models.

Paradise, NV 🇺🇸 English

Max Sapo@MaxHappyverse·23 Nis

Wow, even @elonmusk acknowledges this! #GoTPUs!

Elon Musk@elonmusk

@sundarpichai TPUs are underrated

English

Max Sapo@MaxHappyverse·23 Nis

cloud.google.com/blog/products/…

ZXX

Max Sapo@MaxHappyverse·23 Nis

Met lots of good old friends at @googlecloud #Next2026 and even took a selfie with #TPUv4, my first product launch that I led back in 2022. Can’t believe it’s been 4 years ago - kids grow fast :) Link to a blog in comments below (for those curious to compare what was state-of-the-art then vs now. 🙂)

English

Max Sapo@MaxHappyverse·23 Nis

@Jason I’m in

English

@jason@Jason·23 Nis

We started an AI founder twitter group... reply with "I'm in" if you're a founder and want to be added

English

10.8K

135

4.6K

903.4K

Max Sapo@MaxHappyverse·15 Mar

Met @que_tourist earlier this week. My best celebrity selfie of the year so far 🤗🤙

English

Max Sapo@MaxHappyverse·27 Şub

@HappyverseAI webinar on 2/26! Tomorrow we are teaching a diverse group of founders, knowledge workers and creators how to build and deploy their own Digital Employee — a real-time AI video agent that can sell, screen, coach, and represent you 24/7. Day 5 of our AI Bootcamp covers: → The architecture behind sub-500ms real-time video agents → Writing system prompts that actually govern behavior → Connecting your agent to calendars, Google Meet/Zoom and agentic tools → Security, guardrails, and going from demo to business asset You leave with a working, deployable digital employee — not a concept. Joining me are my colleagues and advisers - Nora Salim and Mariana Tataryn. Last chance to grab a seat: luma.com/s4uydmv4 Use code HAPPY for a discount. See you there 🤝

San Francisco, CA 🇺🇸 English

Max Sapo@MaxHappyverse·13 Şub

@dylan522p Go with Inference Dylan

San Francisco, CA 🇺🇸 English

129

Dylan Patel@dylan522p·13 Şub

$1,000 for whoever comes up with the best name replacement for InferenceMAX InferenceMAX 2.0 dropping soon but we have to rename it because HBO MAX sent us a cease and desist. We have all NVIDIA GPUs from h100 to GB300 on large MoEs with SOTA optimizations like Disagg PD tested

English

360

295

59.4K

Max Sapo@MaxHappyverse·13 Şub

@dylan522p Haha, how can I send "cease and decist" to HBO Max lol

San Francisco, CA 🇺🇸 English

189

Max Sapo@MaxHappyverse·11 Oca

@venturetwins Wait, there are two Yang Mun monks - one is "itsyangmuns" with ~960K and another one "yangmunus" with ~2.5M - and both are AI avatars it looks like? Wondering if it's the same creator or one is a copycat of the other

San Francisco, CA 🇺🇸 English

440

Justine Moore@venturetwins·10 Oca

Stumbled upon an insanely popular monk on Instagram named Yang Mun. He started posting videos in October and is now about to cross 1M followers - people love his “ancient Chinese wisdom.” Almost no one seems to realize that he’s entirely AI-generated 🤯

English

535

215.2K

Max Sapo@MaxHappyverse·11 Oca

Actually, you don't need that many tools - you can do everything with @HappyverseAI or our peers like HeyGen or Tavus. In addition, we also support real-time conversations for custom avatars, and we have our own Happyverse.ai/monk 🙂 (which just haven't pushed it through Insta yet cause they don't support real-time video comms unfortunately)

San Francisco, CA 🇺🇸 English

320

Justine Moore@venturetwins·10 Oca

If I had to guess - this creator generates the scripts with ChatGPT, makes the image with NB Pro, does the voice on Eleven, and then lip syncs with Hedra/Veed/OmniHuman. It’s wild that you can use this pipeline to make a fully faceless, automated character! Here’s the page ⬇️