Max Sapo

803 posts

@MaxHappyverse

Co-Founder & CEO @ Happyverse AI. Ex-Google TPU team & Etched. Stanford MBA/MS grad. Georgia Tech drop out. Dad.

San Francisco, CA · Joined January 2023
765 Following · 533 Followers
Pinned Tweet
Max Sapo
Max Sapo@MaxHappyverse·
We launched Happyverse 2.0 - a platform for building lifelike, real‑time Confidants (AI avatars + agents) that hold genuine conversations, stay tied to real data, and deliver measurable results for individuals and businesses. producthunt.com/products/happy… Please vote for us on Product Hunt and share any feedback you might have! Also, huge thanks to our business partners for your support - @googlecloud (amazing Gemini team in particular - @DynamicWebPaige @triswarkentin @OfficialLoganK and others!), @pipecat_ai (@kwindla and others!), @elevenlabs (congrats on your own launch & summit!) and many others! P.S. Here is a Google Meet conversation with my real-time app.happyverse.ai/CEO Confidant and our CTO, Nicholas 🙂
San Francisco, CA 🇺🇸 English
3
5
28
30.8K
Max Sapo retweeted
FlyMy.AI
FlyMy.AI@FlyMy_AI·
🚀 Today we ship @FlyMy_AI Agents. The world's first all-in-one agentic cloud. The modern way to build, integrate, and scale production AI agents.
3 steps to a production agent:
1. Connect your work tools to FlyMy
2. Describe what the agent should do - in text or 5 lines of code
3. Set execution rules: manual, scheduled, or integrated into your backend
4. Done! The agent works at scale!
Everything in one place: 800+ MCPs, hundreds of AI models, brain, memory, sandboxes. Stop building from scratch. Stop waiting for infra. Compress 6 months into a day. #1 on @ArtificialAnlys benchmarks. Stable, secure, scalable from day one. Try FlyMy.AI
English
8
5
24
5.2K
Max Sapo
Max Sapo@MaxHappyverse·
Amin Vahdat is one of the most underrated leaders of our generation. His ability to zoom in / zoom out is just second-to-none.
Shanu Mathew@ShanuMathew93

Amin Vahdat live at Transition-AI 2026 with @CatalystPod. Google's Chief Technologist for AI Infrastructure - the man in charge of Google's $175–185B 2026 CapEx and the one who's said compute capacity needs to 2x every 6 months!

Key takeaways:

Training → Inference cascade
- Frontier GW training clusters have a ~1–2 year useful life; then capacity cycles to serving
- Inference doesn't need GW scale: <100 MW can do useful work on large models
- "Entering the age of inference" is now real as agents explode DC footprint
- Speed-of-light latency starts to matter as models get faster: geography becomes UX + reliability
- "a medium number of medium-sized data centers, augmented with a small number of large ones" (few large training clusters, inference can be mix of 10s to 100s of MW)

Reliability reframe
- 4-nines isn't intrinsic: "we should be thinking about lower reliability power delivery overall"
- Do you make this trade: 4-nines at half capacity, or 2-nines (3.65 days downtime/yr) at 2x capacity? Customers "very often" pick 2x

Behind the meter
- Google actually prefers grid-connected capacity = BTM is a bridge, not a destination
- BTM is about a different latency: time-to-delivery of capacity
- Bridge sources: turbines, gas, mobile generation. Permanent: solar, wind, nuclear
- Stranded BTM? "I'd love to have that problem." Energy is the limiter

Co-design = capability
- Google co-designs across Gemini ↔ software ↔ TPU ↔ rack ↔ DC ↔ power ↔ building
- A few percent at each interface compounds into real advantage

Bottlenecks
- Won't force-rank chips/power/labor/EPC: "10am it's labor, noon it's power, 2pm it's chips, every day"
- YoY efficiency is real: this year's capacity would've cost ~1.2x last year

Building fungibility is dead, purpose-build is back
- 25-yr buildings vs. 5–6 chip generations. Disk rack vs. GPU rack = ~100x watts/sq ft and widening
- Old world: build for fungibility (compute wasn't dominant cost). New world: "this is a GPU building, that's a TPU building"
- Density wasn't maxed before because flexibility was worth more. That's flipped

One of the most important pods of the year from an AI leader at one of the most important companies at the center of it all! Great job @shaylekann

English
0
0
2
162
Max Sapo
Max Sapo@MaxHappyverse·
@sundeep 1M+ TPU 8t though, slightly more than 960K A5Xs :)
English
0
0
0
33
sunny madra
sunny madra@sundeep·
You want scale: 960,000 NVIDIA Rubin GPUs in a multisite cluster “At Google Cloud Next, Google announced A5X powered by NVIDIA Vera Rubin NVL72 rack-scale systems, which — through extreme codesign across chips, systems and software — deliver up to 10x lower inference cost per token and 10x higher token throughput per megawatt than the prior generation.  A5X will use NVIDIA ConnectX-9 SuperNICs, combined with next-generation Google Virgo networking, scaling to up to 80,000 NVIDIA Rubin GPUs within a single site cluster and up to 960,000 NVIDIA Rubin GPUs in a multisite cluster, enabling customers to run their largest AI workloads on NVIDIA‑optimized infrastructure” blogs.nvidia.com/blog/google-cl…
English
5
20
108
8.7K
Max Sapo
Max Sapo@MaxHappyverse·
@gdb Cool, just sent an email :)
English
0
0
0
15
Max Sapo
Max Sapo@MaxHappyverse·
@sama Is Codex on Cerebras and your own ASICs coming soon as well?
English
0
0
0
785
Sam Altman
Sam Altman@sama·
We tried a new thing with NVIDIA to roll out Codex across a whole company and it was awesome to see it work. Let us know if you'd like to do it at your company!
English
487
421
8.2K
1M
Jeff Dean
Jeff Dean@JeffDean·
I had a good time discussing yesterday's Google TPU v8t and v8i announcement at Cloud Next with Amin Vahdat along with @AcquiredFM hosts @gilbert and @djrosent. The blog post announcement has lots of details about these new chips: blog.google/innovation-and… Here's a thread of some particular things I'm excited about:
English
18
81
519
158.2K
Max Sapo
Max Sapo@MaxHappyverse·
Met lots of good old friends at @googlecloud #Next2026 and even took a selfie with #TPUv4, my first product launch that I led back in 2022. Can’t believe it’s been 4 years - kids grow fast :) Link to a blog in comments below (for those curious to compare what was state-of-the-art then vs now 🙂).
English
1
0
0
45
@jason
@jason@Jason·
We started an AI founder twitter group... reply with "I'm in" if you're a founder and want to be added
English
10.8K
135
4.6K
903.4K
Max Sapo
Max Sapo@MaxHappyverse·
Met @que_tourist earlier this week. My best celebrity selfie of the year so far 🤗🤙
English
0
0
1
82
Max Sapo
Max Sapo@MaxHappyverse·
@HappyverseAI webinar on 2/26! Tomorrow we are teaching a diverse group of founders, knowledge workers and creators how to build and deploy their own Digital Employee — a real-time AI video agent that can sell, screen, coach, and represent you 24/7. Day 5 of our AI Bootcamp covers: → The architecture behind sub-500ms real-time video agents → Writing system prompts that actually govern behavior → Connecting your agent to calendars, Google Meet/Zoom and agentic tools → Security, guardrails, and going from demo to business asset You leave with a working, deployable digital employee — not a concept. Joining me are my colleagues and advisers - Nora Salim and Mariana Tataryn. Last chance to grab a seat: luma.com/s4uydmv4 Use code HAPPY for a discount. See you there 🤝
San Francisco, CA 🇺🇸 English
0
0
0
20
Max Sapo
Max Sapo@MaxHappyverse·
@dylan522p Go with Inference Dylan
San Francisco, CA 🇺🇸 English
0
0
0
129
Dylan Patel
Dylan Patel@dylan522p·
$1,000 for whoever comes up with the best name replacement for InferenceMAX InferenceMAX 2.0 dropping soon but we have to rename it because HBO MAX sent us a cease and desist. We have all NVIDIA GPUs from h100 to GB300 on large MoEs with SOTA optimizations like Disagg PD tested
English
360
4
295
59.4K
Max Sapo
Max Sapo@MaxHappyverse·
@dylan522p Haha, how can I send a "cease and desist" to HBO Max lol
San Francisco, CA 🇺🇸 English
0
0
0
189
Max Sapo
Max Sapo@MaxHappyverse·
@venturetwins Wait, there are two Yang Mun monks - one is "itsyangmuns" with ~960K and another one "yangmunus" with ~2.5M - and both are AI avatars it looks like? Wondering if it's the same creator or one is a copycat of the other
San Francisco, CA 🇺🇸 English
0
0
0
440
Justine Moore
Justine Moore@venturetwins·
Stumbled upon an insanely popular monk on Instagram named Yang Mun. He started posting videos in October and is now about to cross 1M followers - people love his “ancient Chinese wisdom.” Almost no one seems to realize that he’s entirely AI-generated 🤯
English
57
31
535
215.2K
Max Sapo
Max Sapo@MaxHappyverse·
Actually, you don't need that many tools - you can do everything with @HappyverseAI or our peers like HeyGen or Tavus. In addition, we also support real-time conversations for custom avatars, and we have our own Happyverse.ai/monk 🙂 (we just haven't pushed it to Insta yet because they don't support real-time video comms, unfortunately)
San Francisco, CA 🇺🇸 English
0
0
0
320
Justine Moore
Justine Moore@venturetwins·
If I had to guess - this creator generates the scripts with ChatGPT, makes the image with NB Pro, does the voice on Eleven, and then lip syncs with Hedra/Veed/OmniHuman. It’s wild that you can use this pipeline to make a fully faceless, automated character! Here’s the page ⬇️
English
10
9
133
11.9K
Max Sapo
Max Sapo@MaxHappyverse·
@venturetwins Cool, but why wouldn't they make it available to talk real-time though? 🙂
San Francisco, CA 🇺🇸 English
0
0
0
209
Max Sapo
Max Sapo@MaxHappyverse·
@grok is getting really good at pre-generated avatar videos! 😃 (Not my voice and not realtime but that's what we can do at @HappyverseAI !)
Enterprise, NV 🇺🇸 English
1
0
0
65
Alps
Alps@alpaysh·
i thought the view was surreal, but i looked twice and found out that it was just Satya Nadella and Marc Andreessen
English
1.5K
1.2K
16.1K
20.7M