
Baseten
2.3K posts

Baseten
@baseten
Inference is everything.




We now have a product specifically created for AI labs and their closed-weight models: we'll take care of not just inference, but auth, rate limits, metering, and billing integrations. We'll take care of providing both shared and dedicated inference, compliance needs, and matching end customers' geo requirements (us, ca, eu, uk, aus, jp, etc). It's called Baseten Frontier Gateway and is already battle-tested by multiple AI labs, like Poolside and their impressive Laguna M.1 agentic coding model.













The pipeline of wisdom? Fully loaded. The close rate on great advice? 100%. I got a chance to sit down with the one and only @DannieHerz, president at @baseten, and talk all things GTM in this new era of AI. A few of my favorite takeaways: ✅ The AI-era sales hire is the tech enthusiast who uses the product, talks to engineers in their network, and earns trust by not sounding rehearsed. ✅ Culture = intensity + joy. Hire killers who are also a delight. ✅ Great recruiting isn't a pitch, it's relationship driven. No hard sell, just real work together with great people. Dannie joined Baseten for that exact reason.



The pipeline of wisdom? Fully loaded. The close rate on great advice? 100%. I got a chance to sit down with the one and only @DannieHerz, president at @baseten, and talk all things GTM in this new era of AI. A few of my favorite takeaways: ✅ The AI-era sales hire is the tech enthusiast who uses the product, talks to engineers in their network, and earns trust by not sounding rehearsed. ✅ Culture = intensity + joy. Hire killers who are also a delight. ✅ Great recruiting isn't a pitch, it's relationship driven. No hard sell, just real work together with great people. Dannie joined Baseten for that exact reason.


I sat down with Bola, the brain behind our Baseten Frontier Gateway, to unpack why we built it, how we're already powering labs like Poolside, and where we're taking it next. If you're building a model lab, this one's for you. 00:00 Introductions 00:46 The Jagged Frontier Concept & Cambrian explosion of AI labs 03:34 The DeepSeek effect 06:06 What labs need to build a business & why existing gateways don’t work for inference 09:22 The gateway as a core infrastructure piece and what it actually does 11:44 Poolside as the launch partner for Baseten 13:42 What the gateway means for Baseten and closing message for labs founders

We serve Qwen3-TTS on vLLM-Omni at $3 per 1M characters. That's 90% lower in cost than comparable closed-source TTS APIs. Our engineers optimized a single-replica serving stack to get there. Details on the optimized stack and cost per concurrent stream here.








