Baseten

2.3K posts

Baseten banner
Baseten

Baseten

@baseten

Inference is everything.

San Francisco and New York Katılım Mart 2021
340 Takip Edilen10.1K Takipçiler
Baseten
Baseten@baseten·
We serve Qwen3-TTS on vLLM-Omni at $3 per 1M characters. That's 90% lower in cost than comparable closed-source TTS APIs. Our engineers optimized a single-replica serving stack to get there. Details on the optimized stack and cost per concurrent stream here.
Ian Carrasco@ia_n_ai

x.com/i/article/2054…

English
3
3
48
21K
Baseten retweetledi
adam bain
adam bain@adambain·
This is a great piece from @baseten's @oneill_c: The question every app layer company is now asking is: "how do we resist commodification to deliver better results for customers?" The answer is specialized models based on your unique understanding of who you serve every day.
Charlie O'Neill@oneill_c

x.com/i/article/2054…

English
0
1
10
5.7K
Baseten retweetledi
Barry McCardel
Barry McCardel@barrald·
when I first heard of @baseten they were basically a competitor but then I met @tuhinone and I liked him, and when I heard they were pivoting to inference, I was relieved because I didn't want to compete against him and then he hired @DannieHerz and I was angry because I wish I had thought of it and now they're a critical partner for us as we embrace our own many model future at @_hex_tech I'm so happy for all their success and very excited to share what we've been working on with them!
Tuhin Srivastava@tuhinone

x.com/i/article/2054…

English
0
6
96
22.4K
Baseten retweetledi
sarah guo
sarah guo@saranormous·
The starting premise @Conviction was that AI (general models at scale) were a broad shift in computing. This has come to pass But the way AI benefits many users more powerfully is going to be more distributed product/research work, in partnership with humans who do the work
Tuhin Srivastava@tuhinone

x.com/i/article/2054…

English
7
9
146
41.6K
Baseten
Baseten@baseten·
“The question every app layer company is now asking is no longer ‘how do we use AI?’ It is ‘how do we resist commodification to deliver better results for customers?’ The answer is specialized models based on your unique understanding of who you serve every day. The big labs can’t do it, but you can.” Baseten’s Head of Model Training, @oneill_c, on the wave of AI companies using post-training to deliver better results for customers via specialized models.
Charlie O'Neill@oneill_c

x.com/i/article/2054…

English
0
0
22
1.7K
Tereza Tizkova
Tereza Tizkova@tereza_tizkova·
Visited my first-ever conference as a sponsor, and it was a wakeup call! 🫥So I made a "guide to not wasting money on conference booths." Below are some learnings, checklist, & things I really loved online and irl. Im also giving credits there to my and my friends' favorite conference booths. Check it out!🔗👇
Tereza Tizkova tweet mediaTereza Tizkova tweet mediaTereza Tizkova tweet mediaTereza Tizkova tweet media
English
2
2
48
3.5K
Baseten retweetledi
Dannie Herzberg
Dannie Herzberg@DannieHerz·
Every day we're seeing more companies emerge with specialized models that push the SOTA forward. Congrats to the Speechify team on the launch of SIMBA 3.0! Very happy to partner with you. More here: baseten.co/resources/cust…
Dannie Herzberg tweet media
English
1
2
10
1.4K
ian
ian@IKorovinsky·
just wrapped up my first week @baseten on the post-training team can’t wait to make some models go brrr 💚
ian tweet media
English
25
2
152
5.3K
Baseten
Baseten@baseten·
We're proud to share our partnership story with @SpeechifyAI. Speechify just announced SIMBA 3.0, ranked top 10 globally on the @ArtificialAnlys TTS leaderboard and the most cost-efficient model by far in that tier. We’re honored to serve the full SIMBA TTS family and other core workloads for Speechify, achieving: → 44% lower cost per 1M characters → 30-50% lower p99 latency → 4.5x faster cold starts Read the full case study here:
Baseten@baseten

x.com/i/article/2054…

English
1
6
16
1.2K