Baseten

2.2K posts

Baseten banner
Baseten

Baseten

@baseten

Inference is everything.

San Francisco and New York เข้าร่วม Mart 2021
332 กำลังติดตาม8.8K ผู้ติดตาม
ทวีตที่ปักหมุด
Baseten
Baseten@baseten·
We’re thrilled to announce that we have raised $300M at a $5B valuation. The round is led by IVP and CapitalG, both doubling down on their investment in Baseten, and joined by 01A, Altimeter, Battery Ventures, BOND, BoxGroup, Blackbird Ventures, Conviction, Greylock, and NVIDIA. Read more here: baseten.co/blog/announcin…
Baseten tweet media
English
41
24
328
283.2K
Baseten รีทวีตแล้ว
Dannie Herzberg
Dannie Herzberg@DannieHerz·
Nearly half of all U.S. physicians rely on OpenEvidence every single day - at the bedside, in the operating room, and at the point of care. It surfaces research in real time, exactly when doctors need it most. Baseten powers the inference that makes this possible. Thank you, Team @EvidenceOpen, for what you do and for the trust.
English
1
1
14
934
Baseten รีทวีตแล้ว
Tuhin Srivastava
Tuhin Srivastava@tuhinone·
OpenEvidence has become the default medical knowledge platform for over 40% of U.S. physicians; it's relied on daily for the highest-stakes decisions in medicine. Baseten is honored to power the inference behind it.
English
6
16
155
19.7K
Fareeha
Fareeha@fareehasala·
they inference engineered my ice cream???
Fareeha tweet media
English
4
0
31
1K
Baseten รีทวีตแล้ว
OpenEvidence
OpenEvidence@EvidenceOpen·
Over 1 million clinical questions hit OpenEvidence every day. More than half the practicing physicians in the US rely on us at the point of care, mid-decision, with a patient in front of them. Downtime in that moment has real consequences. We partner with @baseten for our inference infrastructure to make sure answers are always there when physicians need them. They stopped by our office to talk about what that looks like under the hood.
English
6
14
83
95.6K
Baseten
Baseten@baseten·
Gemma 4 is live on Baseten and available to all customers on day 0 via the Baseten model library. All models in the Gemma 4 family are multimodal, supporting text and image inputs with text output. Key capabilities include: -> Advanced reasoning and thinking -> Coding and function calling -> OCR for document understanding -> Long context windows up to 256K tokens But the most impressive is how Gemma 4 is pushing the boundaries of model architecture with innovations including alternative attention mechanisms, Proportional RoPE, Per-Layer Embeddings (PLE), KV-Cache Sharing, native aspect ratio handling for vision, and a smaller frame window for audio. All are designed to improve efficiency, accuracy, and scalability. Try it today: baseten.co/library/publis…
Baseten tweet media
English
3
7
43
2.3K
Baseten รีทวีตแล้ว
Rachel Rapp
Rachel Rapp@rapprach·
Our last Women in AI event might just convince me to move to NYC... I was the solo-female-engineer during all of my engineering career (similar story in academia). Being surrounded by so many excellent engineers and technical leaders—100% women—was a treat. 🩷 If you're in NYC, we're doing it again at Le Labo soon. DM me if you want to join!
Rachel Rapp tweet media
English
2
1
18
1.1K
Baseten รีทวีตแล้ว
NVIDIA
NVIDIA@nvidia·
Delivered performance, not peak chip specifications, drives AI factory productivity. Rigorous benchmarks are the only way to see past the noise. In MLPerf Inference v6.0, NVIDIA extreme co-design delivered the highest token output across the broadest range of models and scenarios. Maximizing token output drives down token cost and maximizes AI factory productivity. Read the blog post to dive into the details: nvda.ws/41aqALX @Baseten, @CoreWeave, @mlcommons
English
25
33
138
24.2K
Baseten
Baseten@baseten·
@antonpme Hi Anton - could you DM the email associated with your account so we can locate the support records and better investigate?
English
0
0
0
36
Anton P. 👽
Anton P. 👽@antonpme·
@baseten I already emailed you about this, but my main question is: what was I charged for? I haven't had any activity since I created the account. I’ve never used anything, and when I checked the dashboard and all the stats, there are zero projects.
English
1
0
1
84
Anton P. 👽
Anton P. 👽@antonpme·
🚨 @baseten is charging users for accounts with zero usage. No deployments, no requests, no nothing. The monthly bill anyway for something I didn't even subscribe to. Asked support to stop last month. Got a refund. This month: charged again. WTF? Wild business model: bill people for services they never used, hope they don't notice. 🤬
Anton P. 👽 tweet media
English
1
1
5
516
Baseten รีทวีตแล้ว
Harry Partridge
Harry Partridge@part_harry_·
Pretraining is data-inefficient. This is entirely a consequence of the fact that we throw away the KV cache after every forward-backward step! If we can integrate efficient KV cache compaction into pretraining, we will unlock human level data efficiency. Neural KV cache compaction makes this possible.
Charlie O'Neill@oneill_c

x.com/i/article/2039…

English
12
35
301
38.4K
Baseten
Baseten@baseten·
What if LLMs could remember as humans do? LLM memory is either perfect and lossless or ultra-compressed. What does a slightly compressed working memory to extend its context window look like? Our researchers built a 7M-parameter perceiver that compresses KV caches 8x while retaining 90%+ factual retention. Unlike existing compaction methods, we trained a model to do this in a single forward pass. We see this as the first step toward models that actually learn from experience.
Charlie O'Neill@oneill_c

x.com/i/article/2039…

English
1
9
54
5.3K
Baseten
Baseten@baseten·
@rapprach can't wait for the Baseten-branded clothing line
English
0
0
1
118
Sriram
Sriram@tsriram·
Update: @philipkiely from @Baseten is joining us for a lightning talk at Agents and Money. The lineup keeps on getting better! If you're building agents that need to be cost-efficient at scale, this is the room to be in, with the best in the game. April 15 | Fort Mason | 5 PM RSVP → luma.com/5nqllmul
Sriram@tsriram

I can wait on AGI, but can our agents get some money awareness... ASAP? 💸 Most agents are financially reckless by default. Add any complexity and you could one-shot your entire token budget. 🤯 Excited to co-host an evening in SF with @mastra_ai where we dig into the principles behind scaling radically efficient agents with some of the best minds in the arena. On the agenda: → a workshop with @bookercodes 🛠️ → a lightning talk ft. @swyx ⚡ → a fireside chat with @NotionHQ's @sarahmsachs and our co-founder Rajaraman Santhanam 🔥 Followed by food, drinks, and conversations. April 15 | Fort Mason | 5 PM | 100 spots RSVP → luma.com/5nqllmul

English
1
2
5
428
Baseten
Baseten@baseten·
We've had a great month of March! A brief recap: -> NVIDIA GTC, featuring books, ice cream, and swag -> KubeCon EMEA, including a 2000+ person House of Kube event -> AI engineering leaders dinners at Wolfsbane (SF) and Manhatta (NYC) -> Baseten-branded ice cream social in SF -> AI/ML trivia night in the West Village -> NYC office warming party Want to attend our next event? Sign up here baseten.co/resources/type…
English
0
0
3
699
Battery Ventures
Battery Ventures@BatteryVentures·
Congratulations to the 4 Battery portfolio companies recognized on the 2026 #ET30 list! Late: @baseten @Gong_io Giga: @ClickHouseDB @databricks The AI supercycle is giving rise to an entirely new stack, and we're proud to back the founders building across it💥
Battery Ventures tweet media
Wing VC@Wing_VC

The 2026 Enterprise Tech 30 is live! 🚀 60 companies shaping the future of enterprise. In partnership with @EricNewcomer at @NewcomerMedia See the full list: wing.vc/et30 #ET30

English
1
0
5
695
Baseten รีทวีตแล้ว
Charlie O'Neill
Charlie O'Neill@oneill_c·
Feel like this thesis gets more and more evidence behind it every day. Cursor, Chroma, Pinterest, Cognition, Decagon, Hippocratic, Intercom (and many many more behind the scenes) all realising that the way to own the compounding flywheel of value is specialising an open-source model to be incredibly good at the tasks you care about, with real data that continuously makes it better (which is the advantage you have over the big labs who will keep trying to eat your lunch with general intelligence improvements). my bet is that this will be the norm rather than the exception on a much shorter timeline than you think. Most companies worth their salt don't want to be beholden to or held hostage by a big lab. If you own your model, there's a huge surface area of things you can do on both the training and inference side to deliver the best possible experience to your end users
Baseten@baseten

Initially, we believed that open-source models could only accomplish a fraction of the tasks of closed-source models. Since then, researchers have discovered that open-source models can be highly specialized. This led to an explosion in use cases accomplished by open-source, driving industry demand. Baseten Head of Training @oneill_c on how specialization has transformed the open-source vs. closed-source debate.

English
1
9
55
12K