Ed Shrager

1.7K posts

Ed Shrager banner
Ed Shrager

Ed Shrager

@edshrag

Tweets about AI

San Francisco, CA Beigetreten Ağustos 2019
412 Folgt3.2K Follower
Angehefteter Tweet
Ed Shrager
Ed Shrager@edshrag·
1/ The personal agent, that’s the big thing - Bill Gates Introducing LLynx🐈, a building block to help enable action-oriented AI Agents. It’s fast, accurate & small (3B params) 🙏Built on resources from @langchain @hwchase17 @huggingface @_philschmid @chipro @lmoroney Demo👇
English
57
89
804
502.5K
Ed Shrager retweetet
Arsh Shah Dilbagi
Arsh Shah Dilbagi@arshdilbagi·
The hard part about LLM failures is that their outputs rarely look like failures. The demo “works.” The output sounds coherent. The user actively uses the product. And your dashboard looks normal. Meanwhile, the system can be wrong, unsafe, or quietly driving up token spend. And you won’t notice until the damage adds up. Prompts often serve as business logic (policies, safety, and product context). But many teams ship them without the basics, such as versioning, reviewable changes, end-to-end traces, and eval gates. In production, it doesn’t crash. It degrades via wrong answers, policy misses, and surprise spending. No crash. No error. No alert. I cover this exact issue in my @Stanford CS 224G guest lecture on AI Observability and Evaluations. Here are the core ideas: • If you only log the final output, you’re guessing. Full traces show where it broke. • Evals are feedback loops. Use clear pass/fail criteria tied to outcomes. • Run evals continuously on production traces and don’t wait for support tickets. The moat isn’t prompt cleverness. It’s a measured improvement. Full lecture + blog below 👇
English
29
19
128
20K
Ed Shrager retweetet
Tuhin Srivastava
Tuhin Srivastava@tuhinone·
Baseten’s day 0 bet was that inference was the technology that would enable the best user experiences AI could deliver–fast, smart, reliable, secure. And that those experiences would rely not only on a handful of giant general intelligence models, but millions of specialized models built by companies for their specific customers and use cases. Whether you’re a doctor, developer, lawyer, mechanic, researcher, construction worker, marketer, etc, you’re accelerated by specialized tools worthy of your craft. To me, this is one of the most meaningful promises AI can deliver on. We’re starting to see it now. Many of the main-character AI companies on the application layer are built on highly-specialized models for highly-specialized workflows–Abridge, Clay, Cursor, OpenEvidence, Hebbia, Mercor, Notion–these businesses are booming because customers love specialized tools. There are probably hundreds of custom models in production today. Soon, there will be thousands and then millions. All enabled by a high-performing inference layer. Inference has emerged as one of the hardest problems in modern AI systems. Delivering reliable, low-latency experiences requires deep coordination across distributed infrastructure, kernel-level performance, and software ergonomics—even world-class teams struggle to do this well. As a result, as consumers and developers, we’ve grown to accept sluggish performance, frequent downtime, and inconsistent quality across both application companies and model providers. Meanwhile, the demands on inference are accelerating: AI adoption is trending towards ubiquity with reasoning models that are orders of magnitude more compute-intensive. This will only increase as more companies catch on to the virtues of owning their end-to-end IP rather than relying on black-box model APIs on shared infrastructure. Whether we can realize the impact of this generational shift will depend on our ability to serve these models reliably at scale. We knew we could make the technology work, but the biggest delight of it all has been seeing what our customers do with it. The (many-model) future is bright.
Baseten@baseten

We’re thrilled to announce that we have raised $300M at a $5B valuation. The round is led by IVP and CapitalG, both doubling down on their investment in Baseten, and joined by 01A, Altimeter, Battery Ventures, BOND, BoxGroup, Blackbird Ventures, Conviction, Greylock, and NVIDIA. Read more here: baseten.co/blog/announcin…

English
48
39
222
78.4K
Nick Rudder
Nick Rudder@nrudder_·
.@get_sphere has raised a $21M Series A, led by @a16z, to build the world's first AI-native tax engine 🌎 There are two diametrically opposed forces that are coming to a head right now - companies are going global faster than ever but cross border compliance is becoming more and more complex. Current tax compliance products on the market can't scale globally because: 1. They do their tax research manually (which is inefficient, slow and prone to error) 2. They outsource data transmission to tax authorities via local service partners (no automation, manual back and forth with 3rd parties, exorbitant fees). Sphere has completely changed that. Our AI native tax engine collects, codifies and monitors tax law globally and our 100+ local rails into tax authorities fully automates the compliance lifecycle end to end. And indirect tax is just the start. What Deel did for global payroll compliance, Sphere is doing for revenue-based compliance. This round plants the flag in the ground for a new type of cross-border compliance system - one that leverages AI to understand the granular complexity of tax law and apply it to the product catalog of any business. A huge thank you to all our customers - including global companies like @Lovable, @elevenlabs, @Replit, @runwayml and @deel. A massive shout out to our investors who have come on somewhat of a non-linear journey - @mandrusko1, @a16z, @ycombinator, @felicis, @HarryStebbings, @20vcFund, @pioneer_fund, @mejiasebas, @RichAberman + many more. And most importantly a nod to my team, who is in the trenches with me every day, and my family, without whom we wouldn't be where we are. If you're going global, go with @get_sphere
English
23
20
143
99.1K
World Labs
World Labs@theworldlabs·
Introducing Marble by World Labs: a foundation for a spatially intelligent future. Create your world at marble.worldlabs.ai
English
358
610
3.3K
2.1M
Ed Shrager retweetet
okWOW
okWOW@okwowAI·
For decades, technology was inching closer to being more personal, more intuitive, more human. 80s: point and click instead of cryptic commands 90s: the web remembered our preferences. 2000s: touchscreens let us touch our ideas. 2010s: devices predicted our needs... 🧵 1/7
okWOW tweet media
English
3
10
22
6.1K
Ed Shrager retweetet
Tuhin Srivastava
Tuhin Srivastava@tuhinone·
Today, we’re excited to announce our $150M Series D, led by BOND, with Jay Simons joining our Board. We’re also thrilled to welcome Conviction and CapitalG to the round, alongside support from 01 Advisors, IVP, Spark Capital, Greylock Partners, Scribble Ventures, and Premji Invest. The last eighteen months have been a whirlwind; as the AI application layer has taken off, we've been proud to play a small part supporting world class companies run their production workloads. Thanks to all our customers including Abridge, Bland, Clay, Gamma, Mirage, OpenEvidence, Sourcegraph, WRITER, and Zed Industries. We’re just getting started. If you’re building the next generation of AI products, we’d love to work with you.
English
76
37
481
304.3K
Ed Shrager retweetet
Ed Shrager retweetet
NVIDIA AI
NVIDIA AI@NVIDIAAI·
📈 @baseten users are scaling smarter with us: ✅ 5× throughput on high-traffic endpoints ✅ 50% lower cost per token ✅ Up to 38% lower latency on the largest LLMs Built on NVIDIA Blackwell + TensorRT-LLM + Dynamo on @googlecloud—driving efficiency, speed & adoption at scale. Learn More: nvda.ws/4lUKT89
NVIDIA AI tweet media
English
8
22
106
15.2K
Ed Shrager retweetet
Baseten
Baseten@baseten·
Welcome to Baseten @DannieHerz! We’re thrilled to announce that Dannie Herzberg has joined as our new President to lead Baseten’s GTM and operations. As @tuhinone shared: "Dannie is biased towards action, dependable, and long-term in her thinking, and she knows that the customer experience is everything." Here’s to building the next chapter of Baseten with you, Dannie! Read more from Tuhin about Dannie here → baseten.co/blog/welcoming…
Baseten tweet media
English
15
8
105
97.3K
dax
dax@thdxr·
baseten was the worst perfoming on gosucoders qwen3 benchmarks last week but they called me immediately to figure out how to fix they've been working the last 3 days and now it's fixed - it's up there with the best options and it's fixed for everyone, not just opencode users
English
12
5
172
17.6K
Ed Shrager
Ed Shrager@edshrag·
@thdxr 💪💪💪 it's how you react + the result that counts 🚀
English
0
0
4
446
Ed Shrager retweetet
Amir Haghighat
Amir Haghighat@amiruci·
It's important to support newly released open-weight models on day 1. But it's not noteworthy. What's noteworthy is to have the inference optimization muscle to immediately blow the competition out of water on latency and throughput. As measured by OpenRouter:
Amir Haghighat tweet media
English
13
14
88
28.7K
Ed Shrager retweetet
Arsh Shah Dilbagi
Arsh Shah Dilbagi@arshdilbagi·
Most AI products fail in the first month. Not bad AI. Bad prompts. Teams at Discord, McKinsey, Salesforce, DoorDash, Reforge, and over 100K+ developers using us know why: Teams wing their prompts – test on 5 examples, ship to millions, pray it works. Today changes everything.
English
116
95
724
591.6K
Ed Shrager retweetet
Baseten
Baseten@baseten·
let there be inference
English
55
50
357
332K