Braintrust

658 posts

Braintrust banner
Braintrust

Braintrust

@braintrust

The observability layer for production AI.

Katılım Ağustos 2023
54 Takip Edilen6.6K Takipçiler
Sabitlenmiş Tweet
Braintrust
Braintrust@braintrust·
Braintrust has raised an $80M Series B. We're building the infrastructure that helps teams measure, evaluate, and improve their AI products. Don't take our word for it. Hear how @NotionHQ, @Vercel, @Navan, and @billcom use Braintrust to ship quality AI.
English
28
21
284
128K
Braintrust
Braintrust@braintrust·
An eval platform is more than just a test runner. Evals require shared definitions of "good," reliable data pipelines, labelling workflows, versioning, and trust in results across many teams and model changes. Hear about the design principles behind Braintrust's platform in this session from @aidotengineer.
English
1
0
3
363
Braintrust
Braintrust@braintrust·
Evals course module ten: building a multi-turn chat app. Move from single-turn to multi-turn use-cases by building a chatbot CLI app with production logging. Use init_logger, wrap_openai, and @ traced to capture every conversation as a single trace. More here → braintrustdata.link/evals-course-y…
English
0
0
0
135
Braintrust
Braintrust@braintrust·
Evals 101: a new course from Braintrust. Everything you need to know about evals, and how to do them yourself. Module one: Why are evals important? - the six most common problems developers face when shipping AI applications - why traditional software thinking doesn't apply to AI - how evals can fix these problems
English
5
1
72
35.6K
Braintrust
Braintrust@braintrust·
For AI PMs, evals are the new PRD. At @PLEDalliance Summit New York, Ameya Bhatawdekar discussed the new product development loop and how to translate every element of a traditional PRD into its eval equivalent.
Braintrust tweet media
English
1
0
4
220
Braintrust
Braintrust@braintrust·
If you're building AI products but aren't writing evals, this is the place to start. In Evals for engineers, solutions engineer Doug Guthrie will show you how to: - Instrument an agent with the Braintrust SDK - Look at traces across model calls, tool use, and outputs - Build datasets from failure modes and write scoring functions - Iterate on your prompt and measure quality over time
English
1
1
1
406
Braintrust
Braintrust@braintrust·
Braintrust x @Nasdaq Thank you to @wing_vc and congratulations to everyone on this year's Enterprise Tech 30.
Braintrust tweet media
English
1
3
10
627
Braintrust
Braintrust@braintrust·
The timeline view now shows token distribution and cache hit rates across your trace spans. Scale visualizations by tokens or cost instead of duration to diagnose context bloat and optimize LLM spend.
English
10
0
8
325