Derek Haynes

478 posts

Derek Haynes banner
Derek Haynes

Derek Haynes

@dhaynes23

Builder of observability, developer tooling, and AI systems

Colorado, USA Katılım Haziran 2008
77 Takip Edilen253 Takipçiler
Derek Haynes
Derek Haynes@dhaynes23·
@zeeg That UI is 🤌 ... how much back-and-forth did you have with Claude to put that together?
English
1
0
0
145
David Cramer
David Cramer@zeeg·
Shipped v1 of what I'm now calling Abacus (sorry not sorry). Pulls in Claude Code and Cursor usage, lets you look at them globally as well as per individual. Heavily focused on understanding consumption and cost. and its open source now, glhf github.com/getsentry/abac…
David Cramer tweet media
English
4
1
41
6.3K
Derek Haynes
Derek Haynes@dhaynes23·
Speaking as someone who has maintained codebases over many years—and who cares deeply about code quality and long-term velocity—my first move today when enhancing a project is Claude Code, not opening my IDE.
English
0
0
0
28
Derek Haynes
Derek Haynes@dhaynes23·
This is the kind of side project that never launched prior to Claude Code ... digging thru the OpenTelemetry schema, creating an OTEL collector, building a UI to align audio with trace spans ... this grunt work loses steam when it lives outside your day job.
English
0
0
0
37
Derek Haynes
Derek Haynes@dhaynes23·
@zeeg I like it ... similar approach I've used (jumps right to the implementation): #implementing-my-streamlined-edd-flow" target="_blank" rel="nofollow noopener">dlite.cc/2023/10/04/202… What is the output of assertLLMAgrees?
English
0
0
1
17
David Cramer
David Cramer@zeeg·
Im going to build a synthetic test suite that runs the production evals. I'll then take the results of those evals, and pass them through the LLM again to get it to "verify" the output roughly equals the output I want. assertLLMAgrees(output, "Indicates the game is 4 players");
Truckee, CA 🇺🇸 English
2
0
0
420
David Cramer
David Cramer@zeeg·
Thinking more on this "how do I test my LLM implementation", with my "I dont know shit" approach...
Truckee, CA 🇺🇸 English
1
0
3
556
Donn Felker
Donn Felker@donnfelker·
@NathanSRobinson I found I hit the quotas on Heroku quickly, especially for rails apps. After moving my stuff over to @hatchboxio with Digital Ocean droplets, my costs plumetted, performance skyrocketed and I'm good to go.
English
4
0
6
353
Nathan S. Robinson
Nathan S. Robinson@NathanSRobinson·
My app broke because too many people used it (and because performance was an afterthought when I built it) Guess I need to prioritize some of those performance optimization tasks that's been on my todo for a few days... I honestly thought I could get away without it for much longer.
Nathan S. Robinson tweet media
English
23
0
40
7.1K
Derek Haynes
Derek Haynes@dhaynes23·
@zeeg How did Sentry emerge as the dominant exception monitoring service against A LOT of other products?
English
0
0
1
32
Derek Haynes retweetledi
Andrei
Andrei@rushing_andrei·
Ruby AI Survey 2023: docs.google.com/forms/d/e/1FAI… We'd really appreciate 2 min of your time to get a better idea of what problems Rubyists are experiencing building with LLMs. The results will be shared!
English
0
7
9
2.1K
Rosmine
Rosmine@rosmine·
With so many different solutions for Chat with a PDF/knowledge database/website, how is there no Chat with AWS documentation?? It takes me at least 7 hours of searching to find anything in there
English
3
0
5
422
Derek Haynes
Derek Haynes@dhaynes23·
@zeeg From the post: > ...you’ll note Sentry existed for about five years before we launched the cloud service. That was half a decade of natural needs-driven growth... It's pretty crazy how much evals/monitoring/etc exist around LLM apps with relatively few AI apps deployed.
English
0
0
1
81
Derek Haynes
Derek Haynes@dhaynes23·
@_cartermp > analyzing data relevant to a query I think this would be interesting to explore w/Honeycomb .. similar to ChatGPT's Advanced Data Analysis. Sometimes I upload a CSV and just ask it to explore.
English
0
0
0
17
Derek Haynes
Derek Haynes@dhaynes23·
@_cartermp So, I wouldn't bet on AIOPs to magically make sense of messy data.
English
1
0
0
9
Derek Haynes
Derek Haynes@dhaynes23·
Last weekend I shared my streamlined Eval Driven Development (EDD) approach at the excellent AiPeaks.org. Evals were a big topic from @aiDotEngineer summit. What's worked for me:
English
1
0
0
102