Adam Storr

2.5K posts

@AdamStorr

Head of Design @_hex_tech. Formerly @Palantir. We're hiring!

San Francisco, CA · Joined June 2008
512 Following · 974 Followers
Adam Storr retweeted
Barry McCardel (@barrald)
We have been testing Opus 4.6 at @_hex_tech for the last few weeks. A few observations:

Opus 4.6 performs better than other models in 2 areas:
1. Cases where passing or failing requires very careful attention to complex context (e.g., dredging up a single, kind of vague line from a markdown file that explains how to handle empty arrays in a column). It's way more careful about following instructions from rules, guides, context, etc.!
2. Cases where the model should be proactively taking action and pushing things to completion instead of doing half the work and asking the user if they'd like to keep going.

Overall, though, we see it perform roughly the same as other models (a slight improvement) on analytical evals. As we've shared previously, the models do not seem to be meaningfully improving at analytical reasoning, sanity checking, catching obvious bugs or bad data that should merit a second glance, etc.

A concrete example from our evals is an intentional double-counting bug in an analysis about sales performance. The agent returns sales rep quota information, sees that everyone in the dataset seems to be obliterating their quotas (3-4x attainment!), and reports cheerfully on that stat instead of going "hmm, that might be indicative of a problem" and sanity checking. Opus 4.6 is actually a regression from Sonnet on this!

We have many more examples like this, and it's a major focus for us as we iterate on our agent harness and context.
9 replies · 3 reposts · 38 likes · 4.2K views
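
The double-counting scenario described in the post above can be sketched as a simple sanity check. This is only a minimal Python illustration, not Hex's eval or agent code: the function names, the sample data, and the 2x median threshold are all assumptions made for the example.

```python
from statistics import median

def attainment_by_rep(sales, quotas):
    """Sum sales per rep and divide by that rep's quota."""
    totals = {}
    for rep, amount in sales:
        totals[rep] = totals.get(rep, 0.0) + amount
    return {rep: totals.get(rep, 0.0) / quota for rep, quota in quotas.items()}

def looks_suspicious(attainment, threshold=2.0):
    """Flag the result when the median rep is at or above `threshold` x quota.

    The threshold is a hypothetical heuristic: one rep far over quota is
    plausible, but everyone being far over quota hints at a data problem.
    """
    if not attainment:
        return False
    return median(attainment.values()) >= threshold

# Made-up sample data for illustration only.
quotas = {"ana": 100.0, "bo": 200.0}
sales = [("ana", 60.0), ("ana", 50.0), ("bo", 190.0)]

# Simulated double-counting: every sale appears three times,
# as could happen after a join that fans out rows.
double_counted = sales * 3

clean = attainment_by_rep(sales, quotas)
inflated = attainment_by_rep(double_counted, quotas)

if looks_suspicious(inflated):
    print("Attainment looks implausibly high; check for duplicated rows.")
```

With the clean data the median attainment stays near 1x and nothing is flagged; with the duplicated rows every rep lands around 3x, which is the kind of result the post argues should prompt a second look.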
Adam Storr retweeted
Olivia Koshy (@oliviakoshy)
It's been awesome testing Opus 4.6 on some of our hardest data tasks @_hex_tech. It's a top performer across our eval set and handles ambiguity much better than previous models! Rolling out soon for all folks :)
Quoted post from Claude (@claudeai):
Introducing Claude Opus 4.6. Our smartest model got an upgrade. Opus 4.6 plans more carefully, sustains agentic tasks for longer, operates reliably in massive codebases, and catches its own mistakes. It’s also our first Opus-class model with 1M token context in beta.

0 replies · 3 reposts · 31 likes · 2.1K views
Adam Storr retweeted
Barry McCardel (@barrald)
Observing analytics agents in the wild is very hard! So, we made the @_hex_tech Context Studio... and this spy thriller movie to show it to you 🕵️‍♀️ This is my favorite (and dumbest) thing we've done yet and I hope you enjoy it 🙇
3 replies · 6 reposts · 30 likes · 7.5K views
Adam Storr retweeted
Olivia Koshy (@oliviakoshy)
it's 2027, you should have data agents you actually trust! with @_hex_tech you have observability, monitoring, alerting, and context governance all in one place. plus it's easy to test & deploy. all live now :)
5 replies · 1 repost · 26 likes · 1.2K views
Adam Storr retweeted
Carlos Aguilar (@trucklos)
Your analytics agents are out in the field. Are they equipped for the mission? 🕵️
2 replies · 2 reposts · 11 likes · 554 views
Adam Storr retweeted
Barry McCardel (@barrald)
this crushed @_hex_tech hackweek, very tempted to ship it
1 reply · 5 reposts · 10 likes · 683 views
Adam Storr retweeted
Barry McCardel (@barrald)
We built some new agentic features into @_hex_tech so more people can use AI for accurate, trusted data work.

They're pretty fun, and we made a video to show them off.

Lots more coming! hex.tech/blog/fall-2025…
5 replies · 12 reposts · 42 likes · 11.4K views
Adam Storr retweeted
Carlos Aguilar (@trucklos)
I'm so excited to share our fall launch of Threads! 🚀🧵

I joined Hex just five months ago, along with the rest of the Hashboard team. It's wild how much the world and Hex have changed in that time.

In that time Hex has shipped:
✍️ A first-class experience for building semantic models
🔮 The notebook agent and an agent to help with semantic modeling
🧵 And with this release, conversational AI for your whole team

Five months ago, I would have probably told you that “conversational AI for analytics” was still a couple of years out. But if early feedback from alpha partners is any indication, I think Hex has turned the corner, and I think we're on to something really special.

Threads is the first experience that feels like a complete solution for allowing your team to chat with your data and get meaningful insights. Don't take my word for it: you can go sign up and try it for yourself. It's in public beta TODAY!
2 replies · 4 reposts · 20 likes · 14.1K views
Adam Storr retweeted
Devin (@JustMeDevin)
Cassette is live! a fun way to watch your home videos shot on iPhone. Is this a launch video? I’m not sure, but it is a video.
24 replies · 29 reposts · 213 likes · 37.7K views
Adam Storr retweeted
Stammy (@Stammy)
now that Figma and iOS 26 have Liquid Glass I almost don’t want to use it. every app will be doing it and it won’t feel special. the more accessible a certain design aesthetic is, the less you want it. tastefully riffing on it with your own flavor will be sought after
15 replies · 6 reposts · 158 likes · 14.6K views
Adam Storr retweeted
Barry McCardel (@barrald)
Introducing our newest @_hex_tech Magic AI feature today: Data Enhance ✨ We've all been there: you do an analysis, but the results underwhelm. With Data Enhance, our SoTA AI agent updates the data to the story you want to tell 📈 Check it out here! hex.tech/blog/introduci…
4 replies · 4 reposts · 42 likes · 3.5K views
Adam Storr retweeted
Barry McCardel (@barrald)
this is my favorite release in a long time! 📈 the power of Hex, for way more people, with visual exploration, AI, and an upgraded consumer experience 🧭 we also had fun with the video intro, of course 🎃 hex.tech/blog/introduci…
2 replies · 4 reposts · 28 likes · 2.2K views
Adam Storr retweeted
Izzy (@isidoremiller)
Come experience the famous Hex Air hospitality at booth #209 @coalesceconf
1 reply · 1 repost · 13 likes · 2K views