Harry Marr

2.1K posts

Harry Marr banner
Harry Marr

Harry Marr

@harrymarr

Member of Technical Staff at Anthropic. Previously at @github, @dependabot, @monzo, @gocardless.

Brooklyn, NY Katılım Mayıs 2009
364 Takip Edilen1.5K Takipçiler
Harry Marr retweetledi
Claude
Claude@claudeai·
Introducing Claude Opus 4.6. Our smartest model got an upgrade. Opus 4.6 plans more carefully, sustains agentic tasks for longer, operates reliably in massive codebases, and catches its own mistakes. It’s also our first Opus-class model with 1M token context in beta.
English
1.7K
4.8K
39.4K
10.6M
Harry Marr retweetledi
Sam Bowman
Sam Bowman@sleepinyourhat·
Opus 4.5 is a very good model, in nearly every sense we know how to measure. I’m also confident that it’s the model that we understand best as of its launch day: The system card includes 150 pages of research results, 50 of them on alignment.
Claude@claudeai

Introducing Claude Opus 4.5: the best model in the world for coding, agents, and computer use. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done.

English
10
28
460
44.7K
Harry Marr
Harry Marr@harrymarr·
It’s a good’un - really excited to see this ship. I’ve had lots of moments of joy watching it crush problems I thought it wouldn’t be able to handle. It’s not only smarter, but also more token-efficient *and* 1/3 the cost per token compared to the previous Opus.
Claude@claudeai

Introducing Claude Opus 4.5: the best model in the world for coding, agents, and computer use. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done.

English
0
0
1
254
Harry Marr retweetledi
David Mytton
David Mytton@davidmytton·
Big day - announcing @arcjet's Series A + our new local AI security model 🚀 An opt-in AI layer that runs expert security analysis for every request, entirely locally. 🏡 Accurate detection is the hardest part of security. 🎯 Legacy network-edge tools see packets, not users or logic. Real context lives in your code - where better decisions can actually be made. 🤖 That’s why we built Arcjet’s first AI security model. 🛡️ It runs inference locally in milliseconds, right inside your request handlers. Adds an extra layer to your defenses so you can ship faster and safer. 🍰 Arcjet now protects 500+ production apps used by 1,000+ developers - stopping bots, scrapers, spam, and fake accounts. 🕷️ So we’ve raised an $8.3M Series A led by @pluralplatform, bringing total funding to $12M. Also participating: @a16z, @seedcamp, @feross, and @jeffiel 💸 I'm excited to work alongside a small but exceptional team building the security platform that ships with your code. 🚀
David Mytton tweet media
English
7
15
42
8.5K
Harry Marr retweetledi
Claude
Claude@claudeai·
Introducing Claude Sonnet 4.5—the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.
Claude tweet media
English
1.1K
3.2K
20.1K
5M
Harry Marr retweetledi
David Mytton
David Mytton@davidmytton·
I’m excited to announce @arcjet has raised $3.6m seed funding to build the future of developer security, led by @zanelackey at @a16z! Also participating in the round are @seedcamp and a roster of great angels. Although deploying code has become simpler, production security remains too difficult. Developers need a set of tools that solve real problems in the way they’re used to - with code. Arcjet helps developers deal with: - Spam & fraudulent signups - Unwanted bots scraping content - Enforcing dynamic, per user rate limits. - Scanners and attacks like SQLi, XSS, etc Embedding security rules into the application also means they can be tested. If you’ve ever turned on a new security tool then you know the pain of suddenly breaking production because you couldn’t test it locally or on staging! For the last few years I’ve been playing with devtools all day every day for the @consoledotdev newsletter, so I’m excited to be working on bringing a focus on developer experience to security tooling! More details on the Arcjet blog (link below) @arupchak @charlesfitz @tweetsbycolin @duncanjennings @geoffbelknap @harrymarr @ianlivingstone @mxstbr @stopman @dessaigne @nitayj @synopsi @rishabhkaul @arcurn @swyx @t3dotgg @thomaspaulmann
David Mytton tweet media
English
9
14
71
18K
Harry Marr
Harry Marr@harrymarr·
@kaythaney yeah at first i just thought it was the folks above us using their vitamix, but then we got some proper creaking and shaking... how about you?
English
1
0
0
45
Harry Marr
Harry Marr@harrymarr·
I had no idea that earthquakes were part of the deal when I moved to brooklyn
English
1
0
7
1K
Harry Marr
Harry Marr@harrymarr·
@MTsireud I tried a third party extension a while back but it didn’t work. Maybe I should try again - I’ve been thinking “just one more month then _obviously_ this’ll be a native feature”, for about 16 months at this point!
English
0
0
1
85
Mark Tsirekas
Mark Tsirekas@MTsireud·
@harrymarr There’s been a few openAI tools based on the use case. Have you tried them? Still a good question why google hasnt
English
1
0
0
73
Harry Marr
Harry Marr@harrymarr·
A =gemini(prompt, range) function in Google Sheets would be so damn useful. I'd gladly pay for the tokens. It'd be trivial to build and could only help AI revenue. Seems like a no brainer. Curious why Google still hasn't shipped it
English
3
0
6
1.1K
Ben Gilbert
Ben Gilbert@gilbert·
When next week's @AcquiredFM drops, some percent of listeners will go "omg YES" and the others (majority?) will never have heard of the company at all.
English
42
14
200
35.5K
Pete Hamilton
Pete Hamilton@peterejhamilton·
We’ve just launched @incident_io On-call and I couldn’t be more excited! 🎉 Our goal was “so good, you’ll break things on purpose” and we’ve poured our hearts and souls into this one. I think it shows ❤️
English
49
49
315
150.3K
Harry Marr
Harry Marr@harrymarr·
@zeeg For OpenAI’s models, response_format: { type: “json_object” } should do the trick. For mistral etc, some providers (together, anyscale) have also implemented the same feature and even let you provide a schema: together.ai/blog/function-…
English
0
0
1
116
David Cramer
David Cramer@zeeg·
Whats the most reliably mechanism you've found for getting structured output from LLMs? ChatGPT's models were probably 90-95% reliable for JSON, but that error margin is even higher for some others.
English
13
0
8
5.4K
Harry Marr retweetledi
Erin Havens
Erin Havens@erinhavens·
Today, Dependabot will cut false positives and reduce alert fatigue substantially -- to the tune of 15% of all npm alerts. github.blog/2023-05-02-dep…
Erin Havens tweet media
English
2
11
29
7.5K
Harry Marr
Harry Marr@harrymarr·
But don't worry, if you do show it some love, it'll forgive you and come bounding back as eager as ever 🤖🤸
English
0
0
3
429