Geoffrey Angus

151 posts

Geoffrey Angus

@GeoffreyAngus

Building stuff. Formerly @Google, @Stanford.

San Francisco, CA 가입일 Kasım 2015

354 팔로잉199 팔로워

Geoffrey Angus 리트윗함

Anthropic@AnthropicAI·28 Şub

A statement on the comments from Secretary of War Pete Hegseth. anthropic.com/news/statement…

English

2.9K

6.7K

42.8K

17.6M

Geoffrey Angus 리트윗함

Zvi Mowshowitz@TheZvi·24 Kas

They're burying a lot here. There's a 66% price cut from Opus 4.1 to $5/$25, it uses fewer tokens to solve problems, upgrades to Claude Code in the app, no more length limits on conversations, no more Opus-specific plan caps...

Claude@claudeai

Introducing Claude Opus 4.5: the best model in the world for coding, agents, and computer use. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done.

English

1.2K

224.9K

Geoffrey Angus 리트윗함

Claude@claudeai·24 Kas

English

1.1K

2.5K

19.3K

7.8M

Geoffrey Angus@GeoffreyAngus·18 Eki

@piercefreeman I agree and thought this was compelling specifically for logs alexzhang13.github.io/blog/2025/rlm/ Treat walls of logs like a human would: poke at it from a distance

English

Pierce Freeman@piercefreeman·18 Eki

two million tokens is such a crazy large amount of context (it's basically all of Shakespeare's known works) until you start trying to have it parse logs then you might as well be holding a magnifying glass up to an encyclopedia

English

327

Geoffrey Angus 리트윗함

sam mcallister@sammcallister·3 Eki

GOOD MORNING NEW YORK CITY COME DO YOUR BEST THINKING AT OUR THINKING SPACE IN THE WEST VILLAGE SAY NO TO SLOP

English

285

182

5.9K

2.8M

Geoffrey Angus 리트윗함

anton 🇺🇸@atroyn·30 Eyl

it’s difficult not to make the aesthetic comparison between the claude “keep thinking” ads and the sora 2 launch

Claude@claudeai

Keep thinking.

English

135

4.6K

356.7K

Geoffrey Angus 리트윗함

Sabri Eyuboglu@EyubogluSabri·29 Eyl

@StanfordHAI just ran this story on self-study and cartridges -- it's a really nice overview for those curious about our work

English

10.2K

Geoffrey Angus 리트윗함

Ethan Mollick@emollick·29 Eyl

I had some early access to Sonnet 4.5. It is a really good model. I saw especially big jumps in doing finance and statistics, which tend to get overlooked in the focus on coding.

English

910

61.9K

Geoffrey Angus 리트윗함

Sholto Douglas@_sholtodouglas·25 Eyl

Incredible work - this should immediately become one of the most important metrics for policy makers to track. We’re probably only a few months from crossing the parity line. Huge props to OAI for both doing the hard work of pulling this together and including our scores. Nice to see Opus on top :)

Tejal Patwardhan@tejalpatwardhan

Understanding the capabilities of AI models is important to me. To forecast how AI models might affect labor, we need methods to measure their real-world work abilities. That’s why we created GDPval.

English

903

162.4K

Geoffrey Angus 리트윗함

Lisan al Gaib@scaling01·22 Eyl

Tri Dao says Claude Code makes him 1.5x more productive and that it's quite helpful at writing Triton kernels

English

459

272K

Geoffrey Angus 리트윗함

M1@M1Astra·18 Eyl

the first good ai ad campaign

Claude@claudeai

Keep thinking.

English

354

9.6K

459K

Geoffrey Angus 리트윗함

Claude@claudeai·18 Eyl

Keep thinking.

English

870

26.6K

5.9M

Geoffrey Angus 리트윗함

Thariq@trq212·12 Eyl

The Claude Code SDK now supports custom tools and hooks directly in code. Additionally, we’ve refreshed all our docs with complete references and 10 new guides on how to utilize the SDK.

English

108

1.4K

639.3K

Geoffrey Angus@GeoffreyAngus·24 Ağu

@ShreyaR omg rip

English

shreya rajpal@ShreyaR·24 Ağu

TIL daily driver is dead

Juliana@juliazniv

new cafe in dogpatch just dropped ☕️🌿

English

3.7K

Geoffrey Angus@GeoffreyAngus·14 Ağu

@ShreyaR @snowglobe_so So cool!!!!

English

Geoffrey Angus 리트윗함

shreya rajpal@ShreyaR·14 Ağu

Introducing ❄️ @snowglobe_so, the simulation engine for AI chatbots. Magically simulate the behavior of your users to test and improve your chatbots. Find failures before your users do.

English

116

568.1K

Geoffrey Angus 리트윗함

will brown@willccbb·17 Tem

cant stop thinking about this one insanely elegant, seems insanely powerful

English

841

102.1K

Geoffrey Angus 리트윗함

Together AI@togethercompute·2 Tem

Announcing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models. Built in collaboration with the @Agentica_ team. 💪 DeepSWE is trained with rLLM, Agentica’s modular RL post-training framework for agents. rLLM makes it easy to build, train, and deploy RL-tuned agents on real-world workloads — from software engineering to web navigation and beyond. 🤗 As always, we’re open-sourcing everything: not just the model, but the training code (rLLM), dataset (R2EGym), and training recipe for full reproducibility. 🔥 Train DeepSWE yourself. Extend it. Build your own local agents. No secrets, no barriers. DeepSWE and rLLM mark our major shift: from training language reasoners to building language agents that can truly learn from experience. We believe the future of AI lies in experience-driven learning — and we’re here to democratize it. Welcome to the era of experience. 🌍

English

497

270.1K

Geoffrey Angus 리트윗함

Pierce Freeman@piercefreeman·24 Haz

Text diffusion models might be the most unintuitive architecture around Like: let's start randomly filling in words in a paragraph and iterate enough times to get something sensible But now that google's gemini diffusion is near sota, I think we need to take them seriously

English

700

Geoffrey Angus 리트윗함

Dylan Patel@dylan522p·23 Haz

The Nvidia Tensor Core is the most important evolution of computer architecture in the last decade We explain why / how it's evolved Shout out to collaborators @bfspector @tri_dao @colfaxintl @charles_irl @ia_buck Neil Movva Jonah Alben esp @simonguozirui for the cutest cover pic

SemiAnalysis@SemiAnalysis_

NVIDIA Tensor Core Evolution From Volta To Blackwell Amdahl’s Law, Strong Scaling Asynchronous Execution Blackwell, Hopper, Ampere, Turing, Volta semianalysis.com/2025/06/23/nvi…

English

319

50K

탐색

@piercefreeman @StanfordHAI @ShreyaR @snowglobe_so @Agentica_ @elonmusk @BarackObama @taylorswift13