
Phil
2.2K posts




cool


If your kid’s lemonade stand processes 0.5–1% of US GDP, then yes, that’s a fair analogy for @tryramp. Ramp’s data is useful for the same reason it gets cited at all: it is quite consistent with the revenue figures OpenAI and Anthropic release. If it weren’t, no one would care.

Introducing Desloppify v.0.8. Thanks to many workflow improvements + new agent planning tools, it can now run for days on end - autonomously finding, understanding, & fixing large and small code quality problems. There's no reason your slop code can't be beautiful!


Ok svg benchmark has been saturated. It does it in real time so you can watch. These guys released the StarVector paper last year.



This is crazy


Qwen 3.5 goes bankrupt on Vending-Bench 2



Gemini 3 Deep Think scores 84.6% on ARC-AGI-2





















