
Eric Hartford
12K posts

Eric Hartford
@QuixiAI
We make AI models Dolphin and Samantha BTC 3ENBV6zdwyqieAXzZP2i3EjeZtVwEmAuo4 https://t.co/3ri2GbXrQB https://t.co/zH0F3pTjjY @dphnAI



Something BIG is coming





Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.

After seeing these tweets, I decided to try it out on my own old Ubuntu computer with RTX 1070 GPU (the one that I just upgraded from 16.04 all the way to 24.04 the other day). Asked Codex on my Mac to connect to that machine and install and test qwen3 8b. So far really impressive - running 30 tok/s!












