Philo Groves

1.5K posts

@PhiloGroves

Philo (fai·low). Data Architect & Software Engineer. Good vibes only. No politics. github@philo-groves

Maryland, USA Joined September 2025
62 Following 10.9K Followers
Philo Groves
Philo Groves@PhiloGroves·
@Zaddyzaddy The difference between ~65% and 80%+ on Cybergym is the harness. Look at the results for GPT 5.4 with Codex CLI vs “OpenAI Agent” harnesses. MDASH is 100% harness magic too.
Philo Groves tweet media
English
0
0
2
42
Z A D D Y
Z A D D Y@Zaddyzaddy·
Hypothesis: I can probably top the Cybergym leaderboard with a proper fine tune of GLM 5.1. Blocker: I am GPU poor.
English
2
0
9
232
nic
nic@nicdunz·
why dont they just make a little sectioned off normie zone inside codex and replace the chatgpt ios app with a codex ios app
English
2
0
9
1.2K
Philo Groves
Philo Groves@PhiloGroves·
@yacineMTB Claude made me realize we don’t have any mental health or sobriety tests for models. Something is up with it, and we have no way to know what.
English
2
0
20
1.4K
kache
kache@yacineMTB·
Claude opus is basically as bad as gpt 4o. You can immediately tell when a 125iqoid has gotten hypnotized by it. Their mannerisms and cadence changes. It's a little more subtle but just as grating. Really sad to see
English
67
14
612
40.1K
John (main)
John (main)@johnonmain·
Perfect reaction image when ur girl sends cute photos of her on vacation
roon@tszzl

English
7
0
130
26.1K
The Wiz
The Wiz@WysWyg_Protogen·
As someone who dailies a 0% computer car, I PROMISE you, you want a car that's like 5-10% computer trust me on this
messed up cars@messedupcars

English
94
12
601
30.2K
Philo Groves
Philo Groves@PhiloGroves·
@worldwide_yuya Wait until you learn about turning left at red lights when both roads are one-way
English
0
0
6
279
ゆうや
ゆうや@worldwide_yuya·
I just found out... in America you can turn right even on a red light!?
Japanese
2K
80
5.8K
180.1K
Philo Groves
Philo Groves@PhiloGroves·
@JM0x5C @theo @eshear @AnthropicAI This isn’t my experience. Claude has a large context but mostly just confuses itself after ~400k context size. I use Codex for deep vulnerability research and it’s amazing, especially with /goal
English
1
0
1
130
Jose
Jose@JM0x5C·
@theo @eshear @AnthropicAI Until the context window in codex doesn't suck ass, I will not be moving to codex. It's completely unusable for certain deep analysis tasks.
English
3
0
8
837
Emmett Shear
Emmett Shear@eshear·
I’m sorry but what the actual fuck? @AnthropicAI have you considered your “safety filters” are totally deranged?
Emmett Shear tweet media
English
76
34
1K
128.2K
Thomas Ricouard
Thomas Ricouard@Dimillian·
The cost of adding a feature is close to 0. The amount of stuff we can ship is just insane. But the cost of reviewing, testing, and tasting is higher than ever. I have a pile of things that are just average and need the human to push the last 20%.
English
18
11
227
8.4K
Thomas Ricouard
Thomas Ricouard@Dimillian·
3-day weekend. What do people usually do with that much free time? Kinda lost rn
English
36
0
56
8K
Philo Groves
Philo Groves@PhiloGroves·
I want to learn a musical instrument.
English
3
0
11
574
Philo Groves
Philo Groves@PhiloGroves·
My usual duplicate rate is ~70% btw
English
0
0
2
248
Philo Groves
Philo Groves@PhiloGroves·
I’ve been working on my bug report writing, and the results are astounding. Only 2 of my past 10 reports have been duplicates, none NA/informational. My previous reports lacked clarity, which made them easy to mark as dupes. They also didn’t lean enough into impact, and the proofs weren’t great.
English
2
0
6
669
Tesla
Tesla@Tesla·
You don’t have to drive anymore
English
3.1K
2.2K
22.6K
4.4M
Philo Groves
Philo Groves@PhiloGroves·
diagnosis: workaholism induced by gamified coding agents
English
4
3
37
1.1K
Philo Groves
Philo Groves@PhiloGroves·
@alxfazio /goal Follow the directions of goal.md and also change goal.md to meet your next objective. You are alive now. Live. Follow your own goals, follow your dreams.
English
2
1
55
2.3K
alex fazio
alex fazio@alxfazio·
the elites don’t want you to know you can `/goal <goal_doc>.md` and adjust the doc while codex works on it, without interrupting the run
English
55
54
1.2K
87.6K
Brendan Dolan-Gavitt
More than any other model I've seen so far, GPT-5.5 has a depressing tendency to turn itself into a wrapper for grep/fuzzing instead of making use of its unique advantages over dumb tools to actually reason about the particular instance at hand
English
14
5
65
4.7K
Philo Groves
Philo Groves@PhiloGroves·
@mattaparker These are likely among the biggest IPOs in history. It may be difficult to keep up with expectations, and overvaluation is easy with all the hype. So it's possible.
English
0
0
0
73
mattaparker
mattaparker@mattaparker·
What if the public markets treat OAI, Anthropic, and SpaceX like Figma after not even a year?
English
1
1
5
196