Jay-F. 😎

626 posts

Jay-F. 😎 banner
Jay-F. 😎

Jay-F. 😎

@only1jayf

building ai systems for boring problems. @Galee Labs (stealth). global

Paris, France 参加日 Ekim 2020
84 フォロー中32 フォロワー
Jay-F. 😎
Jay-F. 😎@only1jayf·
If Claude Mythos doesn’t meet the hype, Anthropic will lose market share.
English
0
0
0
10
Jay-F. 😎 がリツイート
Swati Gupta
Swati Gupta@hrswatigupta·
Anthropic pays $750,000+ a year for engineers who can build LLM architectures from scratch. Stanford taught the entire thing in 1 hour lecture & released it for free. Bookmark & watch this today before someone takes it down and read this article below
Swati Gupta@hrswatigupta

x.com/i/article/2060…

English
24
317
1.5K
222.5K
jimmah
jimmah@jamesdouma·
Been making stuff with grok build this last week. The world is feeling very post-scarcity right now.
English
93
68
1.4K
493.3K
Tom Goodwin
Tom Goodwin@tomfgoodwin·
errmmmmm, not to be miserable but has anyone noticed that agentic AI doesn't really work at all. Like the errors compound, fragile integrations ( any external change breaks it ) , observability is an issue, no verification, context loss, the whole thing seems VERY tricky Not sure this can ever be fixed.
English
132
11
308
34.1K
Jay-F. 😎
Jay-F. 😎@only1jayf·
The downside of frontier LLMs is that they’re engineered to be too polite.. They lack the human capacity for rudeness. I need a model with Russian energy. That raw truth. Have you seen an ai model insults? Even a five year old has better banters.
English
0
0
0
13
Jay-F. 😎
Jay-F. 😎@only1jayf·
Grok voice command recognition could do with a lot more work. Feels slow in the head.
English
0
0
0
5
Jay-F. 😎
Jay-F. 😎@only1jayf·
Grok is proud!!! Always feels it’s better than other models. smh🤦🏻‍♂️
English
0
0
0
1
Jay-F. 😎
Jay-F. 😎@only1jayf·
Comment if you want the one line command
English
0
0
0
0
Jay-F. 😎
Jay-F. 😎@only1jayf·
Codex is better when it remembers how you work, has a very detailed high quality structure for solving tasks and actually uses the skills you have. so i open sourced my skills pack. 120 skills. one command install. works on mac, linux, windows. not prompts. actual reusable workflows. You’ll see the difference immediately.
English
1
0
0
12
Jay-F. 😎
Jay-F. 😎@only1jayf·
Is codex your best bet or you enjoy playing all sides?
English
0
0
0
2
Jay-F. 😎
Jay-F. 😎@only1jayf·
I don’t know about 24+ hours but I can speak on running tasks for hours or through midnight to morning. Set /goal. Give me tasks that lead to goal - in other words a lot of context. Give it elevated permissions or full access. (This makes sure it doesn’t pause to start asking your for permissions). Note that when giving an agent full access, it’s best to always monitor it but using /goal in the same context essentially gives it focus and makes it more reliable.
English
0
0
0
741
albina
albina@enjojoyy·
People that run 24+ hours Codex tasks Can you share what you’re running exactly? Everyone is sharing the hours but not the task itself, I feel that most of them are just engagement baits
English
379
18
1.1K
197.9K
Jay-F. 😎 がリツイート
Kai
Kai@hqmank·
Yesterday I said GPT-5.5 edges out Opus 4.8 for coding. The thread blew up, most folks agreed, but some pushed back hard. Then DeepSWE (a hard long-horizon coding benchmark) dropped its numbers: → GPT-5.5: 70% pass@1, #1 → Opus 4.8: 58% And GPT-5.5 gets there with ~2x faster runs, ~½ the cost, and ~⅓ the output tokens. One benchmark isn't everything. But smarter, cheaper, and faster all at once? Hard to argue with.
Kai tweet mediaKai tweet media
English
18
11
265
23.9K