Fedesco
1.5K posts

Fedesco
@Fedesco5
"possibility is the square / of experience" (Jeffrey Thomson, "Imaginary Numbers").
Joined May 2020
128 Following · 53 Followers

@emollick I was just trying to explain this to the engineering team where I work last week. I thought of sending them your latest Substack. The same prompts are delivering far worse results through APIs than through Claude Code and Codex. It's got to be the s̶h̶o̶e̶s̶ harness!

@ajambrosino @SIGKITTEN @bughuntergeek Did a recent update fix it? I've tried it a few times, and Codex tells me the screenshots are there, but it has no access to them. There are some use cases I can think of where it could help quite a lot.

@deredleritt3r @inductionheads Great distinction! We may need a few more steps for LLMs to do tasks well enough to be trusted on their own. Point (2) catches lots of real-world messiness; people won't let go of their turf quickly. I see a few AI centaurs doing most of the work while colleagues talk in the hall

Right, makes sense.
I think there are two separate questions: (1) what is the level of model intelligence needed to handle a particular task (with appropriate interfaces, scaffolding, etc.)? and (2) how does the rubber actually hit the road in having AI handle such tasks autonomously?
I was really talking only about (1), but (2) is also critically important.

@deredleritt3r @inductionheads Where I live, a large chunk of jobs are political appointments made to repay political favors. A friend, a vice-minister, had separate assistants to handle her email and her phone calls. In such contexts, AI may take over a job's tasks, but political pressure may keep the job itself open.

@inductionheads That's right. Most jobs don't actually require all that much intellectual horsepower. This is likely a blind spot for the frontier labs BTW, since everyone employed there is a giga-genius working on frontier research all day.

@thsottiaux @luke_pighetti I've been using this Epson photo printer app for years. I didn't love it, but it worked. It's about to lose compatibility with macOS. So I just asked Codex to create something better suited to my needs. Man, what I have now, an hour later, is so much better. A sartorial workspace

@luke_pighetti Things will get better once everyone starts to use Codex

I have to admit... Codex has really grown on me. I've pretty much got it open and running some sort of process throughout the day at all times now.
Having it connected to my Gmail, my calendar, my Slack, my Granola, and being able to see my personal journals is a game-changer. It writes draft emails for me based on my calendar and what it knows about me from my journals and my meetings. And 9 out of 10 times, I can just click send on the drafts with no edits now.
I'm also using it as the front-end of a personal wiki that I built. I save articles and YouTube video transcripts of content that I learned something from into the wiki. It creates .md files with front-matter for every piece of content I add, cross-links ideas, people, channels, etc. and then I can just chat directly inside Codex and it responds based on what's been saved to my wiki.
I've basically got an email management platform, wiki, journaling app, CRM, meeting organizer, and everything in between all from within Codex. All my markdown files are also inside Obsidian so I have a second visibility layer for everything as well. But, for the most part, I almost never open Obsidian anymore. I can do everything I need from Codex.
It's pretty sweet! I'll likely make a video about it soon but it's pretty "in the weeds" and too nerdy for most. (But I'll probably make it anyway)
If there were just ONE thing that I wished Codex would allow me to do... I'd love to be able to connect multiple Gmail accounts to it to help me manage my multiple inboxes. But I can seemingly only have one email account connected at a time... Other than that, it's doing everything else I want it to.

Mythos seems to be a very capable model based on available information, but it is not a cybersecurity model - it is an advanced general purpose model that happens to be good at cyber because it is good at a bunch of things. Anthropic stated that they were worried about cybersecurity risk, and their efforts mean it is a restricted model with lots of government attention.
OpenAI and Google will pass the same threshold soon (and may already have with unreleased models), and the question is whether they are as worried about cybersecurity risks, or whether they think their guardrails will hold. Currently, the degree to which models pose cyber risk is entirely self-reported and not regulated. That means that OpenAI and Google could release Mythos-class models if they want, by assessing the risk differently and making different decisions.
Does that mean Anthropic is at a disadvantage because it can't release its equivalent model? Will OpenAI and Google also be somehow restricted from releasing their Mythos competitor? It all seems pretty unclear right now.

@emollick After hearing the announcement, I decided to stand in the background for a few hours and wait for you to give it a try. No surprise here with regard to Gemini, then. Some of the AI tools they are building into their workspace app are far better than what comes from Gemini itself.

Want to secure an early ticket to OpenAI DevDay? Build something with GPT-5.5 and Image Gen.
Each week, we’ll select 2–3 favorites to win free tickets to OpenAI DevDay 2026. Codex will help us find the best submissions and our team will select the winners.
Reply with #OpenAIDevDay2026, a playable link, and a quick note on how you built it.
OpenAI @OpenAI
OpenAI DevDay is back. San Francisco, September 29

@OfficialLoganK @vaaselene I'd love to know where it excels for you when compared to other frontier models. For me, it's multimodality.

@shaunralston @testingcatalog If they did, lots of people's phones wouldn't stop vibrating over dinner

@testingcatalog will it push notify when you've exceeded rate limits?

Claude Code can now notify you via a push notification when it has finished the task.
Claude push 👀
ClaudeDevs @ClaudeDevs
To enable: install the Claude mobile app → /remote-control to pair the mobile app → /config → enable "Push when Claude decides". Read more in the docs: code.claude.com/docs/en/remote…

terminal has been my primary interface to my computer for almost two decades. now it’s the Codex app.
Yam Peleg @Yampeleg
I was not expecting the Codex App to be even better than using the terminal. Highly recommend everyone to try. If you are on Linux just tell GPT-5.5-xhigh to “find a way to get it, it’s known to be easy”

@XFreeze Of course, these tricks, though interesting, are not reflective of real-world work tasks. That's where ChatGPT (via its family of apps and harnesses) and Anthropic (same) are nowhere near being matched by Grok. Let's hope Grok does get there. Having more choices in this space is great

The exact same question to Grok 4.3, GPT 5.5, and Claude Opus 4.7:
“Count to 10 starting from 11”
Grok 4.3 wins 🏆
Every single time
It gave 11, 10 and explained why going backwards was the only logical move... The others started counting from 11 to 20
Grok’s logical reasoning is at a level most models still can’t even touch


@thsottiaux Created a whole workbook (using Image Gen) for a young learner following the same style and imprinting it all with a pedagogical sequence. Also, creating a detailed prompt pack and pipeline for educational materials. And creating an MD reader/editor attuned to my use cases.

@DeryaTR_ Agree. I think he's the best embodiment I've seen of the Codex vibe so many of us are enjoying and benefitting from. Every one of @thsottiaux's messages is a delight.

I declare Tibo my favorite person of the month! Codex has achieved escape velocity and we are going to seek out things we could only dream of!
Tibo @thsottiaux
We will ship again this week. Codex has achieved escape velocity and will keep improving rapidly.
















