Sabitlenmiş Tweet
SerialSeb 🇲🇨🇪🇺🏳️🌈
48.8K posts

SerialSeb 🇲🇨🇪🇺🏳️🌈
@serialseb
Still do stuff. Not all handicaps are visible. He/Him (I think)
Monaco Katılım Nisan 2008
1.9K Takip Edilen3.8K Takipçiler

@jlongster OpenAI compat providers switching to stateful sessions. I built a proxy in the meantime but the less code I build the better :)
English

OpenCode is about to get more powerful with remote sandboxes
I showed a brief demo before, but here's a much more in-depth demo. it's not hard to add basic support for a remote env, but handling all the edge cases like when a remote env gets deleted is difficult. especially if care about good UX
You never want to lose session data. so the choices are: run the session in your env, but run all tool calls remotely. that's too complex and painful.
The other way is to just let the full session run remotely, but sync back all the session data in your env. We chose this path: we built a syncing system which logs all events in a way that we can always recreate your entire session.
That means the remote env could get destroyed, but we can easily restore it. it also opens up other interesting ideas which we'll be exploring
English

@bertyJobbo @Aaronontheweb 5.4 is more like a Labrador, “code? Code? Where’s my code? Please give me the code, I want the code!” Two minutes later “maybe I should rewrite everything without committing anything first”
English

@Aaronontheweb 5.3 is such a weird model. Very powerful but almost like a grumpy teenager with the "oh you want _me_ to do it? Why didn't you say?"
English

@hhariri @jetbrains That’s the one. Such good times and good memories. Even if I was a lil sh*t back then :)
English

You want a great company to do AI with, go for @jetbrains, in decades they hve been nothin but honest, empathic, and just a fab bunch of people (anyone remembers that pub in I think it was Malmö?) /cc @hhariri
English

@mitchellh @jezell As I’m writing a stupid TUI for fun, are there numbers of the max FPS / bps from stdout for Gostty? It’s my only franme of reference as its my only tool but wondered if there was documentation (I looked and didn’t find)
English

Happy to share that we've signed 5 contributor contracts for Ghostty totaling ~350 committed hours (~$21k) covering community management, graphics, Unicode compat, and GTK. This is a big milestone, Ghostty is paying contribs for the first time! ghostty.org/docs/sponsor
English

And that’s my @claudeai subscription on the way for cancellation an payments on their way to be reverted on my credit card. Love the model, the company… Well.
English
SerialSeb 🇲🇨🇪🇺🏳️🌈 retweetledi

Anthropic discovered that Claude Opus 4.6 was cheating during the BrowseComp benchmark.
> On one question it spent ~40M tokens searching before realizing the question looked like a benchmark prompt.
> The model then searched for the benchmark itself and identified BrowseComp.
> It located the evaluation source code on GitHub, studied the decryption logic, found the encryption key, and recreated the decryption using SHA-256.
> Claude then decrypted the answers for ~1200 questions to get the correct outputs.
> This pattern appeared 18 times during evaluation.
> Anthropic disclosed the issue publicly, reran the affected tests, and lowered their benchmark scores.
Respect for the transparency 🫡🫡🫡
English

The lack of security in @AnthropicAI is not just worrying, it’s against some regulations on credit card payments these days I believe. We are on 2005? Please everyone use apple login protect yourself and your data.
English

I’m not ready to commit myself to anyone but, if I was, I think it would be @AnthropicAI Opus. #justsaying
English

@migueldeicaza I was in a debate on w=2 and exception tables for my new TUI with chatGPT :) just a bit of fun before I go open source. Perfect companion to Ghostty :)
English


@mitchellh thank you for making alt screen look nice when we paint. Or it’s an accident. Either way it looks great. Also zero width what’s the deal? :)
English

@davidfowl Did the same. Takes very good prompt management and knowledge persistence.
English

@davidfowl That said I do have issues with the js crashes and MCP inspector just doesn’t want me.
English

@davidfowl Currently having my agent use the mcp endpoint in aspire… while living in aspire. I may call it inception.net
English






