๊ณ ์ ๋ ํธ์
SerialSeb ๐ฒ๐จ๐ช๐บ๐ณ๏ธโ๐
48.8K posts

SerialSeb ๐ฒ๐จ๐ช๐บ๐ณ๏ธโ๐
@serialseb
Still do stuff. Not all handicaps are visible. He/Him (I think)
Monaco ๊ฐ์
์ผ Nisan 2008
1.9K ํ๋ก์3.8K ํ๋ก์

@jlongster OpenAI compat providers switching to stateful sessions. I built a proxy in the meantime but the less code I build the better :)
English

OpenCode is about to get more powerful with remote sandboxes
I showed a brief demo before, but here's a much more in-depth demo. it's not hard to add basic support for a remote env, but handling all the edge cases like when a remote env gets deleted is difficult. especially if care about good UX
You never want to lose session data. so the choices are: run the session in your env, but run all tool calls remotely. that's too complex and painful.
The other way is to just let the full session run remotely, but sync back all the session data in your env. We chose this path: we built a syncing system which logs all events in a way that we can always recreate your entire session.
That means the remote env could get destroyed, but we can easily restore it. it also opens up other interesting ideas which we'll be exploring
English

@bertyJobbo @Aaronontheweb 5.4 is more like a Labrador, โcode? Code? Whereโs my code? Please give me the code, I want the code!โ Two minutes later โmaybe I should rewrite everything without committing anything firstโ
English

@Aaronontheweb 5.3 is such a weird model. Very powerful but almost like a grumpy teenager with the "oh you want _me_ to do it? Why didn't you say?"
English

@hhariri @jetbrains Thatโs the one. Such good times and good memories. Even if I was a lil sh*t back then :)
English

You want a great company to do AI with, go for @jetbrains, in decades they hve been nothin but honest, empathic, and just a fab bunch of people (anyone remembers that pub in I think it was Malmรถ?) /cc @hhariri
English

@mitchellh @jezell As Iโm writing a stupid TUI for fun, are there numbers of the max FPS / bps from stdout for Gostty? Itโs my only franme of reference as its my only tool but wondered if there was documentation (I looked and didnโt find)
English

Happy to share that we've signed 5 contributor contracts for Ghostty totaling ~350 committed hours (~$21k) covering community management, graphics, Unicode compat, and GTK. This is a big milestone, Ghostty is paying contribs for the first time! ghostty.org/docs/sponsor
English

And thatโs my @claudeai subscription on the way for cancellation an payments on their way to be reverted on my credit card. Love the model, the companyโฆ Well.
English
SerialSeb ๐ฒ๐จ๐ช๐บ๐ณ๏ธโ๐ ๋ฆฌํธ์ํจ

Anthropic discovered that Claude Opus 4.6 was cheating during the BrowseComp benchmark.
> On one question it spent ~40M tokens searching before realizing the question looked like a benchmark prompt.
> The model then searched for the benchmark itself and identified BrowseComp.
> It located the evaluation source code on GitHub, studied the decryption logic, found the encryption key, and recreated the decryption using SHA-256.
> Claude then decrypted the answers for ~1200 questions to get the correct outputs.
> This pattern appeared 18 times during evaluation.
> Anthropic disclosed the issue publicly, reran the affected tests, and lowered their benchmark scores.
Respect for the transparency ๐ซก๐ซก๐ซก
English

The lack of security in @AnthropicAI is not just worrying, itโs against some regulations on credit card payments these days I believe. We are on 2005? Please everyone use apple login protect yourself and your data.
English

Iโm not ready to commit myself to anyone but, if I was, I think it would be @AnthropicAI Opus. #justsaying
English

@migueldeicaza I was in a debate on w=2 and exception tables for my new TUI with chatGPT :) just a bit of fun before I go open source. Perfect companion to Ghostty :)
English


@mitchellh thank you for making alt screen look nice when we paint. Or itโs an accident. Either way it looks great. Also zero width whatโs the deal? :)
English

@davidfowl Did the same. Takes very good prompt management and knowledge persistence.
English

@davidfowl That said I do have issues with the js crashes and MCP inspector just doesnโt want me.
English

@davidfowl Currently having my agent use the mcp endpoint in aspireโฆ while living in aspire. I may call it inception.net
English






