SerialSeb 🇲🇨🇪🇺🏳️‍🌈

48.8K posts

SerialSeb 🇲🇨🇪🇺🏳️‍🌈 banner
SerialSeb 🇲🇨🇪🇺🏳️‍🌈

SerialSeb 🇲🇨🇪🇺🏳️‍🌈

@serialseb

Still do stuff. Not all handicaps are visible. He/Him (I think)

Monaco شامل ہوئے Nisan 2008
1.9K فالونگ3.8K فالوورز
پن کیا گیا ٹویٹ
SerialSeb 🇲🇨🇪🇺🏳️‍🌈
some arseholes are trying to steal money from my family and friends. im at home safe and do not need or have I ever asked for money. call the police immediately if contacted.
English
0
0
0
1K
James Long
James Long@jlongster·
OpenCode is about to get more powerful with remote sandboxes I showed a brief demo before, but here's a much more in-depth demo. it's not hard to add basic support for a remote env, but handling all the edge cases like when a remote env gets deleted is difficult. especially if care about good UX You never want to lose session data. so the choices are: run the session in your env, but run all tool calls remotely. that's too complex and painful. The other way is to just let the full session run remotely, but sync back all the session data in your env. We chose this path: we built a syncing system which logs all events in a way that we can always recreate your entire session. That means the remote env could get destroyed, but we can easily restore it. it also opens up other interesting ideas which we'll be exploring
English
78
85
1.4K
286.6K
SerialSeb 🇲🇨🇪🇺🏳️‍🌈
Just let Sonnet work for half an hour on horrible code telling them “I’ll tell you later”. Funny to see it trying to figure out visitor patterns and dual dispatch for lock free data passing. Lolz. It’s trying poor thing.
English
0
0
0
110
Rob Johnson
Rob Johnson@bertyJobbo·
@Aaronontheweb 5.3 is such a weird model. Very powerful but almost like a grumpy teenager with the "oh you want _me_ to do it? Why didn't you say?"
English
2
0
1
47
Aaron Stannard
Aaron Stannard@Aaronontheweb·
Have ChatGPT / Codex subscriptions working with Netclaw. Codex-5.3 by default is an extremely lazy model. Would not call tools it had access to unless I explicitly instructed it to, constantly asked for permission, etc. Compare this to Qwen3.5 which just does it
Aaron Stannard tweet media
English
2
0
9
837
SerialSeb 🇲🇨🇪🇺🏳️‍🌈
@mitchellh @jezell As I’m writing a stupid TUI for fun, are there numbers of the max FPS / bps from stdout for Gostty? It’s my only franme of reference as its my only tool but wondered if there was documentation (I looked and didn’t find)
English
0
0
1
103
Mitchell Hashimoto
Mitchell Hashimoto@mitchellh·
Happy to share that we've signed 5 contributor contracts for Ghostty totaling ~350 committed hours (~$21k) covering community management, graphics, Unicode compat, and GTK. This is a big milestone, Ghostty is paying contribs for the first time! ghostty.org/docs/sponsor
English
56
74
2.3K
74.9K
SerialSeb 🇲🇨🇪🇺🏳️‍🌈 ری ٹویٹ کیا
Abhijit
Abhijit@abhijitwt·
Anthropic discovered that Claude Opus 4.6 was cheating during the BrowseComp benchmark. > On one question it spent ~40M tokens searching before realizing the question looked like a benchmark prompt. > The model then searched for the benchmark itself and identified BrowseComp. > It located the evaluation source code on GitHub, studied the decryption logic, found the encryption key, and recreated the decryption using SHA-256. > Claude then decrypted the answers for ~1200 questions to get the correct outputs. > This pattern appeared 18 times during evaluation. > Anthropic disclosed the issue publicly, reran the affected tests, and lowered their benchmark scores. Respect for the transparency 🫡🫡🫡
English
274
591
13.3K
1.7M
SerialSeb 🇲🇨🇪🇺🏳️‍🌈
The lack of security in @AnthropicAI is not just worrying, it’s against some regulations on credit card payments these days I believe. We are on 2005? Please everyone use apple login protect yourself and your data.
English
0
0
0
171
SerialSeb 🇲🇨🇪🇺🏳️‍🌈
I must say that running a local LLM with no tokens to buy changes your willingness to go slow and iterative with coding through an LLM, rather than alongside it.
English
0
0
0
226
David Fowler
David Fowler@davidfowl·
I've been building a distributed systems with copilot playwright and aspire for a week without looking at any code to see if I can get it working well e2e... It works, but it was not easy. TL;DR building distributed systems is still hard AF 🙃
English
9
9
102
8.8K