spion

29.2K posts

spion

@spion

Fullstack SWE. ex-Apple. Prefer insightful discussion to debate. Rust, TypeScript, Effect, SolidJS, localfirst, devops, keto, stats/science, audio/DSP

London, England Katılım Şubat 2008

1.2K Takip Edilen1.4K Takipçiler

spion@spion·15h

@thdxr did LLMs cause open source software developers to re-evaluate if they should continue to publish open source?

English

dax@thdxr·22h

whether this is true or not it's going to cause every company producing open source models to re-evaluate if they should continue to do so that is incredibly frustrating

sumit@sumitdotml

now a deleted tweet, probably nothing

English

1.7K

160.9K

spion@spion·1d

@onehappyfellow (Under the constraints of a non-GCed, high-performance language that is memory-safe, its actually not bad at all. The issue is that GCs exist and are good)

English

100

spion@spion·1d

@onehappyfellow Rust is not tasteless, its just that the constraints the language picked to start with were unfortunate.

English

323

One Happy Fellow@onehappyfellow·1d

rust is a good language, it's just tasteless go is a programming language

English

176

10.6K

spion retweetledi

David Cramer@zeeg·1d

1) not surprising whatsoever 2) this is exactly what I keep saying about models not being powerful enough today the fact that they can do so much with lossy compression is amazing, but there's no magic here imo (for transformers) context windows need to be 1-2 orders of magnitude larger for the future people keep saying is reality, and even then the compute is probably not worth it

Lossfunk@lossfunk

🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵

English

103

9.8K

spion@spion·2d

@ChShersh many talk about rustability

English

Dmitrii Kovanikov@ChShersh·2d

Everyone talks about scalability. Nobody talks about cpluplusability.

English

166

6.8K

spion@spion·2d

@DanielW_Kiwi Also the code still sucks, quite often.

English

spion@spion·2d

@DanielW_Kiwi (things = code output. models we useful before that too for understanding and debugging and small tightly supervised changes)

English

Daniel 🦔@DanielW_Kiwi·2d

I'm really interested in knowing what causes this assessment vs the 1000x results others are claiming. It's very hard to understand the real differences here. Is it a difference in opinion on code quality. Is it a difference in driving the tools?

vaxry@vaxryy

All the AI talk, so I actually tried, but after 400 thousand tokens the result is pretty bad, I am writing this by hand. It will take days instead of 2 hours but at least it will work properly...

English

2.1K

spion@spion·2d

@headinthebox @boleroo wishlist: language where types are versioned entities in append-only store; compiler generates bidirectional transforms between versions (you can override default), database data tagged with schema version, automatically project through transforms on read

English

spion@spion·2d

@headinthebox @boleroo i.e. systems that will remove the reasons to fear commitment.

English

Erik Meijer@headinthebox·3d

What do the doubters see that we don't? Or, do we see something they don't. IMHO, the biggest mystery right now in our field.

David Cramer@zeeg

i can write 50k lines of code a day and it will absolutely not generate any tangible lasting value nor will these 16k, from Garry or anyone else (sorry, but its the truth)

English

31.7K

spion@spion·2d

@headinthebox @boleroo possible solution: user-agent.md and skills/complain/SKILL.md

English

spion@spion·2d

@headinthebox @boleroo Thats true, but you can't cheaply validate them unless you are the only user. And if there are other users and you come in with zero commitments, they may be less invested in committing their time too.

English

spion@spion·2d

@headinthebox @boleroo (the main exception is when building for yourself only)

English

spion@spion·2d

@headinthebox @boleroo do you see why zero-commitment exploring might not yield much insight?

English

spion@spion·3d

@headinthebox maybe you should then play for more than 10 minutes?

English

Erik Meijer@headinthebox·3d

I mean, I get the Luddites who see their craft evaporate in front of their eyes. But if you played with one of the coding agents for just 10 minutes, it must be crystal clear what the future is.

English

5.2K

spion@spion·3d

@zeeg brb writing complain/SKILL.md

English

spion@spion·3d

@zeeg Which does get me thinking. What if we define a user-agent? 😀

English

David Cramer@zeeg·3d

where's all those billion dollar businesses built from gas town and other slop farms? oh

English

359

43.8K

Keşfet

@thdxr @onehappyfellow @ChShersh @DanielW_Kiwi @headinthebox @boleroo @elonmusk @BarackObama