gerred

11.3K posts

gerred

@devgerred

senior principal mts. model whisperer. chasing the speed of light.

가입일 Mart 2020

1.2K 팔로잉2.5K 팔로워

gerred@devgerred·31m

@lumpenspace if you go by russia offering a sub to new zealand to pay off debts, you could just put the whole nuclear sub in a lake and a datacenter on top theguardian.com/world/2013/oct…

English

clumps@lumpenspace·36m

btw what's stopping the navy from putting whatever is inside nuclear subs in a lake or somewhere offshore and building a data centre on top?

English

139

gerred@devgerred·6h

*kantians reasoning nervously amongst themselves*

bubble boi@bubbleboi

We killed the Kantian leader of Iran.. next up is a Hegelian.

English

gerred@devgerred·6h

@fujikanaeda @HanchungLee @samuelcolvin

GIF

QME

Eric W. Tramel@fujikanaeda·7h

@HanchungLee @samuelcolvin you know someone will have written autodiff in the type system itself

English

Samuel Colvin@samuelcolvin·14h

PYTHON + RUST. Python inside the sandbox, Python (and some Rust) outside the sandbox. Typescript for frontend developers trying to stay relevant, like Mastra. By the way, the SF bubble is the one place where TS is popular for AI: `openai` package has 53m weekly downloads on PyPI, and 10m on NPM. AND THAT GAP IS WIDENING - used to be 4x, now 5x.

jason liu@jxnlco

Future of AI

English

147

23K

gerred@devgerred·7h

@thdxr i know I just overloaded every noun I used but I'm tracking what you wrote, compressed it down hard.

English

gerred@devgerred·7h

@thdxr tbqh maybe it's the editor that's skeumorphic. what you're wanting in "using it less" is real, but much like spreadsheets mirror their analogue equivalents, I think a lot about post-intelligence age UX. i don't think we're going back to the editor to solve the supervision problem

English

1.4K

dax@thdxr·7h

when we first started working on opencode days would pass by where we wouldn't use it and go back to our editor for everything had to actively try and switch our workflow now we're on the opposite extreme where we're all trying to use it less - crazy how fast that happened

English

336

19.6K

gerred@devgerred·7h

Oh, sweet summer child

Spiro Floropoulos@spirodonfl

Metaverse shutting down after spending 800000000000000000000000000000000000 freedom dollars on it proves LLMs can go away too. These things can be put back in the box for better things.

English

137

gerred@devgerred·8h

me n who

English

gerred@devgerred·8h

@jxnlco @jekbradbury You could totally do a pseudo prefix+(simulated, I suppose) radix caching from there to go further but that's really a profiling question from there. With hierarchical KV cache though, it's not out of the question.

English

gerred@devgerred·8h

I obviously don't have knowledge of what they're doing over there but it's a reasonable way to structure it, much like caching compilation graphs, pre compiling and shipping versioned KV caches is a pretty obvious optimization. Ant has a lot of caching options in their docs that would lead to this has at least entered their minds vs the less sophisticated pre-caching everyone else outside of Google does. I used to use GCP's provisioned caching as a very cost effective, high context limit docs oracle because I could provision and expire it at my leisure.

English

110

gerred@devgerred·12h

I'm betting the Anthropic ban of OpenCode is as technical and cost-saving as it is political. I've long argued there's a moat to be had by closing third party tools to subs. CC can rely on KV caching across every instance, and have KV caches on a per-organization basis for further customization for their largest customers. They can, across their entire fleet, pre-compute 1/3-1/2 (if not more) of every CC user's system prompt. By encouraging baking this into MDM and enterprise plans too, they can further negotiate that out in these large contracts. It also potentially lets them do some more clever things than just pure prefix caching and make specific tradeoffs you don't just get by allowing anybody to use those endpoints. At least that's how I'd do it. It surprised me it took THIS long.

English

4.8K

gerred@devgerred·9h

@SemiAnalysis_ @clanker_

QAM

SemiAnalysis@SemiAnalysis_·13h

Olympian Gold Medalist Alysa Liu, recently went viral for her Teen Vogue rant on OpenAI Codex. “I can see why Sam Altman open sourced Codex. Clearly the experience is significantly worse than Claude Code. I was unable to feel the AGI using Codex. As oppose to using Claude Code, I felt the enlightenment coming and support UBI ”

English

986

168.7K

gerred@devgerred·10h

@stochasticchasm I can't wait to rent your instance in cortical labs cloud.

English

stochasm@stochasticchasm·11h

they're capturing my cudagraphs tomorrow

English

gerred@devgerred·11h

@trq212 That's what I was about to ask after looking at the code, I'm REALLY WANTING claude/channels or another experimental extension here like the apps one. Would love to chat more about this, push notifications would be huge. Will there be a SEP for MCP?

English

2.9K

Thariq@trq212·11h

This is a research preview that we'll be expanding more on. Read more in our docs on Channels here: code.claude.com/docs/en/channe…

English

473

156.7K

gerred 리트윗함

Thariq@trq212·11h

We just released Claude Code channels, which allows you to control your Claude Code session through select MCPs, starting with Telegram and Discord. Use this to message Claude Code directly from your phone.

English

1.3K

1.8K

19.6K

4.1M

gerred@devgerred·11h

@deepfates @noveltokens What about the Memb--...oh.

English

🎭@deepfates·14h

@noveltokens Somebody stop me

English

🎭@deepfates·14h

yes that's the Technical Staff

Jacques@JacquesThibs

i like to imagine every ai lab passes around an arcane staff as their talking stick during standups while standing around a summoning circle

English

116

3.9K

gerred@devgerred·12h

@_BILLDING_ They can also better avoid shenanigans like quantized KV caching for their own products. So then there's even a quality edge CC can have. I'm like the only inferencing expert that actually puts a product hat on it feels like sometimes.

English

gerred@devgerred·12h

@_BILLDING_ Yep now imagine that cached at mass scale across every instance for a long time, instead of letting that go cold (like any API user's would). Hierarchical caching is expensive but the cost savings are very worth it.

English

269

gerred@devgerred·13h

@kalomaze "why does this model not generalize to my hello world program written in Piet?"

English

147

kalomaze@kalomaze·13h

the kind of person who asks "but does this transfer generalize to Brainfuck?" is simply not being a serious person tbqh

English

1.6K

kalomaze@kalomaze·13h

not shocking at all; the models don't want to write in byzantine esoteric languages instead of python or rust or whatever

Lossfunk@lossfunk

🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵

English

213

13.1K

탐색

@lumpenspace @fujikanaeda @HanchungLee @samuelcolvin @thdxr @jxnlco @jekbradbury @SemiAnalysis_