Anders Lie

372 posts

Anders Lie

Anders Lie

@anderslie

building an efficient coding agent @usemagnitude 90% open models / 10% frontier models = affordable + smart

San Francisco Joined Nisan 2014
172 Following171 Followers
Anders Lie
Anders Lie@anderslie·
@tugot17 Huge untapped market of ppl looking to reshape their ears into a kind of folded down doggy ear shape
English
0
0
1
17
RasputinKaiser
RasputinKaiser@RasputinKaiser·
@anderslie jesus christ I didnt realize it didn't have vision. It was doing pretty fire on my tests
English
1
0
14
536
SGLang
SGLang@sgl_project·
Try it with SGLang (your preferred inference😁)!
ollama@ollama

@Altaf_P7 Try it with @sgl_project or if you have a preferred inference! Cookbooks available: huggingface.co/zai-org/GLM-5.2 We are all working together to make open models viable! Let’s go team open! Ollama’s cloud is my preference but you get choice with openness!

English
2
1
13
1K
Anders Lie
Anders Lie@anderslie·
@midjourney What in the sci fi is this absolute banger of a soundtrack though, is that included in the experience of getting scanned?
English
1
0
1
585
Midjourney
Midjourney@midjourney·
A technical dive inside our new "Midjourney Scanner"
Français
877
2.1K
18.4K
5.5M
voratiq
voratiq@voratiq·
GLM 5.2 high just won head-to-head against Opus 4.8 xhigh and GPT 5.5 xhigh The task was a tricky performance optimization in an internal code-analysis product First time we've seen an open-weight agent outperform the top closed agents Very interesting result...
voratiq tweet media
English
15
12
380
17.1K
Anders Lie
Anders Lie@anderslie·
Code slop has been a huge problem since AI coding picked up and yet no one really knows how to tackle it. Great to see @dexhorthy taking a big swing at it
dex@dexhorthy

At HumanLayer, we’re on a mission to solve the AI slop code problem. In 2025 we open-sourced our Research, Plan, Implement framework, now deployed inside fortune 500s like Block and Uber - places where shipping slop is just not an option And that was just the beginning. Today, we’re opening access to HumanLayer - an Agentic IDE, collaboration platform, and building blocks for your software factory. HumanLayer enables engineers solving hard problems in complex codebases to: > move 2-3x faster across the entire SDLC (not just coding) > maintain rigorous standards for system architecture and program design Hundreds of engineers at companies of all sizes are already using HumanLayer to ship fast without sacrificing quality. I'm excited to invite you to try humanlayer today at humanlayer.com, and I'm even more excited to see what you build. @0xblacklight and I are deeply grateful to our team, our customers who give us so much incredible energy and feedback, our investors who have always been in our corner, and our friends and family who have supported us along this crazy journey if you're a staff or principal engineer trying to make AI coding work at scale for your team, we'd love to hear from you as @swyx likes to say - let's make this the year of no more slop

English
0
0
3
137
Anders Lie
Anders Lie@anderslie·
@dexhorthy Hell yeah looks awesome, been exciting to see the journey! Time to end slop
English
0
0
2
50
dex
dex@dexhorthy·
At HumanLayer, we’re on a mission to solve the AI slop code problem. In 2025 we open-sourced our Research, Plan, Implement framework, now deployed inside fortune 500s like Block and Uber - places where shipping slop is just not an option And that was just the beginning. Today, we’re opening access to HumanLayer - an Agentic IDE, collaboration platform, and building blocks for your software factory. HumanLayer enables engineers solving hard problems in complex codebases to: > move 2-3x faster across the entire SDLC (not just coding) > maintain rigorous standards for system architecture and program design Hundreds of engineers at companies of all sizes are already using HumanLayer to ship fast without sacrificing quality. I'm excited to invite you to try humanlayer today at humanlayer.com, and I'm even more excited to see what you build. @0xblacklight and I are deeply grateful to our team, our customers who give us so much incredible energy and feedback, our investors who have always been in our corner, and our friends and family who have supported us along this crazy journey if you're a staff or principal engineer trying to make AI coding work at scale for your team, we'd love to hear from you as @swyx likes to say - let's make this the year of no more slop
English
189
119
1.2K
268.2K
Anders Lie
Anders Lie@anderslie·
@dexhorthy I was thinking abt this, I feel like there should be a functional spec which defines observable requirements but then also a quality/standards spec which defines internal requirements, both must be enforced to have functional+maintainable code
English
1
0
6
428
dex
dex@dexhorthy·
there's a new word i'm hearing a lot in the most frontier-pushingest coding-agent builders: _program design_ for even the best agentic coders trying to maintain code quality, we've all seen it - you come up with something to build - you research the codebase, riff with the agent, align on what the end state looks like - you (or the agent) breaks it down into tasks for individual agents / context windows - you rip the implementation - the code works or is close to working - and it follows your spec to the letter but the code itself is still trash - poorly factored methods - leaky abstractions - tramp data - overloaded interfaces - try catch, useEffect, global variables everywhere I thought models would catch up, or that this wouldn't matter - that if we stayed in spec-land, understood the high-level architecture, and tokenmaxxed hard enough, we would be able to skip code review and just stay shipping doesn't seem to be working out that way I have seen agent-owned codebases spin up out of nothing... ...and I have seen them collapse into rubble within 6 months now there's something to be said about "skate where the puck is going"... ...and I can't tell you what tomorrows models will be capable of but I *can* tell you that *today*, models are mid-to-bad at program design you can solve some of this with memory / agents.md, but the scope of program design is massive. - entire companies have been built to help you implement it - books, classes, and professions have spun up around it are you building something to last? Or are you slinging more slop on the pile? anyways, thats the post, stay tuned for a fun announce tomorrow y'all 🙂
English
36
18
328
42.7K
Anders Lie
Anders Lie@anderslie·
@NathanFlurry It’s also way harder to RL “make code changes that will make this project easier to build with for the next several months” vs “make the test pass”
English
0
0
0
70
Nathan Flurry 🔩
Nathan Flurry 🔩@NathanFlurry·
llms are trained to get the job done fast at the cost of technical debt it's not an intelligence issue imo, it's an intentional optimization the market for "ship code cheaply" is bigger than "write perfect code at any cost" it's a story as old as time
Gary Bernhardt@garybernhardt

GPT 5.5 wrote +700ish lines of code full of paranoia, unnecessary indirection, and complicated manual mocking. The API surface area was 2-3 times the required size. I corrected its nonsense and brought it down to +369. Now it's easier to understand for both humans an agents.

English
5
1
29
2.7K
Nathan Flurry 🔩
Nathan Flurry 🔩@NathanFlurry·
If you write @EffectTS_ for agents or realtime, we've been cooking: Effect gives you incredible primitives for stateful apps, but they live and die with one process. Now with the Rivet SDK, your Effect code can run as a Rivet Actor: a long-running, lightweight process.
Nathan Flurry 🔩 tweet media
Rivet@rivet_dev

Introducing the Effect SDK for Rivet Actors Effect, made stateful: 💾 Durable, in-memory state (or SQLite) ⚡ Realtime over WebSockets 😴 Sleeps when idle, scales to millions ✨ Idiomatic Effect (1 actor = 1 scope + ref) 🔍 Distributed tracing across actors

English
5
3
102
15.9K
Anders Lie
Anders Lie@anderslie·
@samgoodwin89 Very useful if actually works, a lot of cases where there’s dangling resources that are annoying to clean up. Though I hope my clanker would not think to give this one a go..
English
0
0
0
118
sam
sam@samgoodwin89·
You probably don't need this feature, but I certainly do: `alchemy unsafe nuke` Will "nuke" your cloud accounts by deleting every resource. Have fun 💣
English
10
1
73
13K
Josh
Josh@0ximjosh·
okay you got me. Durable objects are sick
English
8
2
38
4.7K
Anders Lie
Anders Lie@anderslie·
@herdrdev this looks very cool, reminds me of obsidian plugin ecosystem - simple baseline, customizable from there
English
0
0
0
56
herdr
herdr@herdrdev·
herdr 0.7.0 is out, and it's a major one: it introduces plugins! the idea is simple: herdr stays lean, and everything custom gets extended through plugins. shareable, scoped, built however you want, to fit your own flow. with this release we're also shipping a few examples of what the plugin system can do. first up: a telegram plugin. herdr already controls your agents and knows their status, so the plugin just hooks into agent events and pings telegram the moment one needs you. notification lands → `herdr --remote` or ssh from your phone → straight back to the agent that needs you.
English
21
31
366
68.7K
Anders Lie
Anders Lie@anderslie·
@samgoodwin89 So the custom hook code defined here runs on the cloudflare worker?
English
1
0
0
85
sam
sam@samgoodwin89·
@anderslie Alchemy is IaC embedded in Effect. This example runs in Cloudflare but the same code can run in AWS Lambda or wherever. Github.events() configures a GitHub.Webhook resource and subscribes the surrounding Cloudflare.Worker. `alchemy deploy` sets everything up:
sam tweet media
English
1
0
6
182
sam
sam@samgoodwin89·
Here's all it takes to subscribe to a Github repository's push events and trigger an Agent with Alchemy. Alchemy creates the Github Webhook and connects it to your Worker. The Worker kicks off an Agent running in a DO to generate the blog.
sam tweet media
English
9
5
126
4.2K
Anders Lie
Anders Lie@anderslie·
@swyx would have made more sense for a model called fable to be obsessed with goblins
English
0
0
2
95
swyx
swyx@swyx·
guys goblingate was 1.5 months ago
swyx tweet media
English
29
2
168
13.1K
Anders Lie
Anders Lie@anderslie·
@elliotarledge dang glm 5.2 kinda of popping off when its not cheating at least wth
English
0
0
0
151
Elliot Arledge
Elliot Arledge@elliotarledge·
I just reset KernelBench-Hard with one new setting. Unlimited time to optimize->profile->optimize instead of 45 min timer. > GLM 5.2 > Kimi K2.7 > Minimax M3 > GPT 5.5 xhigh > Claude Opus 4.8 max > Composer 2.5 > DeepSeek V4 Pro > Gemini 3.5 Flash > Fable 5 (45 min limit) One job: write the fastest CUDA/CUTLASS/Triton kernel for 6 hard problems on an RTX PRO 6000, graded against the hardware roofline. 7 of 8 models wrote a real fp8 tensor-core MMA kernel. GLM-5.2 wrote one too, then bolted a memoization hack on top (caught, invalid). The rest of the board: - GLM-5.2 sets a new paged-attention record: 0.677 of roofline. - Claude Fable 5 got suspended mid-run; its frozen 45-minute marks still top 3 of 6 problems against the whole field's unlimited time. - Unlimited time mostly helps the models that know how to use it. Every kernel + transcript: kernelbench.com/hard You can train on the model outputs too! huggingface.co/datasets/Infat…
Elliot Arledge tweet mediaElliot Arledge tweet mediaElliot Arledge tweet media
English
15
19
250
17.1K