John Solly

5.2K posts

@_jsolly

Technologist, crossfitter, and satire lover

Philadelphia, PA · Joined December 2021

569 Following · 721 Followers

John Solly@_jsolly·18h

@petergostev @arena What, I can’t see an A/B of two FastAPI implementations? LAAAAAME

Peter Gostev@petergostev·1d

Note we've renamed Code Arena to Frontend Design: WebDev for these chats. I hope this is less confusing, but lmk if you have better suggestions

Arena.ai@arena

MiMo-V2.5 by @XiaomiMiMo is the #11 model (#3 among open) in Code Arena for frontend design. A new MIT-licensed open source model with 1M context, it also ranks strongly as an open model in Text and Vision Arena.

Code Arena: frontend webdev design
- MiMo-V2.5-Pro: #3 open (#11 overall)
- MiMo-V2.5: #5 open (#18 overall)

Text Arena: text prompts
- MiMo-V2.5-Pro: #2 open (#22 overall)

Vision Arena: visual input reasoning
- MiMo-V2.5: #7 open (#37 overall)

Congrats to the @XiaomiMiMo team on this achievement!

John Solly@_jsolly·18h

@randal_olson Polymodelrous

Randy Olson@randal_olson·1d

Someone discovered that Claude knows when you've been cheating on it with Codex. Lucas maintains an open relationship with his LLMs. Source: reddit.com/r/ClaudeAI/com…

John Solly@_jsolly·18h

GIF

John Solly@_jsolly·18h

GIF

Exa@ExaAILabs

We're excited to partner with Google to offer Grounding With Exa inside of Gemini models! Using Exa's agent-first search, Gemini models can now access billions of websites, technical docs, papers, people, companies, and more. 10^18🤝10^100

John Solly@_jsolly·18h

The amount of sass and GenZ-iums I get from Opus 4.7 constantly catches me by surprise. Anyone else?

John Solly@_jsolly·18h

@JamesTimmins I feel like at this point agents should just abstract them away.

James Timmins@JamesTimmins·18h

I hate git worktrees so much

John Solly@_jsolly·1d

I think there’s a Jevons paradox with increasing model capability. As models get better, my prompts become more vague and encompass harder and harder tasks. So the ‘cheap, but good enough’ idea falls apart. You always want the best model. But for fixed, commodity prompts, it makes sense to use a lesser model.

John Solly@_jsolly·1d

@nghoihin @derekmeegan This is an interesting approach. If you have it connected to Claude via MCP, are you running docker in the cloud?

Jack H. Ng@nghoihin·1d

@_jsolly @derekmeegan feels relevant if you're exploring this — github.com/Beever-AI/beev…

derek@derekmeegan·1d

maybe human memory is slop too

derek@derekmeegan

even in claude code agent “memory” is slop. sad.

John Solly@_jsolly·1d

@beffjezos Finally, an opportunity to increase shareholder value at my kid’s piano recital.

Beff (e/acc)@beffjezos·2d

You may not like it, but this is what peak performance looks like. Vibe coding everywhere, straight to your eyeballs. Mad lads actually did it.

Even Realities@EvenRealities

You can now run your coding terminal from your glasses, wherever you are. Terminal Mode is live. #DevTools #BuildInPublic #coding

John Solly@_jsolly·1d

@ajambrosino Until morale improves

John Solly@_jsolly·1d

@derekmeegan F it. Put the ontology in an AGENTS.md and then just have a bunch of markdown files it references.

derek@derekmeegan·1d

@_jsolly idk if it’s that there aren’t good tools for repackaging/distilling facts but if models are actually good at using them/referencing context intelligently

John Solly@_jsolly·1d

Okay, I’m sold.

Even Realities@EvenRealities

You can now run your coding terminal from your glasses, wherever you are. Terminal Mode is live. #DevTools #BuildInPublic #coding

John Solly@_jsolly·1d

@taigrr Not all AGENTS.md are the same…

Tai Groot 🐧@taigrr·1d

my agent is smarter than your agent

John Solly@_jsolly·1d

👀

Bill Staples@bstaples

Tired of the pain yet? Come to GitLab and take back control of your destiny. I’ll even throw in the first year free for anyone switching from GitHub who signs a new three year agreement. DM me

John Solly@_jsolly·1d

Oh good point! Maybe MCPs help solve this. Take each important part of your stack and limit who can manipulate it and how. There also seems to be good progress with running LLMs in sandboxes and controlling how they can ‘escape’. But eventually, you get to a point where you’re exchanging flexibility for control.

Engel Nyst - open/acc@engelnyst·1d

@_jsolly Absolutely 1) I do wonder about 2), it doesn’t seem easy when the agent is powerful. So many ways to write in bash/python/anything a thing! LLMs, even Claude cloud agent, are like “hmm not allowed, let me try that way” - boom. Maybe with proxy/OS interception of the action?

Engel Nyst - open/acc@engelnyst·2d

One more time, for those in the back: Prompts are not safeguards Prompts are not safeguards Prompts are not safeguards Repeat: All LLMs are jailbreakable. Prompts are not safeguards. All LLMs are jailbreakable. Prompts are not safeguards. (yes normal prompts! No Pliny required)

JER@lifeof_jer

x.com/i/article/2048…

John Solly@_jsolly·1d

@newgeographer2 You have different saturations so hard to compare.

svg@newgeographer2·1d

Green or purple?

John Solly@_jsolly·1d

@kentcdodds

GIF

Kent C. Dodds 🏹@kentcdodds·2d

I'm currently writing an article titled "The Last Software Engineer"

John Solly@_jsolly·1d

Chat, GPT-5.5 is mid

Arena.ai@arena

GPT-5.5 by @OpenAI is now live in the Arena, landing across multiple leaderboards. Here’s how it ranks by modality:
- Code Arena (agentic web dev): #9, a strong +50pt jump over GPT-5.4
- Document Arena (analysis & long-content reasoning): #6, on par with Sonnet 4.6
- Text Arena: #7, Math #3, Instruction Following: #8
- Expert Arena: #5
- Search Arena: #2
- Vision Arena: #5

Strong, well-rounded performance, especially in Code (+50 pts vs GPT-5.4). Congrats to @OpenAI on the release. Full category breakdowns by modality in the thread.
