Josselin Perrus

2.8K posts

Josselin Perrus

Josselin Perrus

@nonils

Product Manager

Paris Katılım Eylül 2009
2.1K Takip Edilen531 Takipçiler
Josselin Perrus
Josselin Perrus@nonils·
@astralwave Can you break it down how you go from coarse ask to small parts? What’s the story behind your screenshot?
English
1
0
1
24
Andric
Andric@astralwave·
Codex sub-agents are such a treat to use I love how you can steer each one individually Makes it possible to start with a coarse ask, have the agent break up the work into small orthogonal parts, then deep dive into each one It’s also super cute that each sub-agent gets a name like “Erdos”, “Halley”, “Hubble”, etc: it’s a small detail but it makes it possible to refer to them easily!
Andric tweet media
English
1
0
3
729
Josselin Perrus
Josselin Perrus@nonils·
SAM-3 provides remarkably precise answers (0.97 IoU). But 1 out of 5 times it returns nothing. I suspect it to be a prompting issue. SAM-3 is currently a solid addition to coding agents. Will the general models catch up? Full write-up: meaningfool.net/articles/detec…
English
0
0
0
8
Josselin Perrus
Josselin Perrus@nonils·
DINO alone scored worse than raw LLMs, regularly choosing the wrong shape (triangle instead of rectangle), but with high precision The target was often in its top-5 candidates though. Delegating the ranking to Claude Code with a Skill improved the results substantially.
Josselin Perrus tweet media
English
1
0
0
37
Josselin Perrus
Josselin Perrus@nonils·
Agents/LLMs can't locate what they see on images. They can describe an image, its composition. But they can't "select" an element to move or resize programmatically: the bounding box they produce is off. Can tools fix that? I ran a small experiment.
Josselin Perrus tweet media
English
1
0
0
33
Josselin Perrus retweetledi
Hrishi
Hrishi@hrishioa·
New paper on skills. The conclusions hold up exactly 1-to-1 with our experience. Skills are better than docs, but only when made with care. Less is more. Models are really bad at making skills. 2 paragraphs of human-written condensed instructions (or best practices) are better than 20 pages of model-summarised info. My theory is they lack the theory-of-mind IQ to think through what they might have needed to know before they knew it.
Hrishi tweet media
English
1
8
47
2.6K
Josselin Perrus
Josselin Perrus@nonils·
@masondrxy Fixed on mobile. I mention it, but did not go much further as i did not dig into it yet.
English
0
0
1
14
Josselin Perrus
Josselin Perrus@nonils·
Agent frameworks are confusing. So I wrote "Agent Frameworks for the Rest of Us" as I dove into the topic. Turns out you can classify them in 3 categories: "Orchestration Frameworks", "Agent SDKs" and "Agent Servers" 👇
Josselin Perrus tweet media
English
4
3
51
12.9K
Mason Daugherty
Mason Daugherty@masondrxy·
@nonils can’t view anything on mobile :( sidebar covers everything
English
1
0
0
54
Josselin Perrus
Josselin Perrus@nonils·
@NathanFlurry I featured sandbox-agent in my "agent frameworks for the rest of us" report. It's by a non-developer (me), for non-developers. Hopefully I got things right. Would love to have your feedback x.com/twitter/status…
Josselin Perrus@nonils

Agent frameworks are confusing. So I wrote "Agent Frameworks for the Rest of Us" as I dove into the topic. Turns out you can classify them in 3 categories: "Orchestration Frameworks", "Agent SDKs" and "Agent Servers" 👇

English
1
0
1
98
Josselin Perrus
Josselin Perrus@nonils·
@modal @tryramp @modal @charles_irl I wrote a report about agent frameworks "for the rest of us" - the non-dev, as I am. I report about the Ramp / Modal architecture, among others. If you give it a look, let me know what you think x.com/nonils/status/…
Josselin Perrus@nonils

Agent frameworks are confusing. So I wrote "Agent Frameworks for the Rest of Us" as I dove into the topic. Turns out you can classify them in 3 categories: "Orchestration Frameworks", "Agent SDKs" and "Agent Servers" 👇

English
0
0
1
127
Modal
Modal@modal·
We sat down with @tryramp to talk about their internal coding agent, Ramp Inspect, that now writes over half of their merged PRs. - Runs on Modal Sandboxes. - Full dev environments spin up in seconds. - Hundreds of sessions in parallel.
English
8
8
88
31.1K
Josselin Perrus
Josselin Perrus@nonils·
@threepointone @threepointone I published a report on agent frameworks. There is some @cloudflare inside. But no Cloudflare Agent SDK as I'm not yet familiar with it Do you think it would fit in the model I define like the PydanticAI, Vercel AI SDK, Pi and Claude Agent? x.com/nonils/status/…
Josselin Perrus tweet media
Josselin Perrus@nonils

Agent frameworks are confusing. So I wrote "Agent Frameworks for the Rest of Us" as I dove into the topic. Turns out you can classify them in 3 categories: "Orchestration Frameworks", "Agent SDKs" and "Agent Servers" 👇

English
0
0
0
56
sunil pai
sunil pai@threepointone·
…-playground.threepointone.workers.dev (beta) a demo of every agents sdk feature. it's a lot. we're lacking features from other frameworks/infra, and we'll close that gap very quickly. but there are features in our agents sdk that will never be in other frameworks/infra.
sunil pai tweet media
English
19
14
151
44.2K
Jason Zhou
Jason Zhou@jasonzhou1993·
Gemini 3.1 pro is a GREAT animator LLM was struggling with spatial positioning when generating animation before, But look at those Gemini 3 pro VS Gemini 3.1 pro comparison Gemini 3.1 pro visibly reasons better You can try it free on @SuperDesignDev today ---- I might also share some prompt guide on getting most out of model for animations, comment below if you interested
English
10
28
294
18.9K
Josselin Perrus
Josselin Perrus@nonils·
@steveruizok an interaction we had on LinkedIn about your project to build an internal agent for @tldraw actually triggered my effort to better understand Agent Frameworks. And that's me sharing what I learnt. Thanks for the involuntary nudge :) x.com/nonils/status/…
Josselin Perrus@nonils

Agent frameworks are confusing. So I wrote "Agent Frameworks for the Rest of Us" as I dove into the topic. Turns out you can classify them in 3 categories: "Orchestration Frameworks", "Agent SDKs" and "Agent Servers" 👇

English
1
0
3
2.2K
Josselin Perrus
Josselin Perrus@nonils·
@badlogicgames @thdxr @samuelcolvin I was curious about how Pi, Opencode, PydanticAI and other framewoks were different. So I wrote about what I learnt. I'm not a dev though so hopefully there aren't too many inaccuracies. Feedback welcome :) x.com/nonils/status/…
Josselin Perrus@nonils

Agent frameworks are confusing. So I wrote "Agent Frameworks for the Rest of Us" as I dove into the topic. Turns out you can classify them in 3 categories: "Orchestration Frameworks", "Agent SDKs" and "Agent Servers" 👇

English
0
0
2
19
Josselin Perrus
Josselin Perrus@nonils·
@JacobColling your experimentations with Claude Agent SDK picked my curiosity. Being less technical, I was a bit confused about all those agent frameworks out there. I wrote about what I learnt diving into the topic. Would love to have your review x.com/nonils/status/…
Josselin Perrus@nonils

Agent frameworks are confusing. So I wrote "Agent Frameworks for the Rest of Us" as I dove into the topic. Turns out you can classify them in 3 categories: "Orchestration Frameworks", "Agent SDKs" and "Agent Servers" 👇

English
1
0
1
46
Josselin Perrus
Josselin Perrus@nonils·
If you are wondering what the difference is between LangGraph, PydanticAI, Claude Agent SDK, Vercel AI SDK, Opencode... So did I. The report is probably not 100% accurate, but I believe it can help. Feedback welcome. …ent-frameworks-report.meaningfool.net
Josselin Perrus tweet media
English
3
0
7
1K
Josselin Perrus
Josselin Perrus@nonils·
In this report, I cover: - How we got from prompting to agents - App-driven vs agent-driven control flow for orchestration - Why access to bash and filesystem is a game changer - Architecture implications of long-running agents - Analysis of 4 real projects
Josselin Perrus tweet media
English
1
0
1
759