Mudith Jayasekara

89 posts

Mudith Jayasekara

@mudithj

co-founder @parsedlabs sticking it to Big Token, half eng/cs phd @rhodes_trust @UniofOxford, scaling care through intelligence

SF Katılım Ocak 2018

297 Takip Edilen219 Takipçiler

Sabitlenmiş Tweet

Mudith Jayasekara@mudithj·29 Ağu

Rewatching the greats in this video never gets old. Grateful to now be making our own dent in defining the next paradigm of AI. Working with the most thoughtful and passionate people I know and backed by incredible investors (from LocalGlobe @svennj @asharoraa , HuggingFace @Thom_Wolf, DeepMind, NHS, and others). Thank you to our customers who care enough to glimpse into what the future of language models looks like. Let's build🫡

Charlie O'Neill@oneill_c

Today, we’re launching Parsed. We are incredibly lucky to live in a world where we stand on the shoulders of giants, first in science and now in AI. Our heroes have gotten us to this point, where we have brilliant general intelligence in our pocket. But this is a local minima. We now have an ecosystem of burgeoning tasks where each requires a different kind of intelligence, a different context, a whole host of implicit assumptions and latent knowledge and domain expertise that is very difficult to cram into a system prompt. The big labs want you renting their $50k/month amnesiac interns that forget everything between conversations. Generic behemoths that get quantised, versioned and deprecated behind the scenes, where the only element of control you have is your messy monolithic user prompt. We want people who need their own intelligence to be able to not only access it, but also control it. And whilst the big general models are unbelievably good chatbots and coding agents and purveyors of the world, specialisation of intelligence is required. Clinical scribes, marketing compliance agents, legal red-lining models, insurance policy recommenders, the list goes on. And so that’s what Parsed does: deploy your own frontier model that actually learns. We eval your specific task, build a custom evaluation harness, optimise a model just for you, and host it with continual learning. We bake all the context and knowledge of your task into the model itself, from your engineers to your domain experts to customer feedback, all in a tight SFT → RL loop, with useful interpretability made possible by the open-source ecosystem we build on top of. No more 2000-word prompts with seventeen "IMPORTANT: NEVER DO X" clauses. Your model gets better at YOUR job every single day; the amnesiac pseudo-gods have had their run. Your model, your data, your moat. Let's build 🫡

English

2.6K

Mudith Jayasekara@mudithj·1d

standing on the shoulder of giants

Tuhin Srivastava@tuhinone

x.com/i/article/2054…

English

372

Mudith Jayasekara retweetledi

Gabe Pereyra@gabepereyra·6d

Congrats to the @baseten team! Has been awesome working with @mudithj, @oneill_c, @mblau_ and team to post-train open weight models for long-horizon legal work. Early results on benchmarks like our Legal Agent Bench are super promising

Baseten@baseten

Open-source RL libraries break at frontier scale. We built Baseten Loops to fix this. Loops is a training SDK that takes you from your first RL run to production inference on a single platform: → Async RL so training and sampling overlap → 131K+ sequence length for agentic and long-horizon workflows → One command to promote your model to prod → Dedicated infra for predictable, repeatable performance We're excited to work with @harvey and @EvidenceOpen as early partners. Early access is open today: baseten.co/blog/introduci…

English

5.5K

Mudith Jayasekara retweetledi

Niko@nikogrupen·8 May

Infra for running / training long-horizon agents getting better by the day. Congrats to @oneill_c @mudithj & the @baseten team on the launch! Excited to be partnering with you all

Baseten@baseten

English

10.4K

Mudith Jayasekara@mudithj·8 May

Being able to take gradient steps on 1T+ sized models at long sequence lengths isn’t trivial and all the open source libraries start to break down when pushed. Baseten Loops does the hard work to simplify the gradient update to a couple of lines of code. We want ML teams to not worry about the infra + training library, and spend their time looking at their data and reward shaping. Loops gives everyone the ability to do frontier RL robustly and then deploy using Baseten’s inference stack to make the model go brrrr and all the 9s of uptime. At Baseten research, we’re just getting started. Online RL and ultra long context training coming soon...

Raymond Cano@vim_dzl

x.com/i/article/2052…

English

1.5K

Mudith Jayasekara@mudithj·6 May

Supporting the labs to democratise intelligence

Tuhin Srivastava@tuhinone

Model labs should spend their time pushing the frontier, not thinking about API keys, rate limits, metering, and billing. Today, we're launching Baseten Frontier Gateway: the fastest path from trained weights to a production, white-labeled API. baseten.co/blog/introduci…

English

270

Mudith Jayasekara@mudithj·6 May

Working on this with @harvey has really shown how thoughtful they are about embedding frontier legal reasoning into the models they serve. Lots of exciting work to come!

Gabe Pereyra@gabepereyra

x.com/i/article/2051…

English

521

Mudith Jayasekara@mudithj·22 Nis

@lachygroom @baseten we love to hear it

English

Lachy Groom@lachygroom·20 Nis

codex + computer use + @baseten is a magical experience for deploying OSS models in minutes, entirely hands off after initial prompt

English

325

34.9K

Mudith Jayasekara@mudithj·18 Nis

@thealexker very the difference between human vs LLM-generated

English

155

Alex Ker 🔭@thealexker·17 Nis

x.com/i/article/2041…

ZXX

737

106.9K

Mudith Jayasekara@mudithj·11 Nis

@clarejtbirch @TobinSouth I knew there was a reason ant hired him

English

clare ❤️‍🔥@clarejtbirch·11 Nis

“do you know I was Queensland state ballroom dancing champion in grade 7?” - @TobinSouth

English

1.1K

Mudith Jayasekara retweetledi

Charlie O'Neill@oneill_c·7 Nis

@_alyxya @baseten @mudithj and I are good routers (and we come with load balancer)

English

587

Mudith Jayasekara retweetledi

sshkhr@sshkhr16·5 Nis

If your startup doesn't have a Tri Dao on your inference team you're ngmi

Cody Steinmetz@0xCodyS

>Sees insane GLM-5/Kimi-K2.5 speeds >Looks inside >@tri_dao every time.

English

260

33.3K

Mudith Jayasekara@mudithj·3 Nis

@oneill_c @part_harry_ on ya bike if you don’t

English

182

Charlie O'Neill@oneill_c·3 Nis

If a researcher comes in and doesn't look at their data we take them out to pasture or get them to do qwen 3.5 forward passes by hand until they learn their lesson @mudithj @part_harry_

Leonard Tang@leonardtang_

x.com/i/article/2040…

English

12.2K

Mudith Jayasekara@mudithj·3 Nis

So much of the alpha in post-training comes from figuring out what the right learning signal is to gradient update on. Yes, algorithmic improvements are exciting for researchers, but LOOK AT YOUR DATA, LOOK AT YOUR DATA, LOOK AT YOUR DATA (@part_harry_ @oneill_c) is always what we end up falling back to as the most important lift.

Leonard Tang@leonardtang_

x.com/i/article/2040…

English

1.1K

Mudith Jayasekara@mudithj·1 Nis

Finding an intermediate memory layer between the full KV cache and lossy compression methods like natural language memory files is essential for real human work. The real human work that will be done by long horizon agentic workflows. This is some of the most exciting work we've done at Baseten yet, phase 2 and 3 coming soon.

Charlie O'Neill@oneill_c

x.com/i/article/2039…

English

798

Mudith Jayasekara retweetledi

Harry Partridge@part_harry_·30 Mar

Human-in-the-loop RL is necessarily done at group size 1; you cannot do a group of rollouts with only one human. i.e. there is no baseline for you to subtract for each input prompt. This is by far the most interesting and under-discussed part of this announcement. The same was true for their tab-completions model. From the wording in their posts, it sounds like they are using plain REINFORCE (no mention of value functions) with a large batch size + re-evaluating each checkpoint to guard against high variance. Cursor is implicitly revealing an important empirical result: with a large enough batch size, simple REINFORCE just works, no baseline needed. In other words, large scale continual learning is solved.

Cursor@cursor_ai

Earlier this week, we published our technical report on Composer 2. We're sharing additional research on how we train new checkpoints. With real-time RL, we can ship improved versions of the model every five hours.

English

255

41K

Mudith Jayasekara@mudithj·29 Mar

@oneill_c time to migrate @part_harry_ midnight sober ‘drunk’ RL ramblings to twitter

will brown@willccbb

we’re in the process of migrating from Linear to Twitter

English

155

Mudith Jayasekara@mudithj·28 Mar

@oneill_c @Jozef_Nathaniel his favourite

English