John Schulman

178 posts

John Schulman

@johnschulman2

Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music

Katılım Mayıs 2021

1.8K Takip Edilen74K Takipçiler

John Schulman@johnschulman2·5d

@jifan_zhang @thinkymachines welcome!

English

1.1K

Jifan Zhang@jifan_zhang·6d

Just realized it’s already been a month since I joined @thinkymachines. Time flies fast here and I’ve been enjoying life, happier and more motivated than ever to work. Come join us!

English

136

9.5K

John Schulman@johnschulman2·20 Nis

@jankulveit good stuff -- any recs for the next things to read/watch on @akorinek's thinking, targeted at a non-economist audience?

English

104

11.5K

Jan Kulveit@jankulveit·19 Nis

1. Obviously Dario knows way more about the effects of AGI on the labour market than almost any economist, by the virtue of treating AGI seriously, and not "as if nothing ever happens" 2. Yes, listen to the actual expert: youtube.com/watch?v=Z8K-Np… 3. LeCun is not a serious voice.

YouTube

Yann LeCun@ylecun

Dario is wrong. He knows absolutely nothing about the effects of technological revolutions on the labor market. Don't listen to him, Sam, Yoshua, Geoff, or me on this topic. Listen to economists who have spent their career studying this, like @Ph_Aghion , @erikbryn , @DAcemogluMIT , @amcafee , @davidautor

English

309

98.8K

John Schulman@johnschulman2·13 Nis

Luke and Rudolf's writing on keeping humans central in an AI-powered world sparked a lot of discussion at Thinking Machines. For me, it captured some things I'd been thinking about but hadn't put as clearly. The more I got to know them and learned about their work, the more I wanted to work together. Really glad they're joining us.

Workshop Labs@WorkshopLabs

Workshop Labs is joining @thinkymachines. We believe there's a path for AI to make humans matter more. We couldn’t be prouder to join Thinking Machines to see this work through. workshoplabs.ai/blog/wsl-joini…

English

461

58.8K

John Schulman@johnschulman2·2 Nis

@arcee_ai congrats on the great release!

English

3.7K

Arcee.ai@arcee_ai·1 Nis

Today we're releasing Trinity-Large-Thinking. Available now on the Arcee API, with open weights on Hugging Face under Apache 2.0. We built it for developers and enterprises that want models they can inspect, post-train, host, distill, and own.

English

101

245

2.1K

698.4K

John Schulman@johnschulman2·26 Mar

Great work by Chroma training a search agent with SoTA efficiency. Lots of cool details: a prune tool for editing context mid-search, a synthetic data pipeline with verification steps, and a curriculum that shifts from recall to precision. Trained with Tinker!

Chroma@trychroma

Introducing Chroma Context-1, a 20B parameter search agent. > pushes the pareto frontier of agentic search > order of magnitude faster > order of magnitude cheaper > Apache 2.0, open-source

English

467

69.3K

John Schulman@johnschulman2·20 Mar

Models that are great at calibrated predictions will be transformative for decision making. Excited about Mantic's work and proud they're using Tinker. Their new blog post digs into their methodology and findings.

Toby Shevlane@tshevl

I always dreamed of AGI as a wise advisor for humanity. Although LLMs are great for coding & knowledge work, I wouldn’t trust them to give me advice on my career, business strategy, or policy preferences. How can we build AI systems optimized for wisdom? At Mantic we believe the unlock is prediction: predicting world events as accurately as possible, and hill-climbing this single metric. Today we share some recent progress on the Thinking Machines website, having found Tinker a great platform for our RL experiments. TL;DR: We RL-tune gpt-oss-120b to become a better forecaster than any other model. Having good scaffolding is a prerequisite. A fun result: our tuned model + Grok are decorrelated from the other best models, and so are the most indispensable when picking a team.

English

388

87.7K

John Schulman@johnschulman2·15 Mar

@stuhlmueller reach out if you want any help with this! would be interested to hear about your use case.

English

506

Andreas Stuhlmüller@stuhlmueller·7 Mar

Maybe Tinker + Qwen3.5-397B?

Filipino

312

Andreas Stuhlmüller@stuhlmueller·6 Mar

What's the best (large) model + service/infra for RL training agentic models with custom constitutions these days?

English

1.1K

John Schulman@johnschulman2·12 Mar

@neal_wu @thinkymachines @miramurati @soumithchintala @dchaplot @alexgartrell Welcome!

English

3.2K

John Schulman retweetledi

Neal Wu@neal_wu·12 Mar

I joined @thinkymachines to work with @johnschulman2, @miramurati, @soumithchintala, @dchaplot, @alexgartrell, and others on the future of collaborative AI! And I can confirm that we do have 45 lb weights. If you're interested in training weights and/or Tinkering with us, join us! #join-us" target="_blank" rel="nofollow noopener">thinkingmachines.ai/#join-us

English

1.3K

169.4K

John Schulman@johnschulman2·12 Mar

@regulargio @miramurati @soumithchintala @neal_wu @thinkymachines welcome!

English

2.1K

giovanni@regulargio·12 Mar

Joined @miramurati, @soumithchintala, @johnschulman2, @neal_wu at @thinkymachines, they promised me 45lb weights. No compute constraints, increased weights, and crushing PRs. Recruiting tells me u gotta apply: #join-us" target="_blank" rel="nofollow noopener">thinkingmachines.ai/#join-us

Thinking Machines@thinkymachines

We are partnering with @nvidia to power our frontier model training and platforms delivering customizable AI. thinkingmachines.ai/news/nvidia-pa…

English

343

74.4K

John Schulman@johnschulman2·6 Mar

@JingyuanLiu123 @Jianlin_S @clu_cheng Welcome!

English

1.2K

JingyuanLiu@JingyuanLiu123·6 Mar

Some updates: I've always been bullish on TML, and I actually joined TML this Monday Looking back, I am feeling so lucky that I have the privilege to work closely with the best optimization experts on the Muon optimizer ( @Jianlin_S from Kimi and @clu_cheng from Meta). Now I am so excited to be able to work with @jxbz and build new cool things! (On the other hand, there have always been some bad rumors about Meta TBD's potential failure. That's not true! From my personal experiences, it really has the best talents in the field, and I really enjoyed learning from the lab. The avocado model will for sure be great!)

JingyuanLiu@JingyuanLiu123

hmm I sort of disagree and I am bullish for TML. I think they really really have the top talents that I admire in the field, e.g. Jeremy and Sam for optimization, Songlin for Attn, Lia for MoE, Andrew for FSDPv2, and a bunch more folks it's just natural that it takes a while to publish good models: - dpsk starts to publish papers in 2023, even piblished dspkv2 (which I think is already amazing) in mid 2024 and nobody cares, until dpskv3 and r1 - msh took 10+ month to deliver a first not bad long ctx model in 2023 and be silent for the whole 2024 year, and starts to catch up gradually in 2025 - qwen starts to be a much better model than llama until qwen2.5, mid or late 2024, while the lab has been there forever it takes time to get infra and data done, but as long as you have good folks, and principled ways of doing science and experiments, some time or later, scaling laws will pay back

English

274

54.2K

John Schulman@johnschulman2·1 Mar

@jeremyphoward True, but let’s say there’s a system that’s supposed to have human oversight, but some operators start set up an auto approve system, similar to how Tesla drivers override the hands-on-the-wheel check. That’s the kind of thing you could detect with the right monitoring

English

Jeremy Howard@jeremyphoward·1 Mar

@johnschulman2 If a contract says you have to support autonomous weapons, then it doesn't matter what "safety stack" you have - you gotta deliver on the contract or you'll be in a *lot* of trouble.

English

109

8.4K

John Schulman@johnschulman2·1 Mar

There's some discussion about whether contract terms ("all lawful use" vs more specific terms) vs safety stack (monitoring systems) are more effective as safeguards against AI misuse. It'd be useful for someone to game out how they'd hold up against historical incidents of surveillance abuse like COINTELPRO, or what authoritarian governements do today.

English

373

48.2K

John Schulman@johnschulman2·1 Mar

I suspect usage policies are generally pretty weak, and what matters more is transparency (so abuses are harder to hide) and criminal penalties (whether the officials violating the policies will actually go to jail)

English

102

10.5K

John Schulman@johnschulman2·5 Şub

@YouJiacheng coming soon, first catching up on older stuff

English

5.3K

You Jiacheng@YouJiacheng·5 Şub

wait, TTT-discover is not mentioned?

Tinker@tinkerapi

Since Tinker launched, our community has used it to train state-of-the-art models, build infrastructure, and publish novel research. We will be highlighting this creative work in regular roundups, and hope to inspire your own Tinkering as well.

English

12.2K

John Schulman retweetledi

Tinker@tinkerapi·4 Şub

We’ve loved watching the Tinker community grow, and we're excited to have a place to share product updates, helpful recipes, and spotlights on the amazing things Tinkerers are building. Get started with Tinker here: thinkingmachines.ai/tinker/

English

181

130.4K

John Schulman@johnschulman2·18 Oca

@YifeiZhou02 welcome! excited to be working together

English

4.8K

Yifei Zhou@YifeiZhou02·17 Oca

Belated life update: I started my next chapter at Thinking Machines Lab this week, and it’s been an incredible experience — unmatched work culture and talent density. Extremely bullish on what the team is building 🚀

English

760

81.7K

John Schulman@johnschulman2·17 Oca

@josancamon19 @thinkymachines sorry for the delay -- I've been a bit busy, but one of us will look at your PRs shortly!

English

303

18.4K

Joan Cabezas@josancamon19·16 Oca

I should've known smth was going on at @thinkymachines cause @johnschulman2 stopped reviewing my PRs :(

English

105

23.4K

John Schulman@johnschulman2·9 Oca

@belindmo @thinkymachine very cool! am a big claude code user, so I'll be trying this out

English

4.4K

Belinda@belindmo·8 Oca

Did you know that Claude Code is so powerful now that it can fine-tune models for you? We made a Claude Code skill using @thinkymachine's Tinker to fine-tune models ->

English

114

1.7K

162.4K

John Schulman@johnschulman2·24 Ara

Humans are jagged, and organizations (from companies to civilizations) have evolved as harnesses to make best use of us despite our faults

xlr8harder@xlr8harder

Weirdly, I actually think Yann is making an important point here that is getting lost in semantics. Human intelligence also has jagged frontiers, we're just used to the shape.

English

660

123.9K

Keşfet

@jifan_zhang @thinkymachines @jankulveit @akorinek @arcee_ai @stuhlmueller @neal_wu @miramurati