John Schulman

178 posts

John Schulman banner
John Schulman

John Schulman

@johnschulman2

Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music

Katılım Mayıs 2021
1.8K Takip Edilen74K Takipçiler
Jifan Zhang
Jifan Zhang@jifan_zhang·
Just realized it’s already been a month since I joined @thinkymachines. Time flies fast here and I’ve been enjoying life, happier and more motivated than ever to work. Come join us!
Jifan Zhang tweet media
English
10
1
136
9.5K
John Schulman
John Schulman@johnschulman2·
@jankulveit good stuff -- any recs for the next things to read/watch on @akorinek's thinking, targeted at a non-economist audience?
English
3
1
104
11.5K
Jan Kulveit
Jan Kulveit@jankulveit·
1. Obviously Dario knows way more about the effects of AGI on the labour market than almost any economist, by the virtue of treating AGI seriously, and not "as if nothing ever happens" 2. Yes, listen to the actual expert: youtube.com/watch?v=Z8K-Np… 3. LeCun is not a serious voice.
YouTube video
YouTube
Yann LeCun@ylecun

Dario is wrong. He knows absolutely nothing about the effects of technological revolutions on the labor market. Don't listen to him, Sam, Yoshua, Geoff, or me on this topic. Listen to economists who have spent their career studying this, like @Ph_Aghion , @erikbryn , @DAcemogluMIT , @amcafee , @davidautor

English
41
17
309
98.8K
John Schulman
John Schulman@johnschulman2·
Luke and Rudolf's writing on keeping humans central in an AI-powered world sparked a lot of discussion at Thinking Machines. For me, it captured some things I'd been thinking about but hadn't put as clearly. The more I got to know them and learned about their work, the more I wanted to work together. Really glad they're joining us.
Workshop Labs@WorkshopLabs

Workshop Labs is joining @thinkymachines. We believe there's a path for AI to make humans matter more. We couldn’t be prouder to join Thinking Machines to see this work through. workshoplabs.ai/blog/wsl-joini…

English
5
21
461
58.8K
Arcee.ai
Arcee.ai@arcee_ai·
Today we're releasing Trinity-Large-Thinking. Available now on the Arcee API, with open weights on Hugging Face under Apache 2.0. We built it for developers and enterprises that want models they can inspect, post-train, host, distill, and own.
English
101
245
2.1K
698.4K
John Schulman
John Schulman@johnschulman2·
Great work by Chroma training a search agent with SoTA efficiency. Lots of cool details: a prune tool for editing context mid-search, a synthetic data pipeline with verification steps, and a curriculum that shifts from recall to precision. Trained with Tinker!
Chroma@trychroma

Introducing Chroma Context-1, a 20B parameter search agent. > pushes the pareto frontier of agentic search > order of magnitude faster > order of magnitude cheaper > Apache 2.0, open-source

English
16
33
467
69.3K
John Schulman
John Schulman@johnschulman2·
@stuhlmueller reach out if you want any help with this! would be interested to hear about your use case.
English
1
0
4
506
Andreas Stuhlmüller
Andreas Stuhlmüller@stuhlmueller·
What's the best (large) model + service/infra for RL training agentic models with custom constitutions these days?
English
1
1
8
1.1K
giovanni
giovanni@regulargio·
Joined @miramurati, @soumithchintala, @johnschulman2, @neal_wu at @thinkymachines, they promised me 45lb weights. No compute constraints, increased weights, and crushing PRs. Recruiting tells me u gotta apply: #join-us" target="_blank" rel="nofollow noopener">thinkingmachines.ai/#join-us
giovanni tweet media
Thinking Machines@thinkymachines

We are partnering with @nvidia to power our frontier model training and platforms delivering customizable AI. thinkingmachines.ai/news/nvidia-pa…

English
18
8
343
74.4K
JingyuanLiu
JingyuanLiu@JingyuanLiu123·
Some updates: I've always been bullish on TML, and I actually joined TML this Monday Looking back, I am feeling so lucky that I have the privilege to work closely with the best optimization experts on the Muon optimizer ( @Jianlin_S from Kimi and @clu_cheng from Meta). Now I am so excited to be able to work with @jxbz and build new cool things! (On the other hand, there have always been some bad rumors about Meta TBD's potential failure. That's not true! From my personal experiences, it really has the best talents in the field, and I really enjoyed learning from the lab. The avocado model will for sure be great!)
JingyuanLiu@JingyuanLiu123

hmm I sort of disagree and I am bullish for TML. I think they really really have the top talents that I admire in the field, e.g. Jeremy and Sam for optimization, Songlin for Attn, Lia for MoE, Andrew for FSDPv2, and a bunch more folks it's just natural that it takes a while to publish good models: - dpsk starts to publish papers in 2023, even piblished dspkv2 (which I think is already amazing) in mid 2024 and nobody cares, until dpskv3 and r1 - msh took 10+ month to deliver a first not bad long ctx model in 2023 and be silent for the whole 2024 year, and starts to catch up gradually in 2025 - qwen starts to be a much better model than llama until qwen2.5, mid or late 2024, while the lab has been there forever it takes time to get infra and data done, but as long as you have good folks, and principled ways of doing science and experiments, some time or later, scaling laws will pay back

English
41
8
274
54.2K
John Schulman
John Schulman@johnschulman2·
@jeremyphoward True, but let’s say there’s a system that’s supposed to have human oversight, but some operators start set up an auto approve system, similar to how Tesla drivers override the hands-on-the-wheel check. That’s the kind of thing you could detect with the right monitoring
English
2
0
9
3K
Jeremy Howard
Jeremy Howard@jeremyphoward·
@johnschulman2 If a contract says you have to support autonomous weapons, then it doesn't matter what "safety stack" you have - you gotta deliver on the contract or you'll be in a *lot* of trouble.
English
8
1
109
8.4K
John Schulman
John Schulman@johnschulman2·
There's some discussion about whether contract terms ("all lawful use" vs more specific terms) vs safety stack (monitoring systems) are more effective as safeguards against AI misuse. It'd be useful for someone to game out how they'd hold up against historical incidents of surveillance abuse like COINTELPRO, or what authoritarian governements do today.
English
21
20
373
48.2K
John Schulman
John Schulman@johnschulman2·
I suspect usage policies are generally pretty weak, and what matters more is transparency (so abuses are harder to hide) and criminal penalties (whether the officials violating the policies will actually go to jail)
English
1
3
102
10.5K
John Schulman retweetledi
Tinker
Tinker@tinkerapi·
We’ve loved watching the Tinker community grow, and we're excited to have a place to share product updates, helpful recipes, and spotlights on the amazing things Tinkerers are building. Get started with Tinker here: thinkingmachines.ai/tinker/
English
8
21
181
130.4K
Yifei Zhou
Yifei Zhou@YifeiZhou02·
Belated life update: I started my next chapter at Thinking Machines Lab this week, and it’s been an incredible experience — unmatched work culture and talent density. Extremely bullish on what the team is building 🚀
Yifei Zhou tweet media
English
50
11
760
81.7K
Belinda
Belinda@belindmo·
Did you know that Claude Code is so powerful now that it can fine-tune models for you? We made a Claude Code skill using @thinkymachine's Tinker to fine-tune models ->
English
42
114
1.7K
162.4K