Rafi Ayub

79 posts

Rafi Ayub

@theayubinator

AI at @AnthropicAI. Former core torchtune dev and LLM fine-tuning at @MetaAI and @PyTorch.

Katılım Mart 2024

97 Takip Edilen136 Takipçiler

Rafi Ayub@theayubinator·17 Nis

@official_j3rck "slightly less"

English

joe@official_j3rck·16 Nis

Fun times, fun people, slightly less existential dread about AI Come join

Mark Saroufim@marksaroufim

If you’re excited about optimizing code that runs equally well on a single or thousands of GPUs and if you have the ability to submit a single substantial PR to a major OSS library, we want you on the PyTorch team - especially if you’re early in your career.

English

210

Rafi Ayub retweetledi

David Hershey@DavidSHershey·25 Şub

So, I did a thing 🙂 This was really just a fun little side project - I wanted to spend some time working on agents, and Pokemon was the most fun way I could come up with. And then it kinda took off! 3.7 Sonnet is so fun to watch play!

Anthropic@AnthropicAI

A few researchers at Anthropic have, over the past year, had a part-time obsession with a peculiar problem. Can Claude play Pokémon? A thread:

English

542

60.8K

Rafi Ayub retweetledi

joe@official_j3rck·23 Şub

Thanks for the contribution!! Now you can do SFT, KD, QAT, RLHF, *and* RL completely within torchtune … let’s post-train some crazy models 🫡🫡🫡

Ariel@redtachyon

btw torchtune is officially reinforcement learning - the GRPO implementation is officially merged, the entire codebase is really clean and modifiable, so go out there, reinforcement learn your LLMs, and any contributions welcome!

English

836

Rafi Ayub retweetledi

Ariel@redtachyon·22 Şub

English

438

44.5K

Rafi Ayub retweetledi

Genius@Genius·3 Şub

ZXX

142

41.2K

1.4M

Rafi Ayub@theayubinator·18 Oca

@official_j3rck No no, queue up Severance Season 2

English

joe@official_j3rck·16 Oca

Twin Peaks Season 3 queued up now No one release anything cool in the meantime

Letterboxd@letterboxd

David Lynch (1946-2025) 🖤

English

Rafi Ayub@theayubinator·26 Ara

@official_j3rck We are your family

English

joe@official_j3rck·24 Ara

Starting a fine tuning run now so I have an excuse to leave “family holiday time” every 30 mins to check loss curves

clem 🤗@ClementDelangue

AI is getting better at math but we're just scratching the surface of what they will be capable of doing IMO (ex O3 only got 25% on FrontierMath). So we're super excited to release FineMath, the best open math dataset for everyone to use. Currently number one trending datasets on HF!

English

164

Rafi Ayub retweetledi

Philip Bontrager@FilipoGiovanni·23 Ara

If you have a lot of experience training and fine-tuning ML models and want to help bring that expertise to the community, we’re looking to hire a new member for the torchtune team! Full description in link below

English

1.9K

Rafi Ayub retweetledi

joe@official_j3rck·21 Ara

New release new release new release!!! 🎁 Kaggle collab notebooks 🎁 Early exit training recipe 🎁 QAT + LoRA github.com/pytorch/torcht…

English

180

Rafi Ayub retweetledi

Jim@jimchang·24 Kas

starting a VC fund that invests in founders who sleep 8 hours a night, have significant others, consistently work out and bathe regularly

English

201

2.4K

136.1K

Rafi Ayub retweetledi

sunshine.base.eth@sunshinevndetta·18 Kas

ai ain’t taking everyone’s jobs, let’s be real. what it’s actually doing is making the human touch worth its weight in gold. when a single prompt can spit out a track, live acts and real bands are gonna hit harder than ever, trust me. as prompt-to-video tech blows up, live theater is about to flex its status exclusive, top-tier, luxury vibes only. jazz clubs, underground poetry readings, and the rise of live painting as full blown performance art shows on a mainstream level. i can already picture schools bussing kids in just to catch a glimpse of real human creativity, raw and unfiltered instead of homeworks like watching a movie on netflix or a documentary human made art’s about to go premium, mega luxurious, almost untouchable. handwritten books, custom-made crafts, anything touched by human hands will be the new holy grail of pop culture. give it a decade even less, and that’ll be the game. just some thoughts running through my head lately. feels like today is the perfect time to create art, start today.

English

105

891

201.3K

Rafi Ayub@theayubinator·18 Kas

Knowledge distillation is how the Llama 3.2 1B and 3B models still pack a punch and why they’re our most used models. It will remain key in creating lightweight, performant, task-specific LLMs to automate production workflows. Read more for how this is done in torchtune 👇

PyTorch@PyTorch

We're happy to announce the addition of knowledge distillation to torchtune, a PyTorch library for easily authoring, fine-tuning & experimenting with #LLMs. Check it out: hubs.la/Q02YyqDG0 Distilling Llama3.1 8B into 1B in #torchtune

English

169

Rafi Ayub@theayubinator·15 Kas

Activation offloading is one of those few free lunches - 20% reduction in memory with only a 1% slowdown by leveraging high throughput memory bandwidth. Just enable with a single parameter in your torchtune config 💪

joe@official_j3rck

torchtune v0.4.0 is out: github.com/pytorch/torcht…! 🤏reduce memory by a further 20% using activation offloading 🧠try out the newest cutting-edge models from qwen2.5 💪multimodal training is BIGGER and BADDER w/ support for Llama3.2V 90B Happy tuning 🫡

English

186

Rafi Ayub@theayubinator·15 Kas

@marksaroufim @t0kenl1mit @kakemeister MHA and transformer components have been rewritten so many times for different models because of small but significant differences. Only recently has the field converged to more standard decoder components. Still, it’s been hard to maintain a unified MHA just within torchtune

English

161

Mark Saroufim@marksaroufim·15 Kas

@t0kenl1mit @kakemeister pytorch.org/docs/stable/ge… should be deprecated soon - we'll have an updated doc on this soon. (the lack of a notice proves your point)

English

204

Mark Saroufim@marksaroufim·15 Kas

If you could change one thing about PyTorch what would it be?

English

250

118.1K

Rafi Ayub retweetledi

joe@official_j3rck·14 Kas

Super exciting collab to make it easier to fine tune w your favorite Kaggle models 🙏

meg.ai 🇨🇦@MeganRisdal

Coming soon! kagglehub 💙 torchtune integration github.com/pytorch/torcht…

English

740

Rafi Ayub@theayubinator·25 Eki

Wake up and do not go directly to work. Make yourself a coffee. Go for a walk. Seize a little time for yourself before the day seizes you.

NIK@ns123abc

Good morning. Gentle reminder from Karpathy

English

148

Rafi Ayub@theayubinator·20 Eki

@ashwinb This must’ve been written by a Southern Hemispheran

English

Ashwin Bharambe@ashwinb·19 Eki

Some intense smelly BS.

nature@Nature

Stop using ‘summer’, ‘winter’ and the rest when inviting researchers to events — it’s a small step, but it’s necessary and inclusive go.nature.com/4dUMxT0

English

141

Rafi Ayub@theayubinator·17 Eki

@official_j3rck git back to git

English

joe@official_j3rck·17 Eki

bored bored bored

English

Rafi Ayub retweetledi

PyTorch@PyTorch·17 Eki

PyTorch 2.5 is here 🔥 We are excited to announce the release of #PyTorch 2.5, featuring a new CuDNN backend for SDPA, regional compilation of torch.compile, & TorchInductor CPP backend performance speedup Read more in our blog: hubs.la/Q02TRs9p0

English

154

663

50.4K

Rafi Ayub@theayubinator·13 Eki

One interesting idea in Anthropic’s essay was that AI could accelerate progress in medicine not by analyzing data but by playing PI, designing and conducting experiments (with robots) based on literature it’s trained on Hard to realize that with current LLMs, which cannot reason

Dario Amodei@DarioAmodei

Machines of Loving Grace: my essay on how AI could transform the world for the better darioamodei.com/machines-of-lo…

English

159

Keşfet

@official_j3rck @marksaroufim @t0kenl1mit @kakemeister @elonmusk @BarackObama @taylorswift13 @cristiano