Rafi Ayub

79 posts

Rafi Ayub

Rafi Ayub

@theayubinator

AI at @AnthropicAI. Former core torchtune dev and LLM fine-tuning at @MetaAI and @PyTorch.

Katılım Mart 2024
97 Takip Edilen136 Takipçiler
Rafi Ayub retweetledi
Ariel
Ariel@redtachyon·
btw torchtune is officially reinforcement learning - the GRPO implementation is officially merged, the entire codebase is really clean and modifiable, so go out there, reinforcement learn your LLMs, and any contributions welcome!
English
8
32
438
44.5K
Rafi Ayub retweetledi
Genius
Genius@Genius·
Genius tweet media
ZXX
142
3K
41.2K
1.4M
Rafi Ayub retweetledi
Philip Bontrager
Philip Bontrager@FilipoGiovanni·
If you have a lot of experience training and fine-tuning ML models and want to help bring that expertise to the community, we’re looking to hire a new member for the torchtune team! Full description in link below
English
1
7
6
1.9K
Rafi Ayub retweetledi
joe
joe@official_j3rck·
New release new release new release!!! 🎁 Kaggle collab notebooks 🎁 Early exit training recipe 🎁 QAT + LoRA github.com/pytorch/torcht…
English
1
1
5
180
Rafi Ayub retweetledi
Jim
Jim@jimchang·
starting a VC fund that invests in founders who sleep 8 hours a night, have significant others, consistently work out and bathe regularly
English
201
87
2.4K
136.1K
Rafi Ayub retweetledi
sunshine.base.eth
sunshine.base.eth@sunshinevndetta·
ai ain’t taking everyone’s jobs, let’s be real. what it’s actually doing is making the human touch worth its weight in gold. when a single prompt can spit out a track, live acts and real bands are gonna hit harder than ever, trust me. as prompt-to-video tech blows up, live theater is about to flex its status exclusive, top-tier, luxury vibes only. jazz clubs, underground poetry readings, and the rise of live painting as full blown performance art shows on a mainstream level. i can already picture schools bussing kids in just to catch a glimpse of real human creativity, raw and unfiltered instead of homeworks like watching a movie on netflix or a documentary human made art’s about to go premium, mega luxurious, almost untouchable. handwritten books, custom-made crafts, anything touched by human hands will be the new holy grail of pop culture. give it a decade even less, and that’ll be the game. just some thoughts running through my head lately. feels like today is the perfect time to create art, start today.
English
58
105
891
201.3K
Rafi Ayub
Rafi Ayub@theayubinator·
Knowledge distillation is how the Llama 3.2 1B and 3B models still pack a punch and why they’re our most used models. It will remain key in creating lightweight, performant, task-specific LLMs to automate production workflows. Read more for how this is done in torchtune 👇
PyTorch@PyTorch

We're happy to announce the addition of knowledge distillation to torchtune, a PyTorch library for easily authoring, fine-tuning & experimenting with #LLMs. Check it out: hubs.la/Q02YyqDG0 Distilling Llama3.1 8B into 1B in #torchtune

English
0
0
4
169
Rafi Ayub
Rafi Ayub@theayubinator·
Activation offloading is one of those few free lunches - 20% reduction in memory with only a 1% slowdown by leveraging high throughput memory bandwidth. Just enable with a single parameter in your torchtune config 💪
joe@official_j3rck

torchtune v0.4.0 is out: github.com/pytorch/torcht…! 🤏reduce memory by a further 20% using activation offloading 🧠try out the newest cutting-edge models from qwen2.5 💪multimodal training is BIGGER and BADDER w/ support for Llama3.2V 90B Happy tuning 🫡

English
0
0
5
186
Rafi Ayub
Rafi Ayub@theayubinator·
@marksaroufim @t0kenl1mit @kakemeister MHA and transformer components have been rewritten so many times for different models because of small but significant differences. Only recently has the field converged to more standard decoder components. Still, it’s been hard to maintain a unified MHA just within torchtune
English
1
0
4
161
Mark Saroufim
Mark Saroufim@marksaroufim·
If you could change one thing about PyTorch what would it be?
English
47
16
250
118.1K
Rafi Ayub
Rafi Ayub@theayubinator·
@ashwinb This must’ve been written by a Southern Hemispheran
English
0
0
0
18
joe
joe@official_j3rck·
bored bored bored
English
1
0
0
84
Rafi Ayub retweetledi
PyTorch
PyTorch@PyTorch·
PyTorch 2.5 is here 🔥 We are excited to announce the release of #PyTorch 2.5, featuring a new CuDNN backend for SDPA, regional compilation of torch.compile, & TorchInductor CPP backend performance speedup Read more in our blog: hubs.la/Q02TRs9p0
PyTorch tweet media
English
10
154
663
50.4K