J Feldman

193 posts

J Feldman

J Feldman

@Jdfeldmo

Katılım Mayıs 2013
114 Takip Edilen35 Takipçiler
J Feldman retweetledi
Predibase by Rubrik
Predibase by Rubrik@predibase·
Today we're thrilled to announce the first end-to-end platform for Reinforcement Fine-Tuning. With just a dozen labeled data points, you can outperform #OpenAI o1 and #DeepSeekR1 on complex tasks. Built on the #GRPO methodology that DeepSeek-R1 popularized, our platform delivers exceptional results. In our real-world PyTorch to Triton transpilation case study, we achieved 3x higher accuracy than OpenAI o1 and DeepSeek-R1 when writing GPU code. Check out the thread below to learn how you can adapt an #opensource #LLM to your use cases with unmatched efficiency. #rft
English
20
69
500
1M
J Feldman retweetledi
Rohan Paul
Rohan Paul@rohanpaul_ai·
Want AI that speaks your language? Fine-tuning is the spark you need. Essentially, you’re tailoring an off-the-shelf model to your precise goals. The catch? Historically, it took an avalanche of labeled data—thousands of samples—to train your ‘old dog’ to perform a brand-new trick. I was getting tired of labeling thousands of samples and the cost. And then I looked into Reinforcement Fine-Tuning (RFT) from @predibase . 🧵1/n With RFT, a lightweight opensource LLM can rapidly evolve into an exceptional problem-solving machine. 📚 Reinforcement Method Predibase's Reinforcement Fine-Tuning (RFT) addresses the constraints of classic supervised approaches. It systematically applies a reward function to guide model updates. - Advanced policy gradient methods that shorten iteration cycles. - This setup surpasses standard fine-tuning by providing precise feedback and higher accuracy with minimal labeled data. ⚙️ Minimal Labeled Data - RFT excels with fewer than 100 labeled examples. - It discards the old requirement for massive datasets by validating responses against a reward metric. - This approach cuts data-collection costs significantly. - Small, controlled feedback loops sharpen performance in logical or multi-hop tasks. 🔗 Chain-of-Thought Boost Chain-of-thought integration refines step-by-step reasoning. RFT checks partial correctness and then readjusts updates to strengthen valid outputs. This self-correcting mechanism limits error propagation in arithmetic or combinatorial tasks. Iterative feedback enables the model to refine its own reasoning. 🚀 Integration and Impact The RFT pipeline is seamlessly integrated within the Predibase platform. - Debugging tools help trace reward distribution, and distributed training supports larger-scale tasks. - Models can be deployed or tracked in a serverless manner without hardware overhead. - Adaptive reward shaping and automated checkpointing accelerate development. - RFT extends fine-tuning capabilities for LLMs in limited-data domains. - This eliminates hefty labeling expenses while sustaining performance gains. - It directly tackles situations where correctness is measurable but labeled data is scarce. ------ The below image is from their official technical report (link in comment). RFT leads the pack at each data scale. Note the jump in performance when training with just 10 or 100 samples.
Rohan Paul tweet media
English
6
13
99
11.8K
J Feldman retweetledi
Saam Motamedi
Saam Motamedi@saammotamedi·
Huge release from @Predibase today -- the first end-to-end platform for Reinforcement Fine-Tuning Bringing the techniques that power DeepSeekR1 to any open source model and data
Predibase by Rubrik@predibase

Today we're thrilled to announce the first end-to-end platform for Reinforcement Fine-Tuning. With just a dozen labeled data points, you can outperform #OpenAI o1 and #DeepSeekR1 on complex tasks. Built on the #GRPO methodology that DeepSeek-R1 popularized, our platform delivers exceptional results. In our real-world PyTorch to Triton transpilation case study, we achieved 3x higher accuracy than OpenAI o1 and DeepSeek-R1 when writing GPU code. Check out the thread below to learn how you can adapt an #opensource #LLM to your use cases with unmatched efficiency. #rft

English
2
8
24
5.6K
J Feldman retweetledi
Nordic Semiconductor
Nordic Semiconductor@NordicTweets·
New #WirelessQ! Read how: 🔥 how #IoT promises quicker detection of wildfires 💰 sensors tech will continue to become cheaper, more advanced and more widely available 🧸#Wireless tech helps create innovative products to engage and educate children bit.ly/3nroXmT📚
Nordic Semiconductor tweet media
English
0
5
12
2.7K
J Feldman
J Feldman@Jdfeldmo·
@united waited on hold for an hour after being told 5 minute wait time only to have the agent hang up on me in the first 30 seconds. I understand flights are being cancelled but at least provide the customer support staffing to help your customers.
English
0
0
1
0
J Feldman retweetledi
Tyler Hoffman
Tyler Hoffman@ty_hoff·
Had another great time talking with @embeddedfm, this time about firmware developer productivity within small and large organizations.
English
0
2
11
0
J Feldman retweetledi
Moose Trax
Moose Trax@MooseTraxNFT·
Our CustoMoose collection is dropping next week! Read about the details on our latest blog post: @moosetraxnft/customoose-is-here-95ecddf53578" target="_blank" rel="nofollow noopener">medium.com/@moosetraxnft/… Join us on Discord if you have any lingering questions about the drop!
English
9
24
42
0
J Feldman retweetledi
Bluetooth
Bluetooth@BluetoothSIG·
Find out how Bluetooth Audio Sharing will bring new audio experiences to consumers all over the world: bit.ly/3jtKM62 #LEAudio
English
0
3
4
0
J Feldman retweetledi
Own the Moment
Own the Moment@OwnTheMomentNFT·
🎥SHOWCASE GIVEAWAY🎥 We understand influencers have an advantage. That's not fair so we're doing something about it. Whatever pack the OTM showcase wins, we'll gift the Moments to YOU! ✅RT this tweet ✅Like the showcase ✅Tag 2 friends in comments nbatopshot.com/showcases/7c99…
English
562
492
469
0
J Feldman retweetledi
CoinDesk
CoinDesk@CoinDesk·
EXCLUSIVE: Privacy-centric web browser @Brave is raising a Series A round at a valuation of roughly $133 million, sources tell CoinDesk. ow.ly/qKom30oKHGr @BradyDale reports
English
37
183
486
0
J Feldman retweetledi
DataLight
DataLight@DataLightMe·
Just 10 years ago first #Bitcoin transaction was sent and it’s astonishing to see industry growth and how many new assets have gained traction. In this unique visualization by DataLight, you can track the top-10 crypto assets by market cap, from crypto’s early days until today.
English
87
891
2.4K
0
J Feldman retweetledi
Cheddar
Cheddar@cheddar·
In a busy office and need to take a phone call? Next time, use this device.
English
91
59
187
0
J Feldman retweetledi
Braven Brewing Co.
Braven Brewing Co.@BravenBrewing·
Remember how we just announced the debut of Flashy Ways this morning? Well we found out at 10 am that it just won a Bronze Medal 🥉 at the 2018 New York Craft Brewers Competition and Governor’s Excelsior Cup! Stoooooooooked!!!!!
Braven Brewing Co. tweet media
Brooklyn, NY 🇺🇸 English
2
2
12
0
J Feldman retweetledi
Boxmining
Boxmining@boxmining·
Buying beer with 0.001 Bitcoin using PundiX
English
94
414
1.6K
0
J Feldman retweetledi
ESPN
ESPN@espn·
242 wins. 21 seasons. Bartolo Colon is still going strong on his 45th birthday.
GIF
English
112
1.2K
5.6K
0