FairMind

5 posts

FairMind banner
FairMind

FairMind

@FairMindAI

Pioneering ethical #GenerativeAI solutions for business transformation and social good. Driving innovation responsibly. #AIForChange #TechWithPurpose

Entrou em Ocak 2024
4 Seguindo13 Seguidores
FairMind retweetou
Andrej Karpathy
Andrej Karpathy@karpathyยท
Programming is changing so fast... I'm trying VS Code Cursor + Sonnet 3.5 instead of GitHub Copilot again and I think it's now a net win. Just empirically, over the last few days most of my "programming" is now writing English (prompting and then reviewing and editing the generated diffs), and doing a bit of "half-coding" where you write the first chunk of the code you'd like, maybe comment it a bit so the LLM knows what the plan is, and then tab tab tab through completions. Sometimes you get a 100-line diff to your code that nails it, which could have taken 10+ minutes before. I still don't think I got sufficiently used to all the features. It's a bit like learning to code all over again but I basically can't imagine going back to "unassisted" coding at this point, which was the only possibility just ~3 years ago.
English
517
2K
18.2K
2.8M
FairMind retweetou
Sebastian Raschka
Sebastian Raschka@rasbtยท
If you are looking for something to read this weekend, I am happy to share that Chapter 7 on instruction finetuning LLMs is now finally live on the Manning website: manning.com/books/build-a-โ€ฆ This is the longest chapter in the book and takes a from-scratch approach to implementing the instruction finetuning pipeline. This includes everything from input formatting to batching with a custom collate function, masking padding tokens, the training loop itself, and scoring the response quality of the finetuned LLM on a custom test set. (The exercises include changing prompt styles, instruction masking, and adding LoRA.) As a side note, it's also the last chapter, and the publisher is currently preparing the layouts for the print version. PS: After moving, traveling, and returning from the awesome SciPy conference, I am now also super eager to finally type up my notes from recent research papers on the instruction finetuning front. I will share them soon, in the next few days, as a follow-up!
Sebastian Raschka tweet media
English
22
262
1.6K
160.7K
FairMind retweetou
Alexio Cassani
Alexio Cassani@alexioยท
๐Ÿ’™ Exciting Milestone at @FairMindAI! ๐Ÿ’š In just two days following the release of the @SapienzaNLP team's Minerva base models, we at #FairMind are thrilled to unveil our latest innovation: ๐— ๐—ถ๐—ป๐—ฒ๐—ฟ๐˜ƒ๐—ฎ-๐Ÿฏ๐—•-๐—œ๐—ป๐˜€๐˜๐—ฟ๐˜‚๐—ฐ๐˜-๐˜ƒ๐Ÿญ.๐Ÿฌ. This new iteration enhances instruction-following capabilities, marking a significant step forward in AI technology. ๐—ค๐˜‚๐—ถ๐—ฐ๐—ธ ๐—œ๐—ป๐˜€๐—ถ๐—ด๐—ต๐˜๐˜€: ๐Ÿš€ ๐—•๐—ฎ๐˜€๐—ฒ ๐— ๐—ผ๐—ฑ๐—ฒ๐—น: Minerva-3B-base-v1.0 (Transformer-based, bilingual in Italian and English) ๐Ÿ“š ๐——๐—ฎ๐˜๐—ฎ ๐—จ๐˜๐—ถ๐—น๐—ถ๐˜‡๐—ฒ๐—ฑ: Italian translation of the Stanford Alpaca-Cleaned dataset, broadening the model's applicability across domains. ๐ŸŽฏ ๐—ข๐—ฏ๐—ท๐—ฒ๐—ฐ๐˜๐—ถ๐˜ƒ๐—ฒ: Sharpen the modelโ€™s proficiency in processing and executing complex instructions in Italian. ๐Ÿ“ˆ ๐—ฃ๐—ฒ๐—ฟ๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐—ป๐—ฐ๐—ฒ: * Hellaswag_it: 0.5197 * Arc_it: 0.3157 * M_mmlu_it 5-shot: 0.2631 * Average: 0.366 ๐ŸŒ Discover more on our model page: ๐— ๐—ถ๐—ป๐—ฒ๐—ฟ๐˜ƒ๐—ฎ-๐Ÿฏ๐—•-๐—œ๐—ป๐˜€๐˜๐—ฟ๐˜‚๐—ฐ๐˜-๐˜ƒ๐Ÿญ.๐Ÿฌ on Hugging Face: huggingface.co/FairMind/Minerโ€ฆ #AI #MachineLearning #TechInnovation #StartupLife #ModelRelease
English
1
2
3
281
FairMind
FairMind@FairMindAIยท
Hello World ๐Ÿ’™๐Ÿ’š
FairMind tweet media
English
0
0
0
141