Inceptive

201 posts

Inceptive banner
Inceptive

Inceptive

@inceptive_tech

De la stratégie à l’industrialisation, Inceptive crée et déploie des solutions sur mesure qui mettent l’intelligence artificielle au service du réel. #AI

Grenoble, France Katılım Şubat 2014
129 Takip Edilen296 Takipçiler
Inceptive retweetledi
Sebastian Raschka
Sebastian Raschka@rasbt·
How to Teach LLMs to Follow Instructions I just shared a new tutorial video as part of the Build a Large Language Model From Scratch series! In this one, we take a pre-trained decoder-only model and turn it into a small personal assistant that can handle free-form instruction-following tasks like answering questions, rewriting text, or generating short responses. From my experience working with teams building LLM applications, I’ve noticed that many people still think instruction fine-tuning is complicated. But as you’ll see, we reuse the same loss function (and even the same training loop) as during pre-training. The only real difference? How we format the data. In this video, I walk through: 1. What instruction fine-tuning actually is 2. How to format data into prompt/response pairs using common templates 3. A lightweight dataset to demonstrate the process 4. Practical tips like dynamic padding, loss masking, and clean end-of-text handling The model we train here is not a ChatGPT competitor, but it is small and fast, which is ideal for educational use, and great way to understand the principles behind instruction-following LLMs. The full walkthrough is available here: youtube.com/watch?v=4yNswv…
YouTube video
YouTube
Sebastian Raschka tweet media
English
17
163
971
57.1K
Inceptive retweetledi
Xing Han Lu
Xing Han Lu@xhluca·
DeepSeek-R1 Thoughtology: Let’s about LLM reasoning 142-page report diving into the reasoning chains of R1. It spans 9 unique axes: safety, world modeling, faithfulness, long context, etc.
Xing Han Lu tweet media
English
6
127
731
73.1K
Inceptive retweetledi
Constantin 🔥
Constantin 🔥@constantout·
"I built a SAAS in 3 days with 0 coding knowledge using AI. Developers are obsolete." Their project:
English
267
2.5K
28.4K
1.3M
Inceptive retweetledi
Philipp Schmid
Philipp Schmid@_philschmid·
Mini-R1: Reproduce @deepseek_ai R1 „aha moment“ a RL tutorial! Recreate an RL "aha moment" using Group Relative Policy Optimization (GRPO) and train an open model using reinforcement learning to teach it self-verification and search abilities all on its own to solve the Countdown Game. TL;DR: 🤯 DeepSeek R1's "aha moment" demonstrates RL's potential for self-improvement in LLMs. 2️⃣ Using 2 reward functions, 1x for format (,) and 1x for correctness 🤖 Qwen2.5-3B-Instruct model learns self-verification and search abilities. ⚙️ Use @MSFTDeepSpeed and @vllm_project for efficient and distributed online RL Training with @huggingface TRL 🤟 Include Training Observations and Hyperparameter improvements 🧮 Uses Countdown Game (arithmetic puzzles) to teach models self-correction via and tags 📊 Achieves 50% success rate after 450 training steps on 4x H100 GPUs ⚡ Training takes ~6 hours on 4x H100 GPUs for 450 steps
Philipp Schmid tweet media
English
30
150
812
77.2K
Inceptive retweetledi
Hugging Face
Hugging Face@huggingface·
The Open Source community is amazing 🤗
English
18
44
469
59.7K
Inceptive retweetledi
hubert guillaud
hubert guillaud@hubertguillaud·
Excellente histoire du Robots.txt, ce fichier qui autorise ou non l'indexation. Excellente mise en perspective qui explique pourquoi beaucoup souhaitent interdire l'indexation pour l'IA, mais peine à identifier les robots dédiés au-delà de GPTbot : theverge.com/24067997/robot…
Français
2
10
8
855
Julien Chaumond
Julien Chaumond@julien_c·
Llama 2 (70B) just landed in HuggingChat💬 This is the largest running version of the model from @MetaAI, running on fast optimized inference on @huggingface infra. Unleash the llamas! 🦙🦙 Try it out now hf.co/chat
Julien Chaumond tweet media
English
10
86
359
190.9K
Inceptive retweetledi
Yann LeCun
Yann LeCun@ylecun·
This is huge: Llama-v2 is open source, with a license that authorizes commercial use! This is going to change the landscape of the LLM market. Llama-v2 is available on Microsoft Azure and will be available on AWS, Hugging Face and other providers Pretrained and fine-tuned models are available with 7B, 13B and 70B parameters. Llama-2 website: ai.meta.com/llama/ Llama-2 paper: ai.meta.com/research/publi… A number of personalities from industry and academia have endorsed our open source approach: about.fb.com/news/2023/07/l…
English
384
3.4K
14.9K
4.3M
Inceptive retweetledi
Olivier Tesquet
Olivier Tesquet@oliviertesquet·
⚠️Quelques réflexions à chaud sur l'inquiétante proposition de loi installant la reconnaissance faciale en temps réel dans l'espace public, adoptée hier par le Sénat. Bienvenue dans le contrôle d'identité permanent et général. telerama.fr/debats-reporta…
Français
7
425
510
99.1K
Inceptive
Inceptive@inceptive_tech·
Nous vous invitons à explorer l'univers fascinant des #LLM, ces technologies changent notre accès à l'information et nos méthodes de recherche. Une occasion de découvrir comment vous pouvez contrôler vos données tout en profitant des avantages de l'#IA. app.livestorm.co/inceptive/reto…
Français
0
1
0
200
Inceptive
Inceptive@inceptive_tech·
Inceptive vous invite au webinaire sur les Grands Modèles de Langage (LLMs) le 13 juin à 10h ! Découvrez les alternatives à OpenAI, explorez la transformation de l'accès à l'information grâce à ces technologies. app.livestorm.co/inceptive/reto…
Français
0
0
0
27
Inceptive
Inceptive@inceptive_tech·
Rejoignez Inceptive & @TENERRDIS le 17/05 à 10h pour un webinaire sur l'IA dans le secteur de l'énergie! Découvrez l'état de l'IA en 2023, les opportunités et méthodologies pour intégrer l'IA dans les processus métier. Inscrivez-vous maintenant. tenerrdis.fr/fr/evenements/…
Français
0
0
0
52
Inceptive retweetledi
David GAL-REGNIEZ
David GAL-REGNIEZ@dgrpro·
Minalogic partenaire de l’événement vous invite à découvrir la dynamique du spatial français et à en devenir acteur. 2 jours intenses a ne pas rater. Préambule aussi à Grenoble avec le CNES le 16 mai pour découvrir la richesse de l’écosystème de la Région…lnkd.in/euqSVHqf
Français
0
1
0
95