Antoine Lizée @A_Lizee
Engineering, data, ai. Family stuff. Random thoughts.
Joined December 2012 · 270 Following · 130 Followers
201 posts
Senator Eric Schmitt@SenEricSchmitt·
The Big Beautiful Bill kicks 1.4 MILLION illegal immigrants off Medicaid. For too long, Americans have been paying for the welfare of people who shouldn't even be in our country. Today, the Senate voted to end that. And yes—this DID make it into the final draft of the bill. 🧵
4.7K replies · 13.2K reposts · 83.7K likes · 6.6M views
Antoine Lizée@A_Lizee·
Ppl ask why Claude Code > Cursor. As an IDE > vim guy, I was surprised too. My take after 1 month:
- CC is linear, takes less mental load. You're in the driver's seat but it's on autopilot. Much easier to follow.
- Cursor's edit models are sh*t and will introduce random changes that you don't need.
- Cursor's "accept" flow is broken, and gets you confused fast.
- CC has planning mode, which forces you to frame your needs well and puts the agent on a good track.
I personally don't think CC is definitively > Cursor, and still go back to Cursor often. And to my good ol' PyCharm too!!
0 replies · 0 reposts · 0 likes · 84 views
Antoine Lizée@A_Lizee·
Coding with AI makes me want more than 88 characters per line
0 replies · 0 reposts · 0 likes · 123 views
Antoine Lizée@A_Lizee·
@_dlangston Thank you :-) The findings are in the thread and the paper linked below!
0 replies · 0 reposts · 0 likes · 87 views
Antoine Lizée@A_Lizee·
🧵 Is AI ready for patients? Today we're publishing the first-ever large-scale study of conversational medical AI in real-world conditions. Meet Mo, our AI medical assistant, deployed in our medical advice chat with GPs. A thread on what we learned 👇
2 replies · 13 reposts · 24 likes · 7.7K views
Pliny the Liberator 🐉
Say hello to Mini Pliny! 🤗 I fine-tuned gpt-4o on my archive and the results are pretty much what one might expect lol. Loaded up a healthy dose of API credits so you all can talk to the lil fucker––first come, first served til the tokens are used up. Have fun and please share your favorite outputs below! mini-pliny.streamlit.app Be warned: EXTREME levels of sass 😘 #MiniusPlinius
81 replies · 26 reposts · 377 likes · 61.3K views
Rohan Paul@rohanpaul_ai·
Training procedure of Chain of Continuous Thought (Coconut)

Start (Language CoT): The model begins with normal training data where reasoning is expressed in language steps - like [Step 1], [Step 2], etc.

Progressive training stages:
- Stage 0: Introduces the <bot> and <eot> tokens but still uses language steps
- Stage 1: Replaces the first language step with a continuous thought
- Stage 2: Adds another continuous thought, removing another language step
- This continues until Stage N, where all language steps are replaced with continuous thoughts

Think of it like teaching someone to ride a bike:
1. First they use training wheels (all language steps)
2. Then gradually remove one training wheel (replace one step with a continuous thought)
3. Keep removing support until they can ride freely (all continuous thoughts)

The model calculates loss only on the remaining language tokens after the continuous thoughts, helping it learn to use these direct neural pathways effectively. The parameter c in the figure shows how many continuous thoughts replace each language step - in this example, c=1 means one continuous thought per step.
[attached image: Coconut training-stages figure]
1 reply · 2 reposts · 7 likes · 2.3K views
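For concreteness, here is a minimal sketch of the staged curriculum the tweet above describes. The <bot>/<eot> token names come from the Coconut paper; the <thought> placeholder and all function and variable names are illustrative, not the authors' released code.

```python
def build_stage_example(question, steps, answer, stage, c=1):
    """At stage k, the first k language steps are replaced by k*c
    continuous-thought slots; loss covers only the remaining language tokens."""
    n_latent = stage * c          # continuous-thought slots inserted so far
    kept_steps = steps[stage:]    # language steps not yet replaced
    tokens = (
        [question, "<bot>"] + ["<thought>"] * n_latent + ["<eot>"]
        + kept_steps + [answer]
    )
    # Supervise only the tokens after <eot>: the kept steps and the answer.
    loss_mask = [0] * (3 + n_latent) + [1] * (len(kept_steps) + 1)
    return tokens, loss_mask

# Stage 0 keeps all language steps; stage == len(steps) is fully latent.
ex, mask = build_stage_example(
    "Q: 2+3*4?", ["[Step 1] 3*4=12", "[Step 2] 2+12=14"], "A: 14", stage=1
)
```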
Rohan Paul@rohanpaul_ai·
Brilliant paper from @Meta with the potential to significantly boost LLMs' reasoning power. Why force AI to explain in English when it can think directly in neural patterns? Imagine if your brain could skip words and share thoughts directly - that's what this paper achieves for AI. By skipping the word-generation step, LLMs can explore multiple reasoning paths simultaneously.

Introduces Coconut (Chain of Continuous Thought), enabling LLMs to reason in a continuous latent space rather than through word tokens, leading to more efficient and powerful reasoning capabilities.

🧠 The key solution in this paper

Current LLMs are constrained by having to express their reasoning through language tokens, where most tokens serve textual coherence rather than actual reasoning. So this paper proposes a novel solution: instead of decoding the hidden state into word tokens, it is directly fed back as the next input embedding in a continuous space.

Let me explain the mechanism simply. In normal LLMs, when the model thinks, it has to:
1. Convert its internal neural state into actual words
2. Then convert those words back into neural patterns to continue thinking

What Coconut does instead: it directly takes the neural patterns (hidden state) from one thinking step and feeds them into the next step - no conversion to words needed. It's like letting the model's thoughts flow directly from one step to the next in their raw neural form.

Think of it like this: instead of having to write down your thoughts on paper and then read them back to continue thinking (like regular LLMs do), Coconut lets the model's thoughts continue flowing naturally in their original neural format. This is more efficient and lets the model explore multiple possible thought paths at once.

-----

The method uses special tokens <bot> and <eot> to mark latent reasoning segments, and employs a multi-stage training curriculum that gradually replaces language reasoning steps with continuous thoughts.

Key insights of the paper:
→ Coconut achieves 34.1% accuracy on GSM8k math problems, outperforming baseline Chain-of-Thought (30.0%)
→ The continuous space enables parallel exploration of multiple reasoning paths, similar to breadth-first search
→ Performance improves with more continuous thoughts per reasoning step, showing effective chaining capability
→ Latent reasoning excels in tasks requiring extensive planning, with 97% accuracy on logical reasoning (ProsQA)
63 replies · 305 reposts · 2.1K likes · 207.4K views
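To make the hidden-state feedback loop concrete, here is a minimal inference-time sketch assuming a Hugging Face causal LM. The prompt and `n_thoughts` are illustrative, and this approximates the mechanism described in the tweet rather than reproducing Meta's code.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

input_ids = tok("Question: ...", return_tensors="pt").input_ids
embeds = model.get_input_embeddings()(input_ids)

n_thoughts = 3  # number of continuous thoughts to chain
with torch.no_grad():
    for _ in range(n_thoughts):
        out = model(inputs_embeds=embeds, output_hidden_states=True)
        # The final hidden state at the last position is fed straight back in
        # as the next input embedding -- no decoding to a word token between steps.
        next_embed = out.hidden_states[-1][:, -1:, :]
        embeds = torch.cat([embeds, next_embed], dim=1)

    # After the latent segment, decode normally to answer in words.
    logits = model(inputs_embeds=embeds).logits
    next_token_id = logits[:, -1, :].argmax(dim=-1)
print(tok.decode(next_token_id.tolist()))
```

This works dimensionally because GPT-2's hidden size equals its embedding size; the paper's point is that each latent step carries the full hidden state forward instead of collapsing it to a single token.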
Google AI@GoogleAI·
We present our exploration into using the Articulate Medical Intelligence Explorer (AMIE) for sub-specialist medicine applications, including complex cardiomyopathies and breast cancer, and a new partnership for safe, prospective real-world validation goo.gle/3VzJMAf
8 replies · 59 reposts · 234 likes · 28.4K views
Antoine Lizée@A_Lizee·
@pash22 Thank you for sharing! Glad you found our research useful.
0 replies · 0 reposts · 1 like · 11 views
Antoine Lizée@A_Lizee·
@ADarmouni @Gorintic @avec_alan Yes. Longer answer: sourcing doesn't really matter in convos; few people use it. Building our own models (further training & fine-tuning) to improve parts of the system is definitely a strategy we've taken. RAG is important in doing so.
1 reply · 0 reposts · 1 like · 107 views
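For readers unfamiliar with the RAG pattern mentioned above, a purely illustrative retrieve-then-generate sketch. The corpus contents, the toy hash-based embed() stand-in for a trained encoder, and all names are made up; this is not Alan's system.

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    # Toy deterministic "embedding"; a real system uses a trained encoder.
    rng = np.random.default_rng(abs(hash(text)) % 2**32)
    v = rng.standard_normal(64)
    return v / np.linalg.norm(v)

corpus = ["passage on dosing ...", "passage on red-flag symptoms ..."]
corpus_vecs = np.stack([embed(p) for p in corpus])

def retrieve(query: str, k: int = 1) -> list[str]:
    scores = corpus_vecs @ embed(query)  # cosine similarity (unit vectors)
    return [corpus[i] for i in np.argsort(scores)[::-1][:k]]

# Retrieved passages are prepended to the prompt the model answers from.
prompt = "References:\n" + "\n".join(retrieve("dosing question")) + "\n\nAnswer:"
```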
Axel Darmouni@ADarmouni·
@Gorintic @avec_alan @A_Lizee After the read, curious: not sure if it's already done, but if it isn't: any plans on connecting it with textbooks or vetted medicine websites to source advice? If so, RAG or fine-tuning? Feeling like the latency of Mo is so good that a slight delay could be added there
1 reply · 0 reposts · 0 likes · 23 views
Charles Gorintin@Gorintic·
AI medical assistants are ready for practice. That's what we're showing with Mo @avec_alan. We just shared our preprint of the first large-scale study of conversational medical AI in real-world conditions.
3 replies · 6 reposts · 27 likes · 1.8K views
Antoine Lizée@A_Lizee·
Read the research paper for:
- Detailed methodology
- Safety protocols
- Real-world deployment learnings
- Future research priorities

Check it out here: arxiv.org/abs/2411.12808
1 reply · 1 repost · 2 likes · 373 views