Clem

24 posts

@clemhus

Data Science @ETH and @Stanford | ex @Google Gemini Intern, @EPFL, @HKUST | 22

Stanford, CA · Joined June 2024
139 Following · 92 Followers
Clem@clemhus·
From my experience, ChatGPT has been trained to aggressively ask follow-up questions, and I feel a single button tap might be a nice way to keep the conversation going.
Clem@clemhus·
A pattern I like for reducing friction with LLMs is to dynamically render quick-answer options or CTA buttons in the UI. It makes the experience smoother, especially in domain-specific settings. I haven't seen this around much; do you know of other examples?
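A minimal sketch of how the quick-answer pattern could work on the parsing side, assuming (hypothetically) that the model is prompted to reply with JSON of the form `{"reply": "...", "options": [...]}`. The schema, `AssistantTurn`, and `parse_turn` are illustrative names, not something described in the post; the UI layer would render `quick_replies` as buttons.

```python
import json
from dataclasses import dataclass, field


@dataclass
class AssistantTurn:
    text: str
    quick_replies: list[str] = field(default_factory=list)


def parse_turn(raw: str, max_options: int = 4) -> AssistantTurn:
    """Split a model reply into display text and optional button labels.

    Assumes a hypothetical JSON schema {"reply": str, "options": [str, ...]}.
    Anything that does not match falls back to plain text with no buttons.
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return AssistantTurn(text=raw.strip())
    if not isinstance(data, dict) or not isinstance(data.get("reply"), str):
        return AssistantTurn(text=raw.strip())

    options = data.get("options")
    if not isinstance(options, list):
        options = []

    # Keep non-empty, de-duplicated string labels in their original order.
    seen: set[str] = set()
    labels: list[str] = []
    for opt in options:
        if isinstance(opt, str):
            label = opt.strip()
            if label and label not in seen:
                seen.add(label)
                labels.append(label)

    return AssistantTurn(text=data["reply"].strip(), quick_replies=labels[:max_options])
```

Falling back to plain text keeps the chat usable when the model ignores the schema, which is the main failure mode to plan for with this kind of UI.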
Clem@clemhus·
@bennetkrause Yep, I agree! But it was pretty cool to see the model improve through self-judging. I used Gemini as the judge on the test set to make things fairer.
Bennet@bennetkrause·
@clemhus Cool idea! Although some people prefer using entirely different models as judges due to bias within the same model family
Clem@clemhus·
RLAIF w/ TRL+GRPO on a single GPU. Co-locate vLLM with the trainer and reuse the same base model as a judge. Policy updates via LoRA; judge runs with base weights. No extra RM server, low latency.
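A rough sketch of the judge-as-reward idea, under stated assumptions. The reward callable follows the general shape of a GRPO custom reward function (it receives prompts and completions and returns one float per completion), and plain-string prompts are assumed. `judge_generate` is a hypothetical callable standing in for a batched call into the shared vLLM engine with base weights; how that engine is wired up, and how the LoRA adapter is bypassed for the judge, is not shown here. The prompt template and the 1 to 10 scale are also illustrative.

```python
import re
from typing import Callable, Sequence

# Illustrative judge prompt; the actual wording used in the post is unknown.
JUDGE_TEMPLATE = (
    "Rate the answer from 1 to 10 for correctness. Reply with the number only.\n"
    "Question: {prompt}\nAnswer: {completion}\nScore:"
)


def parse_score(text: str, default: float = 0.0) -> float:
    """Extract the first number from the judge output and map it to (0, 1]."""
    match = re.search(r"\d+(?:\.\d+)?", text)
    if match is None:
        return default  # unparseable judge output earns the default reward
    score = float(match.group())
    return min(max(score, 1.0), 10.0) / 10.0


def make_judge_reward(
    judge_generate: Callable[[Sequence[str]], Sequence[str]],
) -> Callable[..., list[float]]:
    """Build a reward function that scores a whole rollout batch in one judge call."""

    def reward(prompts: Sequence[str], completions: Sequence[str], **kwargs) -> list[float]:
        judge_prompts = [
            JUDGE_TEMPLATE.format(prompt=p, completion=c)
            for p, c in zip(prompts, completions)
        ]
        outputs = judge_generate(judge_prompts)  # single batched call, no separate RM server
        return [parse_score(o) for o in outputs]

    return reward
```

Passing the judge in as a callable keeps the reward logic testable with a stub and independent of how the generation engine is hosted.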
Besher@mr_besher·
@clemhus very clever! hats off.
Clem@clemhus·
Does TRL expect/encourage this pattern? Any gotchas? @QGallouedec
Clem@clemhus·
For GPU/API-poor setups: reuse the same vLLM process that does GRPO rollouts to batch the judge calls. Simple + fast. Result: in a free-text medical diagnosis reasoning test, this setup on Llama-3.1-8B gave ~+14 percentage points of accuracy vs. the baseline (48→62%, n=300, single seed)
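The 48→62% figure is an absolute gain of 14 percentage points, which is not the same as a 14% relative improvement. A small sketch of the arithmetic follows; the implied counts of correct answers assume the rounded percentages are exact on n=300, which the post does not state.

```python
def accuracy_gain(before_pct: float, after_pct: float, n: int) -> dict:
    """Compare two accuracies given in percent on the same n test items."""
    absolute_points = after_pct - before_pct
    relative_percent = absolute_points / before_pct * 100
    return {
        "absolute_points": absolute_points,
        "relative_percent": relative_percent,
        # Implied counts, assuming the reported percentages are exact.
        "correct_before": round(before_pct * n / 100),
        "correct_after": round(after_pct * n / 100),
    }


gain = accuracy_gain(48, 62, 300)
print(gain)
```

With a single seed, the result is best read as indicative rather than a stable estimate.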
Clem@clemhus·
Having dinner with @garrytan wasn’t on my bingo card when I joined the Bay 2 months ago. Amazing day at @ycombinator AI Startup School. Thanks for the invite, Garry!
Maurin@schickkler·
I'm going to ycombinator AI Startup School in SF!!!!!!! Hmu if you're going, especially if you're from Switzerland/Europe
Clem@clemhus·
We won Meta's Llamacon Hackathon! Imagine engineers, PMs, or managers being able to query their org’s knowledge base in natural language:
🧠 “Who’s responsible for the normalization layer of the Llama 4 models?”
🔐 “Who’s an expert in our multi-step auth flow?”
Had a blast hacking this up. Btw, we didn't use Cursor/Windsurf. I actually believe that planning ahead and understanding the exact structure of your code is key to delivering a project you own.
Cerebral Valley@cerebral_valley

🥇 1st Place: OrgLens An AI-powered expert matching system that connects you with the right professionals within your organization. By leveraging data from various sources, OrgLens creates a comprehensive knowledge graph and detailed profiles, streamlining expert matching. @KPJedrzejewski & @clemhus

Clem@clemhus·
It was great to be invited to Meta HQ to discuss our experience working with the new Llama API. A unified way to use all models from any provider is the way to go for developers. Performance and cost rule!
Agajan Torayev@torayeff·
Last week, I was in SF and, by a lucky chance, participated in the first LlamaCon Hackathon — and surprise: I won 3rd place and a $6,000 cash prize among top devs in Silicon Valley 🎉 Huge thanks to @MetaforDevs and @cerebral_valley for hosting this great event!
Meta for Developers@MetaforDevs

We're excited to announce the winners of the first LlamaCon Hackathon! These talented individuals and teams have demonstrated exceptional skill and creativity in their projects using Llama.

🥇 1st Prize: OrgLens
An AI-powered expert matching system that connects you with the right professionals within your organization. By leveraging data from various sources, OrgLens creates a comprehensive knowledge graph and detailed profiles, streamlining expert matching. See their GitHub Repository: bit.ly/4k62WaO
@KPJedrzejewski, @clemhus

🥈 2nd Prize: Compliance Wizards
An AI-powered transaction analyzer designed to detect fraud and alert users. It uses Llama API’s multi-modality to assist fraud assessors in determining client involvement in criminal activities. See their GitHub Repository: bit.ly/3RRxiBS
@SamDc73, @k_a__reem, @nicetomeetyu2, @sorhanft

🥉 3rd Prize: Llama CCTV Operator
A Llama CCTV AI control room operator that identifies custom surveillance video events without model fine-tuning. It uses Llama 4’s multi-modal image understanding to assess and report predefined events. See their GitHub Repository: bit.ly/4d9UPrw
@torayeff

🌟 Best Llama API Usage: Geo-ML
This project uses Llama 4 Maverick and GemPy to generate 3D geological models, processing extensive geology reports into structured data for 3D representations. See their GitHub Repository: bit.ly/3GITT15
@WilliamJSDavis

Please join us in congratulating these winners on their outstanding achievements! We're honored to have them as part of the Llama community. 🎉

Clem@clemhus·
@gdb Love this. We hacked on something similar at Llamacon, mapping GitHub repos into a knowledge graph and layering LLMs on top. Super powerful direction.
Clem@clemhus·
@MetaforDevs was fun cooking with the new Llama API 🦙
Clem reposted
Meta for Developers@MetaforDevs·
We're excited to announce the winners of the first LlamaCon Hackathon! These talented individuals and teams have demonstrated exceptional skill and creativity in their projects using Llama.

🥇 1st Prize: OrgLens
An AI-powered expert matching system that connects you with the right professionals within your organization. By leveraging data from various sources, OrgLens creates a comprehensive knowledge graph and detailed profiles, streamlining expert matching. See their GitHub Repository: bit.ly/4k62WaO
@KPJedrzejewski, @clemhus

🥈 2nd Prize: Compliance Wizards
An AI-powered transaction analyzer designed to detect fraud and alert users. It uses Llama API’s multi-modality to assist fraud assessors in determining client involvement in criminal activities. See their GitHub Repository: bit.ly/3RRxiBS
@SamDc73, @k_a__reem, @nicetomeetyu2, @sorhanft

🥉 3rd Prize: Llama CCTV Operator
A Llama CCTV AI control room operator that identifies custom surveillance video events without model fine-tuning. It uses Llama 4’s multi-modal image understanding to assess and report predefined events. See their GitHub Repository: bit.ly/4d9UPrw
@torayeff

🌟 Best Llama API Usage: Geo-ML
This project uses Llama 4 Maverick and GemPy to generate 3D geological models, processing extensive geology reports into structured data for 3D representations. See their GitHub Repository: bit.ly/3GITT15
@WilliamJSDavis

Please join us in congratulating these winners on their outstanding achievements! We're honored to have them as part of the Llama community. 🎉
Cerebral Valley@cerebral_valley·
SF's LlamaCon Hackathon just ended — here are the winning teams from the 600 hackers that applied for a chance at $35,000 from @MetaforDevs (save for project inspiration later 🧵):