Tianlong Xu

12 posts

Tianlong Xu

Tianlong Xu

@txu0915

Staff Applied Scientist @ Squirrel AI | Ex. Goldman Sachs

Seattle Katılım Ocak 2015
218 Takip Edilen34 Takipçiler
Tianlong Xu
Tianlong Xu@txu0915·
@_philschmid Would this be easily scalable to multi-modal case? I am wondering if we could see such aha moments in image understanding by teaching it to think deep.
English
0
0
0
17
Philipp Schmid
Philipp Schmid@_philschmid·
Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial! Recreate an RL "aha moment" using Group Relative Policy Optimization (GRPO) and train an open model using reinforcement learning to teach it self-verification and search abilities all on its own to solve the Countdown Game. philschmid.de/mini-deepseek-…
Philipp Schmid tweet media
English
9
36
215
11.9K
Tianlong Xu
Tianlong Xu@txu0915·
@jiayi_pirate Hi Jiayi, this is really inspiring - is there a multi-modal version out there, or do you plan to create something soon?
English
0
0
0
119
Jiayi Pan
Jiayi Pan@jiayi_pirate·
We reproduced DeepSeek R1-Zero in the CountDown game, and it just works Through RL, the 3B base LM develops self-verification and search abilities all on its own You can experience the Ahah moment yourself for < $30 Code: github.com/Jiayi-Pan/Tiny… Here's what we learned 🧵
Jiayi Pan tweet media
English
192
1.2K
6.3K
1.7M
Tianlong Xu
Tianlong Xu@txu0915·
As a quick snippet of the ideas in these two papers: one is to use LLMs to detect the root cause of a student who has made an error from their digital drafts. And the other is to use multi-agents to perform alignments between math questions and knowledge concepts.
Tianlong Xu tweet media
English
0
0
0
627
Tianlong Xu
Tianlong Xu@txu0915·
Check out our two papers accepted by #AAAI2025. "AI-Driven Virtual Teacher for Enhanced Educational Efficiency: Leveraging Large Pretrain Models for Autonomous Error Analysis and Correction" and "Knowledge Tagging with Large Language Model based Multi-Agent System" .
English
3
0
0
2.6K
Tianlong Xu retweetledi
nature
nature@Nature·
Fifty years ago, Apollo astronauts first landed on the Moon. What better way to celebrate than constructing tiny versions of some of the mission's iconic vehicles and scientists? Sign up for Nature Briefing for a chance to win (current readers can enter and win too) 🚀
English
42
1.8K
344
0