Ruiqi Wang (@RuiqisNotes) - Профиль Twitter

Закреплённый твит

Ruiqi Wang@RuiqisNotes·17 Haz

🎉 Excited to share our new work: 👓Ego-R1! We cracked ultra-long egocentric video reasoning! 🤯 Think days/weeks of footage processed efficiently with Chain-of-Tool-Thought ⛓️🔧 🌐 egolife-ai.github.io/Ego-R1

Shulin Tian@shulin_tian

🎥 Video is already a tough modality for reasoning. Egocentric video? Even tougher! It is longer, messier, and harder. 💡 How do we tackle these extremely long, information-dense sequences without exhausting GPU memory or hitting API limits? We introduce 👓Ego-R1: A framework for reasoning over ultra-long (i.e., in days and weeks) egocentric videos, with the support from Chain-of-Tool-Thought (CoTT) that decomposes complex reasoning tasks into modular steps. At its core is Ego-R1-Agent-3B, an orchestrating language model trained to dynamically invoke specialized tools at each step, based on the previous actions and observations, to collect the necessary information and solve the tasks gradually, step-by-step. All code and data are fully open-sourced :) 🌐 Project: egolife-ai.github.io/Ego-R1 📄 Paper: arxiv.org/abs/2506.13654 💻 Code: github.com/egolife-ai/Ego…

English

1

8

27

3.1K

Ruiqi Wang ретвитнул

田中義弘 | taziku CEO / AI × Creative@taziku_co·18 Haz

1週間分の一人称映像を“人間のように”理解するAI。 Ego-R1は、複雑な行動を「小さなステップ」に分解し、その都度ツールを使って推論する新型フレームワーク。強化学習で訓練されたエージェントが、映像の意味を“考えながら解く”。プロジェクトページなどは🧵から

日本語

1

3

17

2.9K

Ruiqi Wang ретвитнул

Ziwei Liu@liuziwei7·18 Haz

🎬Ultra-Long Egocentric Video Reasoning🎬 😎Ego-R1😎 reasons over ultra-long (i.e., in weeks) egocentric videos, which leverages a **Chain-of-Tool-Thought (CoTT)** process trained via reinforcement learning (RL) - Project: egolife-ai.github.io/Ego-R1/ - Code: github.com/egolife-ai/Ego…

AK@_akhaliq

Ego-R1 Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

English

0

15

83

7.3K

Ruiqi Wang ретвитнул

AI Bites | YouTube Channel@ai_bites·17 Haz

Ego-R1, a novel framework for reasoning over ultra-long (i.e.,in days and weeks) egocentric videos, which leverages a structured Chain-of-Tool-Thought (CoTT) process, orchestrated by an Ego-R1 Agent trained via reinforcement learning (RL). Inspired by human problem-solving strategies, CoTT decomposes complex reasoning into modular steps, with the RL agent invoking specific tools, one per step, to iteratively and collaboratively answer sub-question stackling such tasks as temporal retrieval and multi-modal understanding. Paper Title: Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning Project: egolife-ai.github.io/Ego-R1/ Link: arxiv.org/abs/2506.13654 #AI #video #GenerativeAI #AIイケメン部

English

0

3

7

1.1K

Ruiqi Wang ретвитнул

AK@_akhaliq·17 Haz

Ego-R1 Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

English

4

39

167

29.1K

Ruiqi Wang@RuiqisNotes·17 Haz

This work was done during my visit at MMlab@NTU! 🇸🇬✨Special thanks to Prof Ziwei Liu @liuziwei7 for the amazing opportunity and to my supervisor Prof Richard Zhang @richardzhangsfu for his unconditional support! 🙏Grateful for such incredible mentorship that made this possible!

English

0

7

192

Ruiqi Wang@RuiqisNotes·17 Haz

🎉 Excited to share our new work: 👓Ego-R1! We cracked ultra-long egocentric video reasoning! 🤯 Think days/weeks of footage processed efficiently with Chain-of-Tool-Thought ⛓️🔧 🌐 egolife-ai.github.io/Ego-R1

Shulin Tian@shulin_tian

🎥 Video is already a tough modality for reasoning. Egocentric video? Even tougher! It is longer, messier, and harder. 💡 How do we tackle these extremely long, information-dense sequences without exhausting GPU memory or hitting API limits? We introduce 👓Ego-R1: A framework for reasoning over ultra-long (i.e., in days and weeks) egocentric videos, with the support from Chain-of-Tool-Thought (CoTT) that decomposes complex reasoning tasks into modular steps. At its core is Ego-R1-Agent-3B, an orchestrating language model trained to dynamically invoke specialized tools at each step, based on the previous actions and observations, to collect the necessary information and solve the tasks gradually, step-by-step. All code and data are fully open-sourced :) 🌐 Project: egolife-ai.github.io/Ego-R1 📄 Paper: arxiv.org/abs/2506.13654 💻 Code: github.com/egolife-ai/Ego…

English

1

8

27

3.1K

Ruiqi Wang ретвитнул

Arthur Douillard@Ar_Douillard·17 Haz

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning Reasoning over weeks-long video, with a mix of CoT and using tools

English

1

8

25

2.3K

Ruiqi Wang@RuiqisNotes·3 Eki

We introduce the first active learning (AL) framework for high-accuracy instance segmentation of moveable parts from RGB images of real indoor scenes with a dataset contribution. @FenggenYu

English

0

292

Ruiqi Wang@RuiqisNotes·3 Eki

Learning part motions from real images is challenging due to data limitation. Today at #ECCV2024 (10/3, 4:30PM, #41) I will present our work, Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images. @richardzhangsfu

English

1

5

11

1.3K

Ruiqi Wang

Открыть