Le Chen

13 posts

Le Chen banner
Le Chen

Le Chen

@clthegoat

PhD student at MPI-IS and ETH Zurich

Katılım Mayıs 2018
80 Takip Edilen131 Takipçiler
Le Chen retweetledi
Weiyang Liu
Weiyang Liu@Besteuler·
🚀 Meet OFTv2 — Orthogonal Finetuning made scalable, finally. ⚡️ 10× faster 💾 3× less GPU memory 🤖 Quantized OFT: plug-and-play on quantized LLMs, better than QLoRA Try it now on Hugging face PEFT: tinyurl.com/ycxswfe7 Website: spherelab.ai/oftv2/ #AI #LLM 🧵1/6
Weiyang Liu tweet media
English
1
4
20
3.5K
Laura Graesser
Laura Graesser@lgraesser3·
@clthegoat Congratulations Li! This is great work and I really enjoyed reading your paper.
English
1
0
1
177
Le Chen
Le Chen@clthegoat·
Learning to play the piano with two robot hands is super challenging, even in simulation! It requires coping with bimanual coordination at high speed to achieve human-level dexterity. We introduce RP1M, a large-scale robot piano-playing motion dataset, featuring ~1M trajectories over 2k music pieces. Website: rp1m.github.io Paper: arxiv.org/abs/2408.11048
English
9
36
173
26.9K
Kevin Zakka
Kevin Zakka@kevin_zakka·
@clthegoat Amazing work @clthegoat! Super happy to see you got rid of the need for fingering labels. Unlocking basically any MIDI file makes RoboPianist even more useful to the community :)
English
1
0
5
479
Le Chen
Le Chen@clthegoat·
@YagaoDirac For each RL agent, we use DroQ to train an MLP.
English
1
0
0
130
Yagao Dirac
Yagao Dirac@YagaoDirac·
@clthegoat What layer or model is used for this? 用的什么层?全连接还是tfm还是什么别的?
中文
1
0
0
231
Le Chen
Le Chen@clthegoat·
RP1M contains various piano-playing motions, including highly dynamic motions and long-distance movements. Here is an example of the song Flight of the Bumblebee:
English
0
1
7
1.2K
Le Chen
Le Chen@clthegoat·
🌟Our OT-based method: 1️⃣ Low-cost: No need for human demonstrations or annotations. 2️⃣ Cross embodiments: Supports diverse hand morphologies and robot platforms. 3️⃣ Superior performance: Allows robots to discover optimal fingering aligned with their unique morphology.
English
1
0
4
856
Le Chen retweetledi
Jan Schneider
Jan Schneider@JanS1854·
Gradient subspace optimization unlocked for RL 🔒➡️🔓 Used only for supervised learning so far, our #ICLR2024 paper illustrates that policy gradients evolve in a small, slowly-changing subspace, opening up many opportunities for more efficient RL. arxiv.org/abs/2401.06604
English
1
6
35
3.1K