ControlWiz

134 posts

ControlWiz

@Control_wiz

Building & thinking out loud

加入时间 Nisan 2026

31 关注8.3K 粉丝

ControlWiz 已转推

Junyao Shi@JunyaoShi·19h

Basically you pick your poison in terms of where you want to inject human inductive bias in the loop: directly at low level (imitation) or at some abstraction level (reward shaping, initialization etc.). There are arguments for either to be more bitter lesson pilled. One bitter lesson for RL in robotics manipulation is: it is actually often less scalable due to the amount of task-specific and environment specific tuning of each parameter to get anything working.

Yunlong Song@realyunlong

Most robotics RL paper is often just imitation learning in disguise. The "human expert" transfer task through extensive reward shaping, curricula, initialization strategies, environment design, and various tricks. You are providing demonstrations--just indirectly. A reward function is just a demonstration written in a different language.

English

2.5K

ControlWiz@Control_wiz·17h

@yacineMTB This would be an issue for open loop unstable systems or if you have fast disturbances/uncertainties.

English

kache@yacineMTB·17h

I wonder if anyone has ever let a policy decide how much time it wants to wait before making its next action

English

111

9.9K

ControlWiz@Control_wiz·21h

@lymytom20 I haven't followed more recent work, but Drone racing and RL locomotion have been established since ~2018. Locomotion on relatively structured terrain is a fairly mature problem, and MPC variants have also shown strong results. Backflips are cool demos, but nothing more.

English

Tommy Ly@lymytom20·1d

@Control_wiz What's your take on all the recent papers using RL for locomotion - from drone races to Unitree backflip?

English

ControlWiz@Control_wiz·9 Haz

Always do MPC first. A lot of useful problem can be mapped to an MPC. RL is inefficient.

English

1.7K

ControlWiz 已转推

bradford@LusciousPear·1d

Pixels are not enough. Robots need a realtime learning loop too, but it needs to use kinematic and sensor data from the real world. since that's where robots actually operate.

English

846

ControlWiz 已转推

inControl podcast@inControlpdcst·2d

New episode! 🎙️Peter Caines (@mcgillu) takes us through six decades of control, from system identification to adaptive control to the birth of mean field games, all infused with the restless spirit of the 1960s: Coltrane, Kerouac, and even hitch-hiking all the way to Istanbul!👇

English

371

ControlWiz 已转推

Naval@naval·3d

Science is not a process, a credential, or an institution. It is the unflinching pursuit of truth, carried out by the few, co-opted by the many.

English

766

2.2K

14.5K

ControlWiz 已转推

Baxate@Baxate·6d

friendly reminder that things get way less scary when you understand how to use them and even less so when you understand how they work and it serves the dual purpose of making you more resilient to people manipulating that fear to progress their own agenda

English

2.8K

ControlWiz@Control_wiz·3d

AGI doesn’t have a set definition, let alone be anywhere on the horizon. Treating it as an inevitable trillion dollar superweapon is science fiction and only adds fear and confusion.

gabriel@gabriel1

agi is the most economically valuable asset of all time, there will be trillions in free market capital put into it this is extremely unlike the manhattan project. this time, governments can only cooperate. we can't just pick a winner, or that winner will lose

English

1.1K

ControlWiz 已转推

Mathieu@miniapeur·5d

Being a PhD student is hard for many reasons. On paper, the flexibility can seem like an advantage: you often do not have a fixed schedule or many strict deadlines. In practice, however, that same freedom can become a trap. Since your progress depends so much on your own initiative, it is easy to convince yourself that every extra hour of work is an investment in your future. For students who are already inclined to overwork, this can blur the line between dedication and self-exploitation, eventually leading to burnout.

English

381

27.5K

ControlWiz@Control_wiz·3d

VLA based models can only interpolate over the support of its training/pre-training distribution. To become a generalist, it would need training coverage dense enough to span an enormous (possibly unbounded) task space, which is not scalable. A generalist needs to learn on the job and plan accordingly.

English

2.8K

ControlWiz@Control_wiz·4d

@Lavnir_ 😂

QME

Lavnir@Lavnir_·4d

@Control_wiz Just prompt the controller with be an expert driver don’t crash lol

English

ControlWiz@Control_wiz·4d

I have to stop arguing on here.

Sparr@sparr0

@Control_wiz What work? This wouldn't even warrant a publication. The implementation would just be things like "don't ever follow closely enough that them braking could cause a collision", "don't ever enter their turning radius", etc. all the worst case bounds for any more-informed simulation

English

1.6K

ControlWiz 已转推

Deb Raji@rajiinio·5d

I do not want to do AI research that is reactive to what these companies are doing, or even what they're saying. The entire field keeps chasing after product releases. Some spend more time reading marketing copy than their colleagues papers and I just... do not want to do that?

English

127

7.7K

ControlWiz@Control_wiz·4d

@sparr0 Never heard of such work, and highly doubt it exist. Share references.

English

188

Sparr@sparr0·4d

@Control_wiz You can predict their entire navigational envelope and avoid it completely.

English

143

ControlWiz@Control_wiz·5d

In multi-agent planning you react and replan continuously using your belief states at 30–50 Hz. If the behavior models of the other agents are known, many methods exist. When they are unknown, you need some predictive model, and collision avoidance still cannot be guaranteed without strong assumptions. I’m not aware of a general method that guarantees safe behavior without these, please share if you do.

Sparr@sparr0

@Control_wiz @ChenTessler No, you don't need some assumption about how other agents will act. It is possible to drive such that you always have an escape route. To never follow so closely that you can't stop or evade if they brake. To never cross ahead of someone that might speed up.

English

1.7K

ControlWiz 已转推

Fei-Fei Li@drfeifei·6d

Scientific research is fundamental to advancing civilization and helping people globally to solve the most critical problems, from medicine to materials, from brain science to physics, and much beyond. This is only possible when scientists have access to the best tools of the time to conduct scientific research, including having access to AI-based tools.

English

120

471

3.1K

192.9K

ControlWiz@Control_wiz·5d

@ChenTessler AVs also perform worse with bad weather , road debris, and mixed traffic. In normal driving conditions, they’re often better than the average human driver.

English

Chen Tessler@ChenTessler·5d

Isn't the fact that humans are shitty drivers that cross red lights seconds after it's switched to red, kind of the proof this isn't globally accurate? So if we agree (some) humans are shitty drivers with bad internal models and inability to predict. Then the goal is to be better than the median/mean and that's already a big improvement.

English

ControlWiz@Control_wiz·5d

Idk if human behavior is solvable. No car will likely reach level 5 autonomy, as long as there are human drivers.

alex p.@alexpdiggs

@Control_wiz just a lil speed bump.

English

3.4K

ControlWiz@Control_wiz·5d

@ChenTessler I agree. Never claimed AVs are not better than humans. My point was we don't have full autonomy because of humans on the road.

English

ControlWiz@Control_wiz·5d

@diakou What article? I am not following.

English

Diakou@diakou·5d

@Control_wiz article when? (comparing current reality of robotics in china, US / west) etc?

English

140

ControlWiz@Control_wiz·10 Haz

One of my goals with this account is to share my honest thoughts on where I think the frontier of robotics actually is. I might sound like a pessimist, but I think the field deserves realism. I’m happy robotics is getting all this attention, but I also have to admit that a lot of what we see in both industry and academia is exaggerated. Curated demos make the field seem much more advanced than it really is. I’ll share my thoughts, for what they’re worth.

English

1.4K

288.6K

发现

@yacineMTB @lymytom20 @mcgillu @Lavnir_ @sparr0 @elonmusk @BarackObama @taylorswift13