ControlWiz

136 posts

ControlWiz

@Control_wiz

Building & thinking out loud

شامل ہوئے Nisan 2026

32 فالونگ8.3K فالوورز

ControlWiz@Control_wiz·5h

Useful specialized robots already exist in many homes. A robot that can do everything a human can do is likely not happening in our lifetime. The goal isn’t to replicate humans. It’s to build robots that help humans solve problems. Legged humanoids that look like humans are not the best solution for many applications.

RoboStrategy@RoboStrategy

How soon until humanoid robots are part of everyday life? CEO Andrew Kang (@Rewkang) gives his thesis:

English

994

ControlWiz@Control_wiz·6h

Great talk by @JitendraMalikCV. youtu.be/HvDU7Vk4pbc?is…

YouTube

English

430

ControlWiz ری ٹویٹ کیا

Junyao Shi@JunyaoShi·1d

Basically you pick your poison in terms of where you want to inject human inductive bias in the loop: directly at low level (imitation) or at some abstraction level (reward shaping, initialization etc.). There are arguments for either to be more bitter lesson pilled. One bitter lesson for RL in robotics manipulation is: it is actually often less scalable due to the amount of task-specific and environment specific tuning of each parameter to get anything working.

Yunlong Song@realyunlong

Most robotics RL paper is often just imitation learning in disguise. The "human expert" transfer task through extensive reward shaping, curricula, initialization strategies, environment design, and various tricks. You are providing demonstrations--just indirectly. A reward function is just a demonstration written in a different language.

English

2.9K

ControlWiz@Control_wiz·1d

@yacineMTB This would be an issue for open loop unstable systems or if you have fast disturbances/uncertainties.

English

1.1K

kache@yacineMTB·1d

I wonder if anyone has ever let a policy decide how much time it wants to wait before making its next action

English

119

10.5K

ControlWiz@Control_wiz·1d

@lymytom20 I haven't followed more recent work, but Drone racing and RL locomotion have been established since ~2018. Locomotion on relatively structured terrain is a fairly mature problem, and MPC variants have also shown strong results. Backflips are cool demos, but nothing more.

English

Tommy Ly@lymytom20·1d

@Control_wiz What's your take on all the recent papers using RL for locomotion - from drone races to Unitree backflip?

English

ControlWiz@Control_wiz·9 Haz

Always do MPC first. A lot of useful problem can be mapped to an MPC. RL is inefficient.

English

1.7K

ControlWiz ری ٹویٹ کیا

bradford@LusciousPear·2d

Pixels are not enough. Robots need a realtime learning loop too, but it needs to use kinematic and sensor data from the real world. since that's where robots actually operate.

English

861

ControlWiz ری ٹویٹ کیا

inControl podcast@inControlpdcst·2d

New episode! 🎙️Peter Caines (@mcgillu) takes us through six decades of control, from system identification to adaptive control to the birth of mean field games, all infused with the restless spirit of the 1960s: Coltrane, Kerouac, and even hitch-hiking all the way to Istanbul!👇

English

398

ControlWiz ری ٹویٹ کیا

Naval@naval·3d

Science is not a process, a credential, or an institution. It is the unflinching pursuit of truth, carried out by the few, co-opted by the many.

English

766

2.3K

14.5K

ControlWiz ری ٹویٹ کیا

Baxate@Baxate·6d

friendly reminder that things get way less scary when you understand how to use them and even less so when you understand how they work and it serves the dual purpose of making you more resilient to people manipulating that fear to progress their own agenda

English

2.8K

ControlWiz@Control_wiz·3d

AGI doesn’t have a set definition, let alone be anywhere on the horizon. Treating it as an inevitable trillion dollar superweapon is science fiction and only adds fear and confusion.

gabriel@gabriel1

agi is the most economically valuable asset of all time, there will be trillions in free market capital put into it this is extremely unlike the manhattan project. this time, governments can only cooperate. we can't just pick a winner, or that winner will lose

English

1.1K

ControlWiz ری ٹویٹ کیا

Mathieu@miniapeur·5d

Being a PhD student is hard for many reasons. On paper, the flexibility can seem like an advantage: you often do not have a fixed schedule or many strict deadlines. In practice, however, that same freedom can become a trap. Since your progress depends so much on your own initiative, it is easy to convince yourself that every extra hour of work is an investment in your future. For students who are already inclined to overwork, this can blur the line between dedication and self-exploitation, eventually leading to burnout.

English

381

27.6K

ControlWiz@Control_wiz·4d

VLA based models can only interpolate over the support of its training/pre-training distribution. To become a generalist, it would need training coverage dense enough to span an enormous (possibly unbounded) task space, which is not scalable. A generalist needs to learn on the job and plan accordingly.

English

2.8K

ControlWiz@Control_wiz·5d

@Lavnir_ 😂

QME

Lavnir@Lavnir_·5d

@Control_wiz Just prompt the controller with be an expert driver don’t crash lol

English

ControlWiz@Control_wiz·5d

I have to stop arguing on here.

Sparr@sparr0

@Control_wiz What work? This wouldn't even warrant a publication. The implementation would just be things like "don't ever follow closely enough that them braking could cause a collision", "don't ever enter their turning radius", etc. all the worst case bounds for any more-informed simulation

English

1.6K

ControlWiz ری ٹویٹ کیا

Deb Raji@rajiinio·5d

I do not want to do AI research that is reactive to what these companies are doing, or even what they're saying. The entire field keeps chasing after product releases. Some spend more time reading marketing copy than their colleagues papers and I just... do not want to do that?

English

127

7.7K

ControlWiz@Control_wiz·5d

@sparr0 Never heard of such work, and highly doubt it exist. Share references.

English

190

Sparr@sparr0·5d

@Control_wiz You can predict their entire navigational envelope and avoid it completely.

English

143

ControlWiz@Control_wiz·5d

In multi-agent planning you react and replan continuously using your belief states at 30–50 Hz. If the behavior models of the other agents are known, many methods exist. When they are unknown, you need some predictive model, and collision avoidance still cannot be guaranteed without strong assumptions. I’m not aware of a general method that guarantees safe behavior without these, please share if you do.

Sparr@sparr0

@Control_wiz @ChenTessler No, you don't need some assumption about how other agents will act. It is possible to drive such that you always have an escape route. To never follow so closely that you can't stop or evade if they brake. To never cross ahead of someone that might speed up.

English

1.8K

ControlWiz ری ٹویٹ کیا

Fei-Fei Li@drfeifei·10 Haz

Scientific research is fundamental to advancing civilization and helping people globally to solve the most critical problems, from medicine to materials, from brain science to physics, and much beyond. This is only possible when scientists have access to the best tools of the time to conduct scientific research, including having access to AI-based tools.

English

120

471

3.1K

193.2K

ControlWiz@Control_wiz·5d

@ChenTessler AVs also perform worse with bad weather , road debris, and mixed traffic. In normal driving conditions, they’re often better than the average human driver.

English

Chen Tessler@ChenTessler·5d

Isn't the fact that humans are shitty drivers that cross red lights seconds after it's switched to red, kind of the proof this isn't globally accurate? So if we agree (some) humans are shitty drivers with bad internal models and inability to predict. Then the goal is to be better than the median/mean and that's already a big improvement.

English

ControlWiz@Control_wiz·6d

Idk if human behavior is solvable. No car will likely reach level 5 autonomy, as long as there are human drivers.

alex p.@alexpdiggs

@Control_wiz just a lil speed bump.

English

3.4K

ControlWiz@Control_wiz·5d

@ChenTessler I agree. Never claimed AVs are not better than humans. My point was we don't have full autonomy because of humans on the road.

English

دریافت کریں

@JitendraMalikCV @yacineMTB @lymytom20 @mcgillu @Lavnir_ @sparr0 @elonmusk @BarackObama