荻野ゼミ
1.1K posts


Humanoids = stacked actuators. Pure and simple. If you don't understand actuators, you don't understand robots. Actuators are not solved: too heavy, too hot, too inefficient. This is a mega thread on actuator design for humanoid robots, directly from the mind of @GoingBallistic5 Bookmark it for later 👇


1/6 TheBitter RL 今天,RL太🔥了,RLHF更是毕业利器。 但 @RichardSSutton 和 @GoogleDeepMind 的Welcome to the Era of Experience 犹如TheBitterLesson的续章给我们当头一棒。 经历过模拟时代, 享受过人类数据时代, 如今我们正踏入经验时代 不靠模仿,不靠学习,而靠“活过”。 #AI范式 #RL


Would you believe that deep RL can work without replay buffers, target networks, or batch updates? Our recent work gets deep RL agents to learn from a continuous stream of data one sample at a time without storing any sample. Joint work with @Gautham529 and @rupammahmood.



















