skara

398 posts

skara banner
skara

skara

@skara_io

cto • simulating reality for robots @bifrost_ai @sequoia @lux_capital

San Francisco Katılım Mayıs 2021
838 Takip Edilen2.5K Takipçiler
vrushtee
vrushtee@vruga_·
I am working on Online RL. MolmoACT2 deploys out of the box but fails at many tasks due to its data distribution. One really interesting insight would be to have a setup working where you can deploy this good enough base policy and make it learn on the go
English
7
12
86
6.5K
skara
skara@skara_io·
@Willqyu Also not to mention the LLMs are built on dozens of low level abstractions. We have to build everything from scratch in robotics
English
1
0
0
37
Will Yu
Will Yu@Willqyu·
"Good at building the factories and tooling them" is doing all the heavy lifting here The biggest difference between LLMs and robotics is that hardware is not transferable like information. The same model* can be used to generate a legal document or a python script. But you cannot use the same fixture for one part for another. The atoms define the function. And the generic solution will never be a global maximum There are a thousand steps to design and build every factory. Every one is atomic locked. This is what every pundit is missing about "deployment"
FleetingBits@fleetingbits

some tentative thoughts on humanoid robotics 1) i'm not sure there is such a thing as asi for humanoid robotics, at least in a commercially valuable sense 2) once you have a robot that can take text instructions and turn them into decent management of a particular robot embodiment, you have captured most of the value 3) at that point, you can deploy it onto a production line, you can use it for unpacking and stocking shelves in a convenience store, perhaps it can be used in the home, etc... 4) at scale, like in a warehouse or on a factory floor, they can be coordinated using reasoning models, which receive telemetry and issue commands to individual robots and troubleshoot their behavior; 5) you can imagine further advances in robot operating models that create incremental value in particular industries, but the core of the value has already been captured 6) you can also imagine more general models that can operate multiple kinds of physical embodiments well, not just one; but i tend to think that this is mostly a cost improvement for operators 7) we should assume foundation models for robotics will follow a similar trend to llms, where the open source models trail the frontier by 9-24 months 8) this is true in part because there are well resourced players, like nvidia, that would train these models and have an incentive to open source them, to avoid concentration of their customer base 9) so, the companies that get the software advantage first have 9-24 months of lead time, but their models will saturate much more quickly than language model intelligence saturates 10) at that point, most of the value goes to whoever can produce the robots at scale and get them out at good gross margins, not to whoever produced the software 11) so, the winners look like the people that are good at building and running the factories and tooling them, plus the people that are good at training language models for discovery and operations that support this 12) my gut is that companies like figure and physical intelligence are on the wrong side of robotics on the long term; they are too invested in the software 13) tesla is maybe on the right side; chinese hardware companies are certainly on the right side; as these companies specialize in building at scale 14) there will also be many niche uses of robotics that both require further capability unlocks to be fully valuable and are vlm-shaped, like very small air gapped drones for war 15) but, i suspect this is not the majority of the economic value for humanoid robots and the majority of the value saturates on intelligence

English
1
3
8
1.5K
Tammie
Tammie@tammiesiew·
back home in singapore after 2 years and cannot wait to relive the quintessential teenage past time of playing first person shooters in seedy internet cafes
English
1
0
6
188
skara
skara@skara_io·
@orbitrobotics The 4 arms makes total sense for this application but it’s nightmare fuel
English
0
0
0
402
ORBIT
ORBIT@orbitrobotics·
The future has arrived! After a long wait, we are finally ready to reveal our complete space humanoid HELIOS. After two semesters of intense work, research and iteration, this is what we have to show. 4 arms. 4 hands. 1 vision. 1 dream.
English
69
191
1.8K
194K
skara
skara@skara_io·
@VoidAsuka Would be so sick if folks could use it as a virtual movie set
English
0
0
1
46
Asuka Zheng🎀
Asuka Zheng🎀@VoidAsuka·
This is super cool and incredible work for humanity. I hope more people around the world get to see this. My friend has been working on creating 3D Gaussian Splatting digital copies of ancient Chinese architecture-artistic and intelligent masterpieces built hundreds or thousands of years ago that are now fading with time. For example, the one below is a famous scene from the popular video game Black Myth: Wukong called Xiaoxitian. He took 3,811 on-site photos, totaling 206 GB of data, to completely rebuild these scenes. I deeply respect and appreciate initiatives like this. And I could see multiple great startup ideas behind this as well.
Asuka Zheng🎀 tweet media
MasterPa@HanyangWang

小西天,看着像视频,但其实是我们在现场实地拍摄 3,811 张 206 GB 的照片后建模的。FUNES 把《黑神话:悟空》里「既见未来,为何不拜」满天神佛的原型,来自自山西临汾隰县的小西天,做成了一个可漫游的 3DGS 数字存档。 完全实地拍摄,每天清晨一开门就冲上山去,趁着没人的时候拍。然后通过 Gaussian Splatting 重建,没有手工建模,尽量保留真实悬塑和圆塑的极其密集的金色空间、细节和光感。不同的材质在这里交织成了无法分辨的一个天国世界。这种半空中的小塑像是「悬塑」,它们大多出现在十六世纪到十七世纪。 在现场如果要看清小西天的所有细节,我想大概需要三天时间。但是有了模型,我们可以在屏幕前慢慢看。在相当长的时间里,学术界并没有特别重视小西天这样的悬塑——因为在只有学术图录的年代,平面印刷无法展示出悬塑的震撼。而随着技术的进步,我们终于可以在远方一窥明代悬塑的璀璨。 重轻特意为这个模型做了配乐,大家可以打开慢慢欣赏。 推荐电脑访问:funes.world/apps/the-hangi…

English
1
2
64
4.9K
skara
skara@skara_io·
The importance of simulation is to build machine optimised behaviours that are often times non-human behaviours. Ego-centric data may help us scale system-2 systems but systems-1 systems may have to be RL’ed per embodiment???
The Humanoid Hub@TheHumanoidHub

Atlas hauling a 50 lb mini-fridge - Practiced the maneuver for millions of hours in a virtual environment. - Focused the training policy on full-body engagement rather than just hand-grasping, allowing the robot to leverage its entire frame for the lift.

English
0
0
1
235
adriel
adriel@adrielyong·
everytime i see a video of “here’s how i use AI to make $30k/mth consistently” i’m like why aren’t you enjoying life instead
English
8
1
11
746
Lu Ling
Lu Ling@LuLing26466911·
🚀 Want to see how we do real-to-sim from a single input image? We’re releasing the code for I-Scene #CVPR2026! ✨Highlights: - Stronger scene generalization trained on randomly composed objects - Scalable data generation for downstream tasks - Supports both 3D Gaussian Splatting and mesh outputs Try the online Hugging Face demo and play with I-Scene yourself! GitHub: github.com/LuLing06/I-Sce… Demo: huggingface.co/spaces/LuLing/… Project: luling06.github.io/I-Scene-web-pa…
Lu Ling@LuLing26466911

Do we really need massive curated 3D scene data for interactive world generation? #SAM3D, #WorldGen say yes. We say no. I-Scene learns better spatial knowlesge using only 25K randomly composed instances. 🔑 Key insight: We reprogram the instance generator to infer support, proximity, and symmetry from purely geometric cues for generating interactive scenes. 🧠 Scene-context attention 👁️ View-centric space 🧱 Random composition beats expensive curation 🌐 luling06.github.io/I-Scene-projec… 💻 github.com/LuLing06/I-Sce… 🧵 Details below [1/6]

English
2
19
121
13.7K
skara
skara@skara_io·
@lukas_m_ziegler Measuring the accuracy of the system is gonna be a fun task
English
0
0
1
29