skara

398 posts

skara

@skara_io

cto • simulating reality for robots @bifrost_ai @sequoia @lux_capital

San Francisco Katılım Mayıs 2021

838 Takip Edilen2.5K Takipçiler

skara@skara_io·20h

Something about the way they filmed it, made me feel things

David McAllister@davidrmcall

Robot (@arthurallshire)

English

skara@skara_io·20h

Braindance seems within reach!

SciTech Era@SciTechera

Thats awesome! A developer used Al-powered 4D Gaussian Splatting to convert flat video footage into dynamic 3D spatial scenes. The system reconstructs different camera angles and depth information from ordinary footage, making it possible to navigate scenes in three dimensions. We're moving from recording video to digitally recreating reality itself.

English

989

skara@skara_io·1d

prob the highest levels of fidelity and clean geometry in scene reconstructions so far kasothaphie.github.io/GenRecon/ @MattNiessner @angelaqdai

English

4.6K

skara@skara_io·1d

@vruga_ @DJiafei What kinds of scenarios does it work well on?

English

176

vrushtee@vruga_·1d

I am working on Online RL. MolmoACT2 deploys out of the box but fails at many tasks due to its data distribution. One really interesting insight would be to have a setup working where you can deploy this good enough base policy and make it learn on the go

English

6.5K

skara@skara_io·1d

@Willqyu Also not to mention the LLMs are built on dozens of low level abstractions. We have to build everything from scratch in robotics

English

Will Yu@Willqyu·1d

"Good at building the factories and tooling them" is doing all the heavy lifting here The biggest difference between LLMs and robotics is that hardware is not transferable like information. The same model* can be used to generate a legal document or a python script. But you cannot use the same fixture for one part for another. The atoms define the function. And the generic solution will never be a global maximum There are a thousand steps to design and build every factory. Every one is atomic locked. This is what every pundit is missing about "deployment"

FleetingBits@fleetingbits

some tentative thoughts on humanoid robotics 1) i'm not sure there is such a thing as asi for humanoid robotics, at least in a commercially valuable sense 2) once you have a robot that can take text instructions and turn them into decent management of a particular robot embodiment, you have captured most of the value 3) at that point, you can deploy it onto a production line, you can use it for unpacking and stocking shelves in a convenience store, perhaps it can be used in the home, etc... 4) at scale, like in a warehouse or on a factory floor, they can be coordinated using reasoning models, which receive telemetry and issue commands to individual robots and troubleshoot their behavior; 5) you can imagine further advances in robot operating models that create incremental value in particular industries, but the core of the value has already been captured 6) you can also imagine more general models that can operate multiple kinds of physical embodiments well, not just one; but i tend to think that this is mostly a cost improvement for operators 7) we should assume foundation models for robotics will follow a similar trend to llms, where the open source models trail the frontier by 9-24 months 8) this is true in part because there are well resourced players, like nvidia, that would train these models and have an incentive to open source them, to avoid concentration of their customer base 9) so, the companies that get the software advantage first have 9-24 months of lead time, but their models will saturate much more quickly than language model intelligence saturates 10) at that point, most of the value goes to whoever can produce the robots at scale and get them out at good gross margins, not to whoever produced the software 11) so, the winners look like the people that are good at building and running the factories and tooling them, plus the people that are good at training language models for discovery and operations that support this 12) my gut is that companies like figure and physical intelligence are on the wrong side of robotics on the long term; they are too invested in the software 13) tesla is maybe on the right side; chinese hardware companies are certainly on the right side; as these companies specialize in building at scale 14) there will also be many niche uses of robotics that both require further capability unlocks to be fully valuable and are vlm-shaped, like very small air gapped drones for war 15) but, i suspect this is not the majority of the economic value for humanoid robots and the majority of the value saturates on intelligence

English

1.5K

skara@skara_io·2d

@asimovinc @menloresearch Sick, what are you guys open-sourcing on the sim side?

English

107

Asimov@asimovinc·2d

Asimov 1 is the entry point to humanoid robotics. It's open-source from design to sim, uses off-the-shelf parts, and is fully repairable. If you're at a university lab and want to bring one in, reach out. We'd love to talk. menlo.ai/talk

Joe Harris@_joe_harris_

every university should have one of these in their robotics department

English

5.4K

skara@skara_io·2d

@adrielyong just like u

English

adriel@adrielyong·2d

@skara_io beautiful

English

103

skara@skara_io·2d

no surprises, works really well especially if you are just focused on one embodiment

Louis Le Lay@LouisLeLay4

boston dynamics uses mjlab btw :)

English

3.9K

skara@skara_io·2d

@adrielyong @tammiesiew @tammiesiew will carry our team

English

adriel@adrielyong·2d

@tammiesiew send deets pls i wanna shoot @skara_io

English

Tammie@tammiesiew·3d

back home in singapore after 2 years and cannot wait to relive the quintessential teenage past time of playing first person shooters in seedy internet cafes

English

188

skara@skara_io·5d

@sophiamyang @MistralAI Registered!

English

Sophia Yang, Ph.D.@sophiamyang·18 May

Excited for our first @MistralAI meetup in Singapore this week 🇸🇬 Join us:

English

1.4K

skara@skara_io·5d

@CloutedHQ team always be cooking

Clouted@CloutedHQ

Nobody believes in you, so make them. Announcing our $7M seed round led by @Slow.

English

480

skara@skara_io·6d

@orbitrobotics The 4 arms makes total sense for this application but it’s nightmare fuel

English

402

ORBIT@orbitrobotics·6d

The future has arrived! After a long wait, we are finally ready to reveal our complete space humanoid HELIOS. After two semesters of intense work, research and iteration, this is what we have to show. 4 arms. 4 hands. 1 vision. 1 dream.

English

191

1.8K

194K

skara@skara_io·6d

@VoidAsuka Would be so sick if folks could use it as a virtual movie set

English

Asuka Zheng🎀@VoidAsuka·6d

This is super cool and incredible work for humanity. I hope more people around the world get to see this. My friend has been working on creating 3D Gaussian Splatting digital copies of ancient Chinese architecture-artistic and intelligent masterpieces built hundreds or thousands of years ago that are now fading with time. For example, the one below is a famous scene from the popular video game Black Myth: Wukong called Xiaoxitian. He took 3,811 on-site photos, totaling 206 GB of data, to completely rebuild these scenes. I deeply respect and appreciate initiatives like this. And I could see multiple great startup ideas behind this as well.

MasterPa@HanyangWang

小西天，看着像视频，但其实是我们在现场实地拍摄 3,811 张 206 GB 的照片后建模的。FUNES 把《黑神话：悟空》里「既见未来，为何不拜」满天神佛的原型，来自自山西临汾隰县的小西天，做成了一个可漫游的 3DGS 数字存档。完全实地拍摄，每天清晨一开门就冲上山去，趁着没人的时候拍。然后通过 Gaussian Splatting 重建，没有手工建模，尽量保留真实悬塑和圆塑的极其密集的金色空间、细节和光感。不同的材质在这里交织成了无法分辨的一个天国世界。这种半空中的小塑像是「悬塑」，它们大多出现在十六世纪到十七世纪。在现场如果要看清小西天的所有细节，我想大概需要三天时间。但是有了模型，我们可以在屏幕前慢慢看。在相当长的时间里，学术界并没有特别重视小西天这样的悬塑——因为在只有学术图录的年代，平面印刷无法展示出悬塑的震撼。而随着技术的进步，我们终于可以在远方一窥明代悬塑的璀璨。重轻特意为这个模型做了配乐，大家可以打开慢慢欣赏。推荐电脑访问：funes.world/apps/the-hangi…

English

4.9K

skara@skara_io·6d

The importance of simulation is to build machine optimised behaviours that are often times non-human behaviours. Ego-centric data may help us scale system-2 systems but systems-1 systems may have to be RL’ed per embodiment???

The Humanoid Hub@TheHumanoidHub

Atlas hauling a 50 lb mini-fridge - Practiced the maneuver for millions of hours in a virtual environment. - Focused the training policy on full-body engagement rather than just hand-grasping, allowing the robot to leverage its entire frame for the lift.

English

235

skara@skara_io·6d

@adrielyong Cause it’s never enough

English

adriel@adrielyong·19 May

everytime i see a video of “here’s how i use AI to make $30k/mth consistently” i’m like why aren’t you enjoying life instead

English

746

skara@skara_io·6d

Majulah

rachael de foe@unprofeshme

FORWARD DEPLOYED NATION 🇸🇬 @OpenAI 's first applied ai lab outside of the US is in singapore if you aren't bullish on that pink ic yet, you should be nowww 🩷

Indonesia

182

skara@skara_io·6d

@LuLing26466911 So important for scaling diverse simulations, awesome work @LuLing26466911!

English

Lu Ling@LuLing26466911·6d

🚀 Want to see how we do real-to-sim from a single input image? We’re releasing the code for I-Scene #CVPR2026! ✨Highlights: - Stronger scene generalization trained on randomly composed objects - Scalable data generation for downstream tasks - Supports both 3D Gaussian Splatting and mesh outputs Try the online Hugging Face demo and play with I-Scene yourself! GitHub: github.com/LuLing06/I-Sce… Demo: huggingface.co/spaces/LuLing/… Project: luling06.github.io/I-Scene-web-pa…

Lu Ling@LuLing26466911

Do we really need massive curated 3D scene data for interactive world generation? #SAM3D, #WorldGen say yes. We say no. I-Scene learns better spatial knowlesge using only 25K randomly composed instances. 🔑 Key insight: We reprogram the instance generator to infer support, proximity, and symmetry from purely geometric cues for generating interactive scenes. 🧠 Scene-context attention 👁️ View-centric space 🧱 Random composition beats expensive curation 🌐 luling06.github.io/I-Scene-projec… 💻 github.com/LuLing06/I-Sce… 🧵 Details below [1/6]

English

121

13.7K

skara@skara_io·6d

@lukas_m_ziegler Measuring the accuracy of the system is gonna be a fun task

English

Lukas Ziegler@lukas_m_ziegler·19 May

count the sand grains: agent: OK

Benedikt Seidel@ben_sdl

I built something that finally gives agents the option to see and measure the physical world, reason within it, and accelerate your business with better and faster decisions. Pulling out insights from 3D data and cross-referencing across data silos, completely automating all the steps. From point cloud to the sent pick-up price, date, and picking the best buyer. If you have other companies our agent should integrate with, tell me!

English

11.3K

Keşfet

@MattNiessner @angelaqdai @vruga_ @DJiafei @Willqyu @asimovinc @menloresearch @adrielyong