Yanda

18 posts

Yanda banner
Yanda

Yanda

@BaoYanda

robots @uwcse, prev @Roblox

Katılım Temmuz 2019
196 Takip Edilen147 Takipçiler
Yanda retweetledi
Tyler Westenbroek
Tyler Westenbroek@ty_westenbroek·
Real-world RL is still too brittle and data-hungry for long-horizon, contact-rich tasks. We introduce Simulation Distillation (SimDist), which turns large-scale simulated experience into reusable world-model priors for rapid real-world adaptation. By combining online planning with dynamics adaptation, SimDist achieves high success rates on tasks requiring precision, force, and reactivity. Play with our interactive visualization to see for yourself: sim-dist.github.io (1/n)
English
3
44
203
25.3K
Yanda retweetledi
Abhishek Gupta
Abhishek Gupta@abhishekunique7·
Punchline: distill world models from simulation to enable fast, stable real-world robot adaptation. Simulation is nearly always wrong. But in Simulation Distillation, we ask a simple question: How do we perform simulation pretraining such that real-world adaptation becomes trivially easy? sim-dist.github.io Let's take a closer look (1/n)
English
7
38
304
29K
Yanda retweetledi
Reuben Santoso
Reuben Santoso@reuben_santoso·
Imagine testing your app like real users actually use it. not one fake user clicking through a script, but many users interacting in a single workflow. One uploads a post. Others like and comment on it. In the same test run. In real browsers. @JoveW, @SamuelJepee and I built @qualtydotco a agentic QA that evolves test cases with your product and finally catch the hardest bugs that only show up when users interact. No brittle scripts. No Maintenance.
English
10
14
48
3.4K
Yanda
Yanda@BaoYanda·
@pravsels sounds a bit like pointworld, they predict point deltas
English
1
0
1
77
Praveen Selvaraj
Praveen Selvaraj@pravsels·
began testing this idea over the weekend. what if a World Model takes in the difference between frames as input as opposed to entire frames ? not quite working yet.. but also output isn't completely nonsensical (I'm counting that as a win)
English
5
0
14
1.6K
Yanda retweetledi
Patrick Yin
Patrick Yin@patrickhyin·
We’re releasing OmniReset, a framework for training robot policies using large-scale RL and diverse resets for contact-rich, dexterous manipulation. OmniReset pushes the frontier of robustness and dexterity, without any reward engineering or demonstrations. Try the policies yourself in our interactive simulator! weirdlabuw.github.io/omnireset/ (1/N 🧵)
English
21
95
468
108.1K
Yanda retweetledi
Abhishek Gupta
Abhishek Gupta@abhishekunique7·
Excited to share the project that has surprised me the most in the last year! Large-scale RL in simulation, no demos and no reward engineering can solve dynamic, dexterous and contact rich tasks. The learned behaviors are reactive, forceful and use the environment for recovery in ways that are extremely challenging to bake in or teleoperate! You can play with the policies yourself to see: weirdlabuw.github.io/omnireset/ And, the learned behavior transfers to real world robots from RGB camera inputs! So what’s the trick - using simulator resets carefully! Let’s unpack (1/10)
English
17
91
614
82K
Yanda retweetledi
Keller Jordan
Keller Jordan@kellerjordan0·
Hinton, LeCun, and every other neolab: Gradient descent is fundamentally broken. It needs thousands of examples to learn what humans do in only a few. It’s time to start looking for a radical new learning paradigm to close the gap. In-context learning: Do I mean nothing to you?
English
51
33
819
114.8K
Yanda retweetledi
Pukicho
Pukicho@pukicho·
My child will not be allowed to use chat gpt. He will be smarter and stronger than the other children and he will kill them easily.
English
628
33.8K
272.9K
18.1M
Yanda
Yanda@BaoYanda·
@pjreddie no more wow streams? 🤕
English
1
0
3
4.5K
Jeffrey Emanuel
Jeffrey Emanuel@doodlestein·
I wanted to read Henry Kissinger’s 400 page undergraduate thesis (it has an incredible first page), but really didn’t feel like dealing with a scanned PDF that’s annoying to read on a phone without constantly zooming and panning. So I decided to convert it to a nice markdown format using OCR and LLMs. Then I thought it would be nice to fix the footnotes and get rid of the page breaks and to fix the line breaks and other things like that. I was already working on some other coding projects, so I had the idea of loading up the draft markdown file in Claude Code and having it work on fixing these issues using a swarm of 20 sub-agents, which worked well. Then I thought it would be cool to link to the full sources for all the many references on sites like the Internet Archive or Project Gutenberg, so I had another swarm of sub-agents do a ton of searches to track the links down and insert them into the footnotes and bibliography. Then I figured that I might as well run it through my mind-map generator and summarization code to see what it comes up with, so I tried that. But now I had a few files to present, so needed some kind of index page. So I asked Codex with GPT-5 to whip up a slick looking web page to present the stuff nicely, which it did a yeoman’s job with. Note that I was already working with these tools in a bunch of other sessions on other projects, so my work here was occasionally giving some instructions to the coding agents and letting them crank away. I really didn’t spend much active time on this! Anyway, the net result is clearly the premier way in the world today to consume Henry Kissinger’s undergraduate thesis electronically. I’ll post the link in the next tweet to avoid getting punished by the algorithm. As for the thesis itself, it’s wild how erudite he was as a young man, and also what a great writer he was. And even more impressive considering that English was his second language. The thesis is basically him trying to come to grips with, and to mentally organize in an internally consistent way, a vast swath of Western thought. From what I’ve read so far, I think he did a pretty good job. Incidentally, his thesis is the reason Harvard changes the rules to limit the undergrad honors thesis to a maximum of 35,000 words. Good thing they didn’t apply this silly limit to Henry!
Jeffrey Emanuel tweet media
English
169
442
5.2K
551.3K
Yanda
Yanda@BaoYanda·
i need to get better at suffering fr
English
0
0
5
244
Yanda
Yanda@BaoYanda·
@tbpn kaiming he not even in the top 30 😭
English
0
0
0
454
TBPN
TBPN@tbpn·
Two weeks ago, we launched The Metis List. Since then, we've spoken with many of you and have updated the ranking accordingly. 128 top AI researchers, ranked by their peers.
English
66
41
581
177.4K
Yanda
Yanda@BaoYanda·
best bowl of ramen ive ever had, only $25
Yanda tweet media
English
2
0
2
268
Yanda
Yanda@BaoYanda·
These guys are goated, go download meteor 👀
Y Combinator@ycombinator

Meteor (@browsedotdev) is an intelligent, AI-native browser. Google Chrome is a browser of the past. Meteor gets things done for you, just like your very own personal assistant. Download it today at browse.dev.

English
0
0
1
330
Yanda
Yanda@BaoYanda·
@evilbiscotto struggling w the waking up early part 😖😵
English
1
0
1
63
Yanda retweetledi
wh
wh@nrehiew_·
The problem with all these agent companies/products is that since you don’t have access to the underlying weights, the bet you’re making is that your scaffolds are better than the labs. This is hard because: 1) The labs can bake the scaffolds into the model (Claude Code) 2)
wh tweet media
English
34
20
486
75.2K
Yanda retweetledi
Yiping Lu
Yiping Lu@2prime_PKU·
Anyone knows adam?
Yiping Lu tweet media
English
265
436
4.8K
635K