

GEN-1 ties zip ties. Read more about GEN-1 in our blog post in the comments below ↓

Today marks the end of my first full week @GeneralistAI. Last Monday, I was given a challenge: use our GEN-1 model to teach a robot a task of my choosing, using the same no-code platform our customers use. I picked the ball-and-vase magic trick. It was one of my favorites as a kid, and it felt like the right mix of fun and surprisingly hard. A few days later, GEN-1 pulled it off. I left Friday having watched the robot nail it 14 times in a row. What’s wild is that even 4 months ago, if you’d told me you could go from idea to on-robot skill in a couple of days, I probably wouldn’t have believed you. Really excited to be building with an incredible team. Can’t wait to see what week two brings 🤖



GEN-1 cleans a whiteboard. Read more about GEN-1 in our blog post in the comments below ↓




Can a language model learn, end-to-end, what to keep in its own KV cache and what to throw away? Can it learn to forget while it learns to reason? Deep learning's central lesson is that capability emerges from end-to-end optimization, not from heuristics or strong inductive biases. Yet for efficiency, we still rely heavily on hand-designed approaches. 🗑️ Introducing Neural Garbage Collection (NGC): we train a language model to jointly reason and manage its own KV cache, using reinforcement learning with outcome-based task reward alone. No SFT, no proxy objectives, no summarization in natural language. New paper with @jubayer_hamid, Emily Fox, and @noahdgoodman!
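For intuition, here is a minimal sketch (in PyTorch) of the kind of mechanism described above: a toy decoder that, whenever its cache exceeds a budget, samples per-entry keep/evict decisions from a learned head, and whose eviction policy is trained with REINFORCE on a task-level reward alone. Everything here -- the TinyNGCDecoder architecture, the evict_head, the budget -- is an illustrative assumption, not the paper's actual design.

```python
# Hypothetical sketch of learned cache eviction trained by outcome reward.
# The "cache" here is a list of hidden states standing in for real KV entries.
import torch
import torch.nn as nn

class TinyNGCDecoder(nn.Module):
    def __init__(self, vocab=256, d=64, budget=32):
        super().__init__()
        self.emb = nn.Embedding(vocab, d)
        self.attn = nn.MultiheadAttention(d, num_heads=4, batch_first=True)
        self.lm_head = nn.Linear(d, vocab)
        self.evict_head = nn.Linear(d, 1)  # per-entry keep/evict logit
        self.budget = budget

    def step(self, token, cache):
        """One decode step; `cache` is a list of (1, 1, d) tensors (batch=1)."""
        h = self.emb(token)                              # (1, 1, d)
        mem = torch.cat(cache + [h], dim=1)              # (1, T+1, d)
        out, _ = self.attn(h, mem, mem)                  # attend over kept entries
        logits = self.lm_head(out)                       # next-token logits

        cache = cache + [h]
        evict_logp = None
        if len(cache) > self.budget:
            # Model scores every cached entry, samples keep (1) / evict (0).
            scores = self.evict_head(torch.cat(cache, dim=1)).squeeze(-1)  # (1, T+1)
            dist = torch.distributions.Bernoulli(logits=scores)
            mask = dist.sample()
            evict_logp = dist.log_prob(mask).sum()       # log-prob for REINFORCE
            cache = [c for c, m in zip(cache, mask[0]) if m.item() > 0.5]
        return logits, cache, evict_logp

# Training sketch: roll out, collect eviction log-probs, then reinforce
# them with the scalar outcome reward -- no proxy objective for the cache.
model = TinyNGCDecoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

cache, logps = [], []
token = torch.zeros(1, 1, dtype=torch.long)
for _ in range(64):
    logits, cache, lp = model.step(token, cache)
    if lp is not None:
        logps.append(lp)
    token = logits.argmax(-1)                # greedy decode for the sketch

reward = 1.0  # stand-in for a task-level outcome reward (e.g., answer correct)
if logps:
    opt.zero_grad()
    loss = -reward * torch.stack(logps).sum()
    loss.backward()
    opt.step()
```

The point of the sketch is the training signal: the eviction policy never sees a proxy objective like attention scores or recency; its only feedback is whether the episode earned reward. The actual model presumably trains reasoning and eviction jointly under the same reward; this toy isolates the eviction half.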

Nothing shockingly dumb?


GEN-1 plays the 🐚 shell game, trained on just 1 hr of robot data. It also generalizes to unseen objects, like @BerkayAntmen's car keys. Physical AI models should be capable of benchmark tasks like this one. It's interesting for all the reasons @RhodaAI calls out -- it requires visual memory, and the model must track the cups from the very start, at high frame rates. Interestingly, GEN-1 appears to exhibit a degree of "active perception." It's subtle: the hands can sometimes appear to "follow" the cups, with the model using its own movements to help attend to where it thinks the object should be. Read more about GEN-1 in our blog post in the comments below ↓


Pretty crazy take. If you take our undergrad TA compensation package and measure it as hourly pre-tax income, it's $89 to $102/hr. How is that 'unfair' comp for a 19-year-old with no degree and no job experience, when it's 4x+ what pretty much every other university pays?🤔 A good chunk of that is the tuition remission OP mentions, 80+% of which goes to students who don't qualify for a penny of financial aid. As I'm quoted in the article, that means we're using other kids' tuition and state funds to subsidize tuition for those wealthier than average, which is exactly ... the opposite of what a government institution like the UC should be doing. I question the negotiation process between public employers and labor unions. The people who run negotiations at the UC Office of the President aren't negotiating with their own money, but rather the taxpayer's and the average tuition payer's, and the deals they agree to are, in my opinion, a slap in the face to the taxpayer. P.S. Here's how our undergrad TA comp compares to other universities: docs.google.com/spreadsheets/d… (see column G for total pre-tax-equivalent comp)

