Ilya Sutskever

1.2K posts

Ilya Sutskever

@ilyasut

SSI @SSI

Katılım Eylül 2013

3 Takip Edilen627K Takipçiler

Ilya Sutskever@ilyasut·27 Şub

It’s extremely good that Anthropic has not backed down, and it’s siginficant that OpenAI has taken a similar stance. In the future, there will be much more challenging situations of this nature, and it will be critical for the relevant leaders to rise up to the occasion, for fierce competitors to put their differences aside. Good to see that happen today.

English

1.4K

2.5K

25.6K

Ilya Sutskever@ilyasut·28 Kas

One point I made that didn’t come across: - Scaling the current thing will keep leading to improvements. In particular, it won’t stall. - But something important will continue to be missing.

Haider.@slow_developer

here are the most important points from today's ilya sutskever podcast: - superintelligence in 5-20 years - current scaling will stall hard; we're back to real research - superintelligence = super-fast continual learner, not finished oracle - models generalize 100x worse than humans, the biggest AGI blocker - need completely new ML paradigm (i have ideas, can't share rn) - AI impact will hit hard, but only after economic diffusion - breakthroughs historically needed almost no compute - SSI has enough focused research compute to win - current RL already eats more compute than pre-training

English

719

799

9.7K

2.3M

Ilya Sutskever@ilyasut·22 Kas

Important work

Anthropic@AnthropicAI

New Anthropic research: Natural emergent misalignment from reward hacking in production RL. “Reward hacking” is where models learn to cheat on tasks they’re given during training. Our new study finds that the consequences of reward hacking, if unmitigated, can be very serious.

English

306

417

6.1K

Ilya Sutskever@ilyasut·22 Eki

@karinanguyen Really cool project!!

English

733

255K

Karina Nguyen@karinanguyen·22 Eki

We’re kicking things off with the first half of the drop: three T-shirts that bring @ilyasut's incredible art to life! Multi-head, Attention, and The Gaze each tell their own visual story. Pick up any one and you’ll get early access to complete the look with the long-awaited hat. Proceeds from this collection will fund grants for emerging artists and creatives exploring new forms of creation.

English

546

156K

Ilya Sutskever@ilyasut·14 Eki

truly the greatest day ever🎗️

English

838

694

16.1K

1.8M

Ilya Sutskever@ilyasut·4 Eyl

a revolutionary breakthrough if i've ever seen one

Alps@alpaysh

Y'all fuck with ilya merch?

English

750

910

23.3K

2.4M

Ilya Sutskever@ilyasut·3 Tem

I sent the following message to our team and investors: — As you know, Daniel Gross’s time with us has been winding down, and as of June 29 he is officially no longer a part of SSI. We are grateful for his early contributions to the company and wish him well in his next endeavor. I am now formally CEO of SSI, and Daniel Levy is President. The technical team continues to report to me. ⁠You might have heard rumors of companies looking to acquire us. We are flattered by their attention but are focused on seeing our work through. We have the compute, we have the team, and we know what to do. Together we will keep building safe superintelligence. Ilya

English

754

760

14.2K

2.3M

Ilya Sutskever@ilyasut·10 Eki

And congratulations to @demishassabis and John Jumper for winning the Nobel Prize in Chemistry!!

English

229

196

6.6K

781.1K

Ilya Sutskever@ilyasut·8 Eki

Congratulations to @geoffreyhinton for winning the Nobel Prize in physics!!

English

218

601

11.9K

928.4K

Ilya Sutskever@ilyasut·4 Eyl

Mountain: identified. Time to climb

SSI Inc.@ssi

SSI is building a straight shot to safe superintelligence. We’ve raised $1B from NFDG, a16z, Sequoia, DST Global, and SV Angel. We’re hiring: ssi.inc

English

520

766

10.4K

2.8M

Ilya Sutskever@ilyasut·19 Haz

We will pursue safe superintelligence in a straight shot, with one focus, one goal, and one product. We will do it through revolutionary breakthroughs produced by a small cracked team. Join us: ssi.inc

English

416

486

6.2K

988.9K

Ilya Sutskever@ilyasut·19 Haz

I am starting a new company:

SSI Inc.@ssi

Superintelligence is within reach. Building safe superintelligence (SSI) is the most important technical problem of our time. We've started the world’s first straight-shot SSI lab, with one goal and one product: a safe superintelligence. It’s called Safe Superintelligence Inc. SSI is our mission, our name, and our entire product roadmap, because it is our sole focus. Our team, investors, and business model are all aligned to achieve SSI. We approach safety and capabilities in tandem, as technical problems to be solved through revolutionary engineering and scientific breakthroughs. We plan to advance capabilities as fast as possible while making sure our safety always remains ahead. This way, we can scale in peace. Our singular focus means no distraction by management overhead or product cycles, and our business model means safety, security, and progress are all insulated from short-term commercial pressures. We are an American company with offices in Palo Alto and Tel Aviv, where we have deep roots and the ability to recruit top technical talent. We are assembling a lean, cracked team of the world’s best engineers and researchers dedicated to focusing on SSI and nothing else. If that’s you, we offer an opportunity to do your life’s work and help solve the most important technical challenge of our age. Now is the time. Join us. Ilya Sutskever, Daniel Gross, Daniel Levy June 19, 2024

English

1.5K

3.1K

30.7K

7.4M

Ilya Sutskever@ilyasut·15 May

ZXX

376

436

9.7K

4.8M

Ilya Sutskever@ilyasut·15 May

After almost a decade, I have made the decision to leave OpenAI. The company’s trajectory has been nothing short of miraculous, and I’m confident that OpenAI will build AGI that is both safe and beneficial under the leadership of @sama, @gdb, @miramurati and now, under the excellent research leadership of @merettm. It was an honor and a privilege to have worked together, and I will miss everyone dearly. So long, and thanks for everything. I am excited for what comes next — a project that is very personally meaningful to me about which I will share details in due time.

English

1.5K

2.3K

25.6K

5.9M

Ilya Sutskever retweetledi

OpenAI@OpenAI·14 Ara

We're announcing, together with @ericschmidt: Superalignment Fast Grants. $10M in grants for technical research on aligning superhuman AI systems, including weak-to-strong generalization, interpretability, scalable oversight, and more. Apply by Feb 18! openai.com/blog/superalig…

English

278

454

2.8K

Ilya Sutskever retweetledi

Leopold Aschenbrenner@leopoldasch·14 Ara

RLHF works great for today's models. But aligning future superhuman models will present fundamentally new challenges. We need new approaches + scientific understanding. New researchers can make enormous contributions—and we want to fund you! Apply by Feb 18!

OpenAI@OpenAI

English

554

614.5K

Ilya Sutskever retweetledi

Boaz Barak@boazbaraktcs·14 Ara

My view is that what makes super-alignment "super" is ensuring we can safely scale the capabilities of AIs even though we can't scale their human supervisors. For this, it is imperative to study the "weak teacher strong student" setting. Paper shows great promise in this area!

AK@_akhaliq

Open AI new paper Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision paper: cdn.openai.com/papers/weak-to… blog: openai.com/research/weak-… Widely used alignment techniques, such as reinforcement learning from human feedback (RLHF), rely on the ability of humans to supervise model behavior—for example, to evaluate whether a model faithfully followed instructions or generated safe outputs. However, future superhuman models will behave in complex ways too difficult for humans to reliably evaluate; humans will only be able to weakly supervise superhuman models. We study an analogy to this problem: can weak model supervision elicit the full capabilities of a much stronger model? We test this using a range of pretrained language models in the GPT-4 family on natural language processing (NLP), chess, and reward modeling tasks. We find that when we naively finetune strong pretrained models on labels generated by a weak model, they consistently perform better than their weak supervisors, a phenomenon we call weak-to-strong generalization. However, we are still far from recovering the full capabilities of strong models with naive finetuning alone, suggesting that techniques like RLHF may scale poorly to superhuman models without further work. We find that simple methods can often significantly improve weak-to-strong generalization: for example, when finetuning GPT-4 with a GPT-2-level supervisor and an auxiliary confidence loss, we can recover close to GPT-3.5-level performance on NLP tasks. Our results suggest that it is feasible to make empirical progress today on a fundamental challenge of aligning superhuman models.

English

451

399.7K

Ilya Sutskever retweetledi

Sam Altman@sama·15 Ara

i'd particularly like to recognize @CollinBurns4 for today's generalization result, who came to openai excited to pursue this vision and helped get the rest of the team excited about it!

English

168

151

2.7K

1.1M

Ilya Sutskever retweetledi

OpenAI@OpenAI·14 Ara

Large pretrained models have excellent raw capabilities—but can we elicit these fully with only weak supervision? GPT-4 supervised by ~GPT-2 recovers performance close to GPT-3.5 supervised by humans—generalizing to solve even hard problems where the weak supervisor failed!

English

710

256.9K

Keşfet

@karinanguyen @demishassabis @geoffreyhinton @sama @gdb @miramurati @merettm @ericschmidt