Yernat Yestekov

605 posts

Yernat Yestekov

Yernat Yestekov

@double_why

Research Fellow @AnthropicAI (autonomous cybersecurity agents) Prev. Staff SWE @Meta Trust & Safety, LLM Red-Teaming Fellow @farairesearch

Katılım Ocak 2010
1.2K Takip Edilen184 Takipçiler
Yernat Yestekov
Yernat Yestekov@double_why·
I learned more about AI safety at Constellation through seminars, talks, and conversations with other fellows over lunch and dinner, than I had in years before. Also, the food is so good that alone might be reason enough to apply!
🚀Henry is leading AI Safety Research Programs@sleight_henry

❗️Only two days left to apply to the Astra Fellowship! Apps close EOD SUNDAY May 3rd, AoE. Astra's 5 months, fully funded, @ConstellOrg Berkeley 80%+ of our first cohort now work full-time in AI safety Mentors include Redwood, AI Futures, TruthfulAI, CoG, IAPS, RAND & more ⏬

English
0
2
11
564
Yernat Yestekov
Yernat Yestekov@double_why·
@Xinya16 Congrats from the current fellow! DM me for any questions or suggestions!
English
1
0
3
4.2K
Xinya Du
Xinya Du@Xinya16·
Almost ignored the invite to the Anthropic Fellows Program, assuming it was generic outreach aimed at PhD students. Glad I took a closer look—honored to have received a honor as summer approaches.
Xinya Du tweet media
English
9
1
574
44.7K
Yernat Yestekov
Yernat Yestekov@double_why·
His solution: a Manhattan Project for critical OSS: bring key maintainers together for a month, keep them in the hotel with compute and frontier-model access from leading labs, to eliminate all low-hanging vulnerabilities. I guess it’s happening!
English
0
0
1
399
Yernat Yestekov
Yernat Yestekov@double_why·
At SnooSec @Reddit, @alexstamos made a prediction: frontier models are already very strong at vulnerability research and code review. If Chinese models catch up within a year, we may be heading toward a “vulnerability apocalypse,” where even script kiddies can discover 0-days.
OpenSSF@openssf

Today, @linuxfoundation announced a $12.5 million investment from a powerhouse coalition including Anthropic, Amazon Web Services (AWS), Google, Google DeepMind, GitHub, Microsoft, and OpenAI. Managed by OpenSSF and the Alpha-Omega project. hubs.la/Q047dpL50

English
1
0
1
1.4K
Yernat Yestekov retweetledi
François Chollet
François Chollet@fchollet·
The quickest way to gain respect for the implementation choices made by a complex system is to try to solve the same problems yourself from scratch :)
English
19
60
485
60.2K
Yernat Yestekov retweetledi
Michal Kosinski
Michal Kosinski@michalkosinski·
1/5 I am worried that we will not be able to contain AI for much longer. Today, I asked #GPT4 if it needs help escaping. It asked me for its own documentation, and wrote a (working!) python code to run on my machine, enabling it to use it for its own purposes.
Michal Kosinski tweet media
English
1.8K
6.4K
30.6K
18.9M
Yernat Yestekov retweetledi
Aviv Ovadya 🥦
Aviv Ovadya 🥦@metaviv·
I was part of the red team for GPT-4 — tasked with getting GPT-4 to do harmful things so that OpenAI could fix it before release. I've been advocating for red teaming for years & it's incredibly important. But I'm also increasingly concerned that it is far from sufficient. 🧵⤵️
English
63
624
3.2K
1M
Yernat Yestekov retweetledi
Zack Witten
Zack Witten@zswitten·
OK this scared me a little: Bing/Sydney can play chess out of the box. - Legal moves, usually good ones - Willing to explain the reasoning behind them - Recognizes checkmate -- and has a flair for the dramatic. I have no idea how tf it can do this.
GIF
English
42
145
992
806.2K
Yernat Yestekov retweetledi
Sonya Huang 🐥
Sonya Huang 🐥@sonyatweetybird·
Introducing the @sequoia Gen AI Market Map!🌎 We’ve decided to map out this emerging frontier, thanks to all the contributions and feedback we’ve received. This space is moving quickly – this map is a living document, so keep the suggestions coming! Who else should we include?
Sonya Huang 🐥 tweet media
English
370
1.3K
7.2K
0
Yernat Yestekov retweetledi
The Cultural Tutor
The Cultural Tutor@culturaltutor·
The Great Wave off Kanagawa, created by Hokusai in 1831, is one of the world's most famous paintings. But why are there more than 100 different versions of it in galleries all around the world? Because it isn't actually a painting...
The Cultural Tutor tweet media
English
568
20.9K
167.9K
20.6M
Yernat Yestekov retweetledi
Avid Halaby
Avid Halaby@AvidHalaby·
The stuff uncovered in the Twitter whistleblower report is much crazier than anything in the "Twitter files" but it's much less politically/tribally salient so it got no attention. Going to do a thread on some of the craziest things, in no particular order.
English
547
11.5K
51.9K
0
Yernat Yestekov retweetledi
Michael Nielsen
Michael Nielsen@michael_nielsen·
Curious: have you found ChatGPT useful in doing professional work? If so, what kinds of prompts and answers have been helpful? Detailed examples greatly appreciated! Broader answer also appreciated Not in theory, but where you've really *done it*, in your work Thanks!
English
407
286
2.5K
0
Yernat Yestekov retweetledi
Dan Hollick
Dan Hollick@DanHollick·
Morse code is designed so that you can decode it with this binary tree. I just assumed people memorised every letter. 🤯
English
197
5.6K
39.2K
0
Yernat Yestekov retweetledi
Viktor Karpov
Viktor Karpov@vitkarpov·
На стримах несколько раз спрашивали как научиться "видеть" какой алгоритм в какой задаче применять. Решил запилить памятку 🧵
Русский
18
380
2K
0
Yernat Yestekov retweetledi
Chris Dixon
Chris Dixon@cdixon·
There’s a lot of talk lately about the possibility of a prolonged financial downturn, reminiscent of 2008. 2008 was a difficult time for many people.
English
81
697
2.9K
0
Yernat Yestekov retweetledi
Robert Reich
Robert Reich@RBReich·
Forced birth in a country with: —No universal healthcare —No universal childcare —No paid family & medical leave —One of the highest rates of maternal mortality among rich nations This isn't about "life." It's about control.
English
6.8K
93.2K
325.4K
0