Daniela Amodei

29 posts

Daniela Amodei

Daniela Amodei

@DanielaAmodei

President @AnthropicAI. Formerly @OpenAI, @Stripe, congressional staffer, global development

San Francisco, CA Katılım Eylül 2011
289 Takip Edilen23.4K Takipçiler
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
Introducing Claude 2! Our latest model has improved performance in coding, math and reasoning. It can produce longer responses, and is available in a new public-facing beta website at claude.ai in the US and UK.
Anthropic tweet media
English
228
500
2.3K
843K
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
Neural networks often pack many unrelated concepts into a single neuron – a puzzling phenomenon known as 'polysemanticity' which makes interpretability much more challenging. In our latest work, we build toy models where the origins of polysemanticity can be fully understood.
English
56
631
3.9K
0
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
In "Language Models (Mostly) Know What They Know", we show that language models can evaluate whether what they say is true, and predict ahead of time whether they'll be able to answer questions correctly. arxiv.org/abs/2207.05221
Anthropic tweet media
English
19
155
919
0
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
Transformer MLP neurons are challenging to understand. We find that using a different activation function (Softmax Linear Units or SoLU) increases the fraction of neurons that appear to respond to understandable features without any performance penalty. transformer-circuits.pub/2022/solu/inde…
Anthropic tweet media
English
10
71
380
0
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
In a new paper, we show that repeating only a small fraction of the data used to train a language model (albeit many times) can damage performance significantly, and we observe a "double descent" phenomenon associated with this. arxiv.org/abs/2205.10487
Anthropic tweet media
English
6
41
331
0
Daniela Amodei
Daniela Amodei@DanielaAmodei·
I’m looking forward to what’s to come. And we’re hiring! #careers" target="_blank" rel="nofollow noopener">anthropic.com/#careers
English
5
2
24
0
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
We've trained a natural language assistant to be more helpful and harmless by using reinforcement learning with human feedback (RLHF). arxiv.org/abs/2204.05862
Anthropic tweet media
English
3
50
265
0
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
In our second interpretability paper, we revisit “induction heads”. In 2+ layer transformers these pattern-completion heads form exactly when in-context learning abruptly improves. Are they responsible for most in-context learning in large transformers? transformer-circuits.pub/2022/in-contex…
English
1
57
305
0
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
Our first societal impacts paper explores the technical traits of large generative models and the motivations and challenges people face in building and deploying them: arxiv.org/abs/2202.07785
English
2
33
149
0
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
Our first interpretability paper explores a mathematical framework for trying to reverse engineer transformer language models: A Mathematical Framework for Transformer Circuits: transformer-circuits.pub/2021/framework…
English
3
116
609
0
Daniela Amodei retweetledi
Anthropic
Anthropic@AnthropicAI·
Our first AI alignment paper, focused on simple baselines and investigations: A General Language Assistant as a Laboratory for Alignment arxiv.org/abs/2112.00861
English
5
60
324
0
usmann
usmann@usmannk·
@DanielaAmodei @AnthropicAI Hi Daniela! Congrats on the launch, the mission and team look incredible. I sent an email to the address on that page and it bounced with an error saying it doesn't exist. Is there another address I can reach out to re: the Resident role? Thanks! cc: @nottombrown
English
1
0
2
0
Daniela Amodei
Daniela Amodei@DanielaAmodei·
Excited to announce what we’ve been working on this year - @AnthropicAI, an AI safety and research company. If you’d like to help us combine safety research with scaling ML models while thinking about societal impacts, check out our careers page #careers" target="_blank" rel="nofollow noopener">anthropic.com/#careers
English
12
26
200
0
Daniela Amodei
Daniela Amodei@DanielaAmodei·
We’re going to be focused on pushing forward our research for the next few months and are hoping to have more to share later this year. Thrilled to be working with so many talented colleagues!
English
7
1
26
0