Alexander Doria

46.1K posts

Alexander Doria banner
Alexander Doria

Alexander Doria

@Dorialexander

building open ai infrastructure @pleiasfr

Katılım Nisan 2011
4K Takip Edilen23.2K Takipçiler
Sabitlenmiş Tweet
Alexander Doria
Alexander Doria@Dorialexander·
And so important news: we're launching an early beta access to the synth pipelines that originally created SYNTH and have been further refined and enhanced through the last few months.
Alexander Doria tweet media
English
7
12
172
9.1K
Alexander Doria
Alexander Doria@Dorialexander·
@advait_jayant Yes. To some extent they had surprisingly dated prior (also over scaling: highly sparsed MoE were not what they had in mind)
English
0
0
0
8
Advait
Advait@advait_jayant·
@Dorialexander funnily enough ai-2027 got itself backwards on china. it had the ccp nationalizing everything into a single megalab next to a nuclear plant. the opposite happened. deepseek, qwen, kimi, minimax, and as you pointed out even meituan and xiaomi are all shipping their own models!
English
1
0
1
58
52dsl
52dsl@52dsl·
@Dorialexander Très intéressant. Merci ! (il doit cependant manquer un bout à cette phrase : "By 2023, OpenAI and Anthropic had an early lead in model development but nothing that would prevent...")
Français
1
0
1
21
Alexander Doria
Alexander Doria@Dorialexander·
@ed_brz9 @jackson_stokes Thought briefly about that but disagree now. Memory is actually needed for many agentic processes (basically model needs to understand what is being asked, and how to look for things)
English
1
0
1
5
Ed Brz9
Ed Brz9@ed_brz9·
@Dorialexander @jackson_stokes For now I just do synth data and fine tune so I’m not at all qualified to talk about model architecture, but do LLM need to know much to be useful? I feel like more weight dedicated to attention could benefit agentic capabilities. Near zero knowledge models that can use tools
English
1
0
0
5
Jackson Stokes
Jackson Stokes@jackson_stokes·
There seems to be a ~3B lower limit for useful LLMs. below that, instruction following and ICL drop off a cliff? Is there some fundamental reason for this?
English
8
2
55
10.8K
Alexander Doria
Alexander Doria@Dorialexander·
and here is anthropic soft power: blessed be circuit transformers (and the data that feed it).
Alexander Doria tweet media
English
0
2
13
755
Alexander Doria
Alexander Doria@Dorialexander·
great seeing the pope supporting tokenizer research.
Alexander Doria tweet media
English
2
1
38
1.6K
Alexander Doria
Alexander Doria@Dorialexander·
@Noahpinion Just creating conditions for proper open-ended research (the kind OpenAI has just started to emulate) and selecting for people willing to do that.
English
1
0
2
438
Alexander Doria
Alexander Doria@Dorialexander·
@JulienBlanchon @bastiengares (les documents vraiment corpos dans certains domaines c'est totalement galère à localiser en webcralw et les quantités sont pas là : je comprends la logique de racheter des données de boîte)
Français
1
0
1
49
Alexander Doria
Alexander Doria@Dorialexander·
@JulienBlanchon @bastiengares Moi c'est un peu les échos que j'ai eu côté Anthropic (et de certains fournisseurs RL) : migration d'environnements contraints avec pas mal de connaissance métiers vers des workflow plus ouverts de la donnée en gros, moins structuré.
Français
1
0
1
51
Rasmus
Rasmus@synquid·
Synthetic environments is definitely the next big thing
English
1
0
4
741
Alexander Doria
Alexander Doria@Dorialexander·
The upside: frontier labs have until now been suspiciously advancing in directions where they had deep internal expertise at the core. As we move toward open-endedness, this might be the actual bottleneck.
English
0
0
16
919
Alexander Doria
Alexander Doria@Dorialexander·
@menhguin Meanwhile I’m still waiting for 100B compute investments in France…
English
0
0
5
242
Minh Nhat Nguyen
Minh Nhat Nguyen@menhguin·
it's wild that 1. Chinese frontier AI labs have roughly 1/20th the funding and revenue of American frontier labs 2. American frontier labs consider Chinese labs only 6-12 months behind
English
14
9
326
22.4K
Alexander Doria
Alexander Doria@Dorialexander·
@thkostolansky The kind of fresh take we have seen with OpenAI Erdös could get interesting at an industrial scale.
English
0
0
2
114
Alexander Doria
Alexander Doria@Dorialexander·
@thkostolansky many current blocks are conceptual rather than experimental and general feel of standstill for the last decades.
English
1
0
4
350
Alexander Doria
Alexander Doria@Dorialexander·
To be honest things will really get interesting when we’ll start tackling open ended problems in physics.
English
26
17
364
16.3K