David Foster

419 posts

@davidADSP

Author of Generative Deep Learning: Teaching Machines how to Paint, Write, Compose and Play (O'Reilly), #generativeAI, Founding Partner of ADSP.

London, UK · Joined July 2019
577 Following · 777 Followers
David Foster
David Foster@davidADSP·
@mattshumer_ Yeah looks awesome - any idea how they calculated the $0.19-$0.49 PPM tokens? They say it's based on $2/hour H100 cost and serve rate of 0.03 ms / token I think?
0
0
1
116
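A rough sketch of the arithmetic behind the question above, using only the two figures quoted in the tweet ($2/hour per H100 and 0.03 ms per token, both of which the author flags as uncertain). The function name and the `num_gpus` parameter are hypothetical, added for illustration; nothing here reflects how the published prices were actually derived.

```python
def cost_per_million_tokens(gpu_cost_per_hour: float,
                            ms_per_token: float,
                            num_gpus: int = 1) -> float:
    """Dollar cost of serving one million tokens at a fixed serve rate.

    Assumes tokens are produced back to back at `ms_per_token` and that
    the only cost is GPU rental (hypothetical simplification).
    """
    seconds_per_million = 1_000_000 * ms_per_token / 1000.0
    dollars_per_second = gpu_cost_per_hour * num_gpus / 3600.0
    return seconds_per_million * dollars_per_second


# Figures quoted in the tweet: $2/hour, 0.03 ms/token, one GPU assumed.
single_gpu_cost = cost_per_million_tokens(2.0, 0.03)
print(f"${single_gpu_cost:.4f} per million tokens")
```

Read this way, one million tokens take 30 seconds of GPU time, which costs under two cents. That is well below the $0.19-$0.49 range, so the published figure presumably rests on assumptions the tweet does not state.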
Matt Shumer
Matt Shumer@mattshumer_·
Llama 4's price/perf looks absolutely incredible. And a 10M token context window? Insane. Assuming the vibes check out, we'll be switching over many of our systems to Maverick.
Matt Shumer tweet media
10
7
172
25.7K
Thomas Wolf
Thomas Wolf@Thom_Wolf·
what was this thing btw? "Moreover, ARC-AGI-1 is now saturating – besides o3's new score, the fact is that a large ensemble of low-compute Kaggle solutions can now score 81% on the private eval" big ensemble of heuristics?
6
3
26
9.2K
David Foster
David Foster@davidADSP·
@fchollet Out of interest @fchollet, what % of arc test set puzzles remain unsolved by any submitted solution? And what would the top 2 entries score if ensembled (I know this means they'd have 4 attempts). Just curious how much they overlap.
0
0
1
105
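The overlap question above can be made concrete with set operations. This is a minimal sketch on made-up puzzle IDs, not real ARC results: an ensemble of two entries counts a puzzle as solved if either entry solves it (hence four attempts in total), and a puzzle is unsolved only if neither does. The function name and the data are hypothetical.

```python
def ensemble_stats(solved_a: set, solved_b: set, all_puzzles: set) -> dict:
    """Fractions of the test set solved by the union, by both, and by neither."""
    union = solved_a | solved_b
    total = len(all_puzzles)
    return {
        "ensemble_score": len(union) / total,
        "overlap": len(solved_a & solved_b) / total,
        "unsolved": len(all_puzzles - union) / total,
    }


# Hypothetical data: 10 puzzles, two entries with partly overlapping solves.
puzzles = set(range(10))
entry_a = {0, 1, 2, 3, 4, 5}
entry_b = {3, 4, 5, 6, 7}
stats = ensemble_stats(entry_a, entry_b, puzzles)
print(stats)
```

A small overlap relative to each entry's own score would mean the two solutions fail on different puzzles, which is when ensembling helps most.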
François Chollet
François Chollet@fchollet·
Consulting my heart... Ok, looks like you haven't. But whenever you have a SotA (or close) solution built on top of the OpenAI API we're more than happy to verify it and add it to the public ARC Prize leaderboard. Anything using less than $10k worth of API calls is eligible.
Sam Altman@sama

@DavidSHolz @willdepue in your heart do you believe we’ve solved that one or no?

43
48
1.1K
180.6K
David Foster
David Foster@davidADSP·
@jsuarez @hirschibar Awesome write up! What about action masking - i.e. how do you handle cases where certain actions aren't possible (and the env returns you the mask at each timestep). Is this something PufferLib supports?
2
0
1
51
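For context on the question above, a common general way to apply an action mask is to set the logits of invalid actions to negative infinity before the softmax, so they receive zero probability. The sketch below is plain Python and is not PufferLib's API; whether PufferLib supports masking is exactly what the tweet asks. The function name is hypothetical.

```python
import math


def masked_softmax(logits, mask):
    """Softmax over `logits`, where mask[i] is False for invalid actions."""
    if not any(mask):
        raise ValueError("at least one action must be valid")
    # Invalid actions get -inf, so exp(-inf) == 0.0 removes them entirely.
    masked = [l if m else -math.inf for l, m in zip(logits, mask)]
    top = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(l - top) for l in masked]
    total = sum(exps)
    return [e / total for e in exps]


# The env reports that action 1 is not possible at this timestep.
probs = masked_softmax([1.0, 2.0, 3.0], [True, False, True])
print(probs)
```

Sampling from `probs` can never pick a masked action, and the remaining probabilities still sum to one.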
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@hirschibar It's just a list of discrete actions. Instead of 1 linear layer to output action, you have n layers. And then you just sum the losses for each
1
0
0
80
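The idea in the reply above (one output head per discrete action component, with the per-head losses summed) can be sketched as follows. The logits are taken as given, standing in for the outputs of the n linear layers, and plain negative log-likelihood stands in for whatever loss the actual training code uses. All names are hypothetical.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one head's logits."""
    top = max(logits)
    log_total = top + math.log(sum(math.exp(l - top) for l in logits))
    return [l - log_total for l in logits]


def multi_head_nll(head_logits, actions):
    """Sum over heads of the negative log-probability of the chosen action."""
    return sum(-log_softmax(logits)[a]
               for logits, a in zip(head_logits, actions))


# Two heads: one with 2 choices, one with 4, both with uniform logits.
head_logits = [[0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
loss = multi_head_nll(head_logits, actions=[1, 3])
print(loss)
```

Summing the per-head losses amounts to treating the heads as independent given the observation, so the joint log-probability of the full action is the sum of the per-head log-probabilities.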
Arena.ai
Arena.ai@arena·
Big News from Chatbot Arena! @01AI_YI's latest model Yi-Lightning has been extensively tested in Arena, collecting over 13K community votes! Yi-Lightning has climbed to #6 in the Overall rankings (#9 in Style Control), matching top models like Grok-2. It delivers robust performance in technical areas like Math, Hard Prompts, and Coding. Huge congrats to @01AI_YI! Meanwhile, GLM-4-Plus by Zhipu AI (@ChatGLM) has also entered the top 10, marking a strong surge for Chinese LLMs. They're quickly becoming highly competitive. Stay tuned for more! More analysis below👇
Arena.ai tweet media
Arena.ai@arena

Yi-Lightning is now in Chatbot Arena! The latest and most capable model from @01AI_Yi. Come chat and vote at lmarena.ai. The leaderboard will be updated soon.

14
45
273
164.3K
David Foster
David Foster@davidADSP·
@SullyOmarr Would you be willing to share the leaderboard from your evals?
1
0
1
267
Sully
Sully@SullyOmarr·
underrated: gemini 1.5 flash. overrated: gpt-4o. We really need better ways to benchmark these models cause lmsys aint it. stuff like cost, speed, tool use, writing, etc. arent considered. Most ppl just use the top model based on leaderboards, but it's way more nuanced than that
28
12
206
24.4K
Lucas Beyer (bl16)
Lucas Beyer (bl16)@giffmana·
You know what's my favourite part with our Gemma release? That we do not misuse the term "open source" like other labs have. It was explicit in the comms briefing that we should call them "open models" and not "open source models". Much respect to the team.
Lucas Beyer (bl16) tweet media
10
18
262
29.2K
David Foster
David Foster@davidADSP·
@NPCollapse Funny story - William Peebles co-authored the Mar 2023 Diffusion Transformer paper on which Sora is based, whilst at Meta as an intern. But then joined OpenAI last year to co-lead Sora. So I guess they did know how to do it, but let him leave 😂
0
0
3
644
Thomas Wolf
Thomas Wolf@Thom_Wolf·
almost 10 years in and I'm still listening to the soundtrack for Interstellar when I need to code some epic stuff. will it be ever topped
17
3
140
16.2K
David Foster
David Foster@davidADSP·
@realGeorgeHotz Given the current breakthroughs, "linguistics" is a left-field candidate 🤔
0
0
0
161
David Foster
David Foster@davidADSP·
@nickfloats Does the --iw parameter affect remixes? In the docs it says it doesn't, but I'm never sure how much to trust the docs :)
David Foster tweet media
0
0
1
103
Nick St. Pierre
Nick St. Pierre@nickfloats·
Remixing with images can give you even more control in Midjourney. You maintain more of the details and can do really fun things like turn group photos into animal balloon parties. A quick series of images, w/ a tutorial on how to do it at the end. It's actually super easy.
Nick St. Pierre tweet media
39
95
1.1K
484.7K
David Foster
David Foster@davidADSP·
@nickfloats Related question / challenge - how do you get Midjourney to output the usual meaning of 'fork in the road', rather than this? Changing the prompt to use different words isn't allowed 😃
David Foster tweet media
0
0
0
42
Nick St. Pierre
Nick St. Pierre@nickfloats·
Duck you and your stupid ducking AI
Nick St. Pierre tweet media
12
7
59
16.5K
Sully
Sully@SullyOmarr·
Someone should just use GPT4 to create an unbiased news agency. Feed it all the data and let it create news articles. Bonus point: you can let users chat with it as well, so they can ask questions. Now that i think of it, why hasn't anyone done this yet?
101
24
290
75.4K
Stability AI
Stability AI@StabilityAI·
Announcing StableLM❗ We’re releasing the first of our large language models, starting with 3B and 7B param models, with 15-65B to follow. Our LLMs are released under CC BY-SA license. We’re also releasing RLHF-tuned models for research use. Read more→ stability.ai/blog/stability…
Stability AI tweet media
67
859
3.5K
1.2M
Emad
Emad@EMostaque·
#StableLM ask me anything below 👇🏾
146
24
222
119.9K