Wojciech Galuba

274 posts

Wojciech Galuba

Wojciech Galuba

@wgaluba

Head of Data & Evals @Cohere | prev: multimodal and embodied AI at @MetaAI | founded @Meta’s A/B testing an AI annotation platforms | @ICepfl alumnus

London, England Katılım Nisan 2009
1.5K Takip Edilen593 Takipçiler
Sabitlenmiş Tweet
Wojciech Galuba
Wojciech Galuba@wgaluba·
To all the impacted at FAIR and GenAI: come and have a fresh start with us advancing the science of evals at Cohere. Links to this and many other opportunities in the 🧵👇 (DMs open as well!)
Wojciech Galuba tweet media
English
1
5
20
1.6K
Wojciech Galuba retweetledi
Eren Bali
Eren Bali@erenbali·
Every technical founder who had stopped coding 10 years ago
Eren Bali tweet media
English
183
356
7.9K
532.8K
Wojciech Galuba
Wojciech Galuba@wgaluba·
To all the impacted at FAIR and GenAI: come and have a fresh start with us advancing the science of evals at Cohere. Links to this and many other opportunities in the 🧵👇 (DMs open as well!)
Wojciech Galuba tweet media
English
1
5
20
1.6K
Devi Parikh
Devi Parikh@deviparikh·
Me: Joins a Google Meets call Granola: Want me to take notes?! Notion: Want me to take notes?! Gemini: Want me to talk notes?!
English
4
3
56
7.3K
Wojciech Galuba retweetledi
Ruben Hassid
Ruben Hassid@rubenhassid·
The world's leading AI research center completed the most comprehensive study ever on kids and AI. They surveyed 1,800+ children, parents, and teachers in UK. Here's what they found: (spoiler: children are outsmarting adults on AI)
Ruben Hassid tweet media
English
60
403
3.6K
645K
Wojciech Galuba retweetledi
sunny
sunny@thePiggsBoson·
The software to create the black hole in the movie 'Interstellar' is a full implementation of Einstein's equations in 40,000 lines of C++, and rendered thousands of 23-megapixel IMAX frames on a 32,000-core render farm at about 20 core-hours per frame: arxiv.org/pdf/1502.03808…
sunny tweet media
English
43
264
1.9K
191.5K
Wojciech Galuba retweetledi
Vals AI
Vals AI@ValsAI·
We just released our Finance Agent Benchmark! In this, we find that current models are falling short on research tasks. The best-performing model, @OpenAI's o3, achieves a mere 48.3% average accuracy. This is despite huge investments made into autonomous AI agents for finance, promising to ease analyst workloads. (1/8)
Vals AI tweet mediaVals AI tweet mediaVals AI tweet mediaVals AI tweet media
English
11
14
69
21.1K
Wojciech Galuba retweetledi
Hamel Husain
Hamel Husain@HamelHusain·
Only a matter of time before the job title “AI Scientist” emerges - Better than most AI Engineers at Evals, statistics, Data Analysis, Error Analysis, A/B testing, etc - Better than most Software Engineers at AI Engineering (😅 I hope we don’t need another job title )
JosH100@josh_wills

Data Scientist (n.): Person who is better at statistics than any software engineer and better at software engineering than any statistician.

English
39
34
383
76K
Wojciech Galuba retweetledi
Alex Volkov
Alex Volkov@altryne·
I'm a simple man, I want a local macOS chat client, where I can select any of the frontier LLMs and add as many MCP tools (or MCP toolboxes) and chat with them. I don't want to use ClaudeDesktop. I want to be able to use my Anthropic API keys instead. I don't want to use @windsurf_ai or @cursor_ai for this purpose, i just want a chat with MCP. Anything like this exist ? @tdinh_me @theo anyone has this already?
English
206
64
1.4K
405.8K
Wojciech Galuba
Wojciech Galuba@wgaluba·
This has been an exciting ride! @cohere Command A is out: open-weights model that is on par or better than GPT-4o and DeepSeek-V3 in many tasks with double the efficiency. Great foundation for building enterprise agents - and we are just getting started this year! links 🧵👇
Wojciech Galuba tweet media
English
1
10
60
3.6K
Wojciech Galuba retweetledi
OpenRouter
OpenRouter@OpenRouter·
.@cohere's Command R tripled in usage this week 🚀 It's now #2 for prompts about general tech
OpenRouter tweet media
English
8
26
181
23.4K
Wojciech Galuba retweetledi
Jack Rae
Jack Rae@jack_w_rae·
I'd love to watch a documentary on the rise and eventually fall-from-grace of MMLU, narrated by Morgan Freeman
English
9
5
67
17.3K