Brea Browne

147 posts

Brea Browne banner
Brea Browne

Brea Browne

@bb_turing_ai

Making the impossible - POSSIBLE

Newport Beach Katılım Haziran 2023
336 Takip Edilen55 Takipçiler
Brea Browne retweetledi
Overworld
Overworld@overworld_ai·
A portal is opening to your very own world... 🧵
GIF
English
2
6
56
26.8K
Brea Browne
Brea Browne@bb_turing_ai·
Fantastic work team - proud to share this Cogito v2.1 was trained on @nebiusai 🦾🦾🦾
Drishan Arora@drishanarora

4/ For Cogito v2.1, we fork off the open-licensed Deepseek base model from November 2024. This is an obvious choice for a pretrained base model, as Deepseek architecture has an ecosystem of cheap inference built around it. We have built a frontier training stack, while being an early stage startup, since we can stand on the shoulders of open source champions like @huggingface, @togethercompute, @runpod and @nebiusai, as well as stellar contributions by @Microsoft, @Meta, @nvidia and a lot of other folks in open source. Over the last months, we have iterated and refined our post-training strategies of self-play + RL (called Iterated Distillation and Amplification - IDA) with Cogito v1 and v2. You will see high-quality responses from Cogito v2.1 while being a bit different from usual models - we increase the model’s intelligence prior and teach it how to think via process supervision. So there are significantly shorter reasoning chains for the responses. We also use less markdown, less verbosity. In short, we want to make the model great for API usage - faster, fewer tokens with super high quality.

English
0
0
1
119
Brea Browne
Brea Browne@bb_turing_ai·
teaching models how to search, not just what to predict - brilliant work @drishanarora
Drishan Arora@drishanarora

It is intuitively easy to understand why self play *can* work for LLMs, if we are able to provide a value function at intermediate steps (although not as clearly guaranteed as in two-player zero-sum games). In chess / go / poker, we have a reward associated with every next move, but as Noam points out, natural language is messy. It is hard to define a value function at intermediate steps like tokens. As a result, in usual reinforcement learning (like RLVR), LLMs get a reward at the end. They end up learning to 'meander' more for hard problems. In a way, we reward brute forcing with more tokens to end up at the right answer as the right approach. However, at @DeepCogito, we provide a signal for the thinking process itself. Conceptually, you can imagine this as post-hoc assigning a reward to better search trajectories. This teaches the model to develop a stronger intuition for 'how to search' while reasoning. In practice, the model ends up with significantly shorter reasoning chains for harder problems in a reasoning mode. Somewhat surprisingly, it also ends up being better in a non-thinking mode. One way to think about it is that since the model knows how to search better, it 'picks' the most likely trajectory better in the non-thinking mode.

English
0
0
1
72
Brea Browne retweetledi
Roman Chernin
Roman Chernin@romanchernin·
Guess how long after signing the pivotal deal it took our CEO to ask: “Why hasn’t this (relatively small) new customer received an answer yet?” Less than 1 hour. That’s the obsession we need to keep as we build a true multi-customer cloud. The MSFT deal is the fuel for growth.
English
22
30
450
27.1K
Brea Browne retweetledi
Louis Castricato @ lovecraftian horrors
Anyone at ICML wanna see the future of world models? We're walking around with a laptop running our world model at 500 FPS+ (fully local). Would love to demo/chat with anyone interested
English
4
5
45
6.4K
Brea Browne retweetledi
Overworld
Overworld@overworld_ai·
We'll be at CVPR tomorrow! Interested in chatting about open science diffusion world models? Reach out!
English
0
8
18
4.5K
Nick Davidov
Nick Davidov@Nick_Davidov·
Grammarly just raised $1B for AI. Here's a fireside with Grammarly's CEO @shishirmehrotra from our AI Rabbit Hole conf from several weeks ago on how they see their product changing the future of work (link below)
English
4
0
6
950
Kylie Robison
Kylie Robison@kyliebytes·
SCOOP: OpenAI is working on its own X-like social network, according to multiple sources familiar with the matter. While the project is still in early stages, we’re told there’s an internal prototype focused on ChatGPT’s image generation that has a social feed.
Kylie Robison tweet media
English
186
220
2.1K
622.6K
Brea Browne
Brea Browne@bb_turing_ai·
If you can't tell from my "turing" handle (aka Alan Turing) I am a philospher at heart - and am certain #tech, #ai, and #philosophy are all apart of the same conversation. Recently I wrote this think piece. Would love to hear your thoughts: linkedin.com/posts/brownebr…
English
0
0
3
56
Brendan Iribe
Brendan Iribe@brendaniribe·
The open source Conversational Speech Model is out. We shared the 1B base model for everyone to build on. Links below.
English
116
191
2.5K
309.7K
Brea Browne
Brea Browne@bb_turing_ai·
Packing up to head to @NVIDIAGTC - what's the thing you can't go to a conference without? Besides caffiene of course 😅 #GTC25
English
0
0
0
51
Brea Browne retweetledi
Nebius
Nebius@nebiusai·
We’re here at @NVIDIA #GTC25 in San Jose! Our first tech talk — on how we rebuilt our AI Cloud from scratch — takes place at 3:00 PM today. Join us in room SJCC 212A. Tomorrow, the expo halls open. Stop by booth 809 to meet our leaders and tech experts!
English
1
36
94
4.6K
Brea Browne
Brea Browne@bb_turing_ai·
✨ Gartner IOCS 2024 is just around the corner! Connect with @weka's on how to unlock new levels of performance, efficiency, and cost savings. 📅 Dec. 10–12, 2024 📍The Venetian, Las Vegas, NV 🔗 Book your meeting now! 👇 #GartnerIOCS sprou.tt/1ErLVsBnL7n
English
0
0
1
45
Brea Browne
Brea Browne@bb_turing_ai·
Byte-Sized Joke Thursday! 🧀 Why did the legacy data platform lose the race? 🏁 Because it couldn’t match @weka’s speed! 🚀 WEKA is setting new benchmarks for #AI and #HPC workloads with record-breaking performance and unmatched efficiency. 💡 sprou.tt/17IyXRxA1hT
English
0
0
1
32
Brea Browne
Brea Browne@bb_turing_ai·
Headed to #SC24 in Atlanta?! Join @weka for a night of F-U-N and see Jimmy Eat World Preform! Register now 👇🎟️
English
0
0
3
60
Brea Browne
Brea Browne@bb_turing_ai·
🚀 Ready to turbocharge your #AI game? Check out @weka's latest data platform appliances—WEKApod Nitro & WEKApod Prime! Built to fuel your AI innovation and keep your data moving at lightning speed. ⚡️ 🔗 weka.io/data-platform/…
English
0
0
2
34