Arjun Patrawala

23 posts

Arjun Patrawala

Arjun Patrawala

@arjunpatrawala

Berkeley

Katılım Ağustos 2017
302 Takip Edilen129 Takipçiler
Arjun Patrawala
Arjun Patrawala@arjunpatrawala·
Cartesia is building a muscle of developing and applying innovations orthogonal to model and data scale. We are excited to apply these ideas to an expanded set of domains in the coming months.
English
0
0
9
385
Arjun Patrawala
Arjun Patrawala@arjunpatrawala·
In January, @_albertgu got deeply involved in our TTS research program. Over the following 5 months, we developed ~3 innovations (no changes to data or model scale), each targeting a fundamental issue plaguing our prior models. The results:
Karan Goel@krandiash

We released Sonic-3.5 and Ink-2, the #1 streaming models for text to speech and speech to text you can use in your voice agents today. New architectures enable new frontiers for speed and quality. We're now the only provider to have #1 models for both speaking and listening.

English
3
4
51
9K
Karan Goel
Karan Goel@krandiash·
We've raised $100M from Kleiner Perkins, Index Ventures, Lightspeed, and NVIDIA. Today we're introducing Sonic-3 - the state-of-the-art model for realtime conversation. What makes Sonic-3 great: - Breakthrough naturalness - laughter and full emotional range - Lightning fast -
English
1.4K
1.2K
8.5K
4.9M
Arjun Patrawala
Arjun Patrawala@arjunpatrawala·
@alexwei_ When doing verifiable math problems, it's +EV for the model to guess an answer, even when unsure. I'd bet this "breakthrough" is fixing this fundamental flaw. E.g., update the reward to: 0 for wrong answer a for saying "idk" b for correct answer for some values, 0 < a < b
English
2
0
6
153
Alexander Wei
Alexander Wei@alexwei_·
On IMO P6 (without going into too much detail about our setup), the model "knew" it didn't have a correct solution. The model knowing when it didn't know was one of the early signs of life that made us excited about the underlying research direction!
Daniel Litt@littmath

One piece of info that seems important to me in terms of forecasting usefulness of new AI models for mathematics: did the gold-medal-winning models, which did not solve IMO problem 6, submit incorrect answers for it? 🧵

English
75
155
1.7K
281.7K
emma
emma@emguoz·
what i’ve been working on for the past year 🥹! excited for users to finally try it
Ivan Zhao@ivanhzhao

So for 5 years, “offline” has been the #1 request. Today, thanks to the perseverance of our engineering team, @NotionHQ finally works offline. Your ideas don’t need Wi‑Fi to exist! For Notion community: thank you for your patience while we built this right. This is a journey, I want to share what we had to invent to make this real... 1/n

English
16
4
144
19.2K
Arjun Patrawala
Arjun Patrawala@arjunpatrawala·
Programming languages with pedantic, correctness-oriented compilers (Rust, OCaml) seem good in the AI era. Hard to trust vibe-coded mallocs and frees lol
English
0
0
3
326
Arjun Patrawala
Arjun Patrawala@arjunpatrawala·
@alexwei_ Choosing good values of a and b allows you to select the "confidence" the model must have before presenting an answer. If a is close to b, the model must be quite confident because the risk of outputting the wrong answer is high and the reward is small.
English
0
0
1
96
Nakul Shenoy
Nakul Shenoy@nakulshn·
What if we RL trained the model to use chain of thought to do better at the pretraining next-token-prediction objective? Rather than a 0/1 verifiable domain?
English
1
0
0
88
Nick Jiang
Nick Jiang@nickhjiang·
Vision transformers have high-norm outliers that hurt performance and distort attention. While prior work removed them by retraining with “register” tokens, we find the mechanism behind outliers and make registers at ✨test-time✨—giving clean features and better performance! 🧵
Nick Jiang tweet media
English
16
131
1K
178.8K
Arjun Patrawala
Arjun Patrawala@arjunpatrawala·
@sea_snell It’s most interesting that we reached for the over-engineered solutions first (PRMs, human-labeled CoT, etc.) before trying the simple idea
English
0
0
3
136
Charlie Snell
Charlie Snell@sea_snell·
R1-zero is such a striking example of a discovery that’s blatantly obvious in retrospect, yet eluded so many for such a long time
English
39
73
1.7K
285.2K
Arjun Patrawala
Arjun Patrawala@arjunpatrawala·
Spotted at the door of berkeley artificial intelligence research today
Arjun Patrawala tweet media
English
4
0
19
1.4K
Akhand Dugar
Akhand Dugar@akspaks9·
How do you put emphasis on the letter ‘I’ (i) when you’re typing.
English
1
0
2
0
Akhand Dugar
Akhand Dugar@akspaks9·
I hate it when people do the thing where they use their teeth to take the food off the fork/spoon instead of their lips. Like brosef. That’s why Charles Darwin gave us lips.
English
3
1
7
0
Akhand Dugar
Akhand Dugar@akspaks9·
If I was a talented artist I would draw fake stills that look like they’re from old Disney movies to confuse people
English
1
0
3
0