Soham

627 posts

@sohamg121

Research Scientist @MistralAI on multimodal/audio LLMs. Previously: @GoogleDeepMind. MS @CarnegieMellon.

Mountain View, CA · Joined August 2013
1.2K Following · 436 Followers

Soham reposted
Alankar Jain @alankarjain91
Two truths, no lies:
- Most AI models are insanely powerful.
- Most people use a tiny fraction of that power.
Closing that gap isn't about better models, it's about better products. NextToken puts AI's power in your hands, for your day-to-day work.
1 reply · 1 repost · 4 likes · 103 views

Soham reposted
Guillaume Lample @ NeurIPS 2024 @GuillaumeLample
Our first speech model, Voxtral TTS, is out. It delivers SOTA performance while significantly reducing cost compared to existing solutions, and it operates with very low latency.

It uses a new architecture that combines auto-regressive generation of semantic speech tokens with flow-matching for acoustic tokens. We are also releasing a technical report sharing all our training methodology and insights.

Much more to come in audio -- stay tuned!
Guillaume Lample @ NeurIPS 2024 tweet media
Mistral AI @MistralAI

🔊 Introducing Voxtral TTS: our new frontier open-weight model for natural, expressive, and ultra-fast text-to-speech
🎭 Realistic, emotionally expressive speech.
🌍 Supports 9 languages and accurately captures diverse dialects.
⚡ Very low latency for time-to-first-audio.
🔄 Easily adaptable to new voices

28 replies · 53 reposts · 695 likes · 45.8K views

Soham reposted
Mistral AI @MistralAI
🔊 Introducing Voxtral TTS: our new frontier open-weight model for natural, expressive, and ultra-fast text-to-speech
🎭 Realistic, emotionally expressive speech.
🌍 Supports 9 languages and accurately captures diverse dialects.
⚡ Very low latency for time-to-first-audio.
🔄 Easily adaptable to new voices
210 replies · 614 reposts · 4.6K likes · 869.9K views

Soham @sohamg121
@jachiam0 There is no way this happens through closed-source AI built by profit-chasing enterprises, and your CEO's offhand comments are a testament to that. I do believe in the potential promise of AI, as you say, but one cannot ignore selfish human motivations.
0 replies · 0 reposts · 2 likes · 60 views

Joshua Achiam @jachiam0
We're entering the phase of AI politics where society will intensely debate whether it is a good idea to build AI at all. Builders need to make the case. The way I see it, AI is our best chance to defeat hunger, want, death, and war. It's a moral imperative to try.
130 replies · 23 reposts · 294 likes · 96.2K views

Soham @sohamg121
@giffmana did it debug the right way, i.e. put a bunch of print statements and run the code again and again?
0 replies · 0 reposts · 0 likes · 110 views

Lucas Beyer (bl16) @giffmana
Well damn, it was bound to happen and this morning it happened. There's a big chunk of code touching many pieces that i know in depth because i proudly hand-crafted it all. But it has one bug that i wasn't able to pinpoint even after an hour of debugging yesterday evening.

This morning i resume debugging, but after another 10min of being none the wiser i decided to shoot a prompt to Opus4.6 and let it search while i continue debugging. A mere 1min51s later, Opus actually found the bug, in a file that i didn't even consider looking at during my two debugging sessions.

This marks the first time an LLM found the bug in my code faster than me. I've tried this many times in the past, and i was always faster or ~same.

FWIW the root cause was simple: np.prod(tuple_of_pyints) returns a np.int64, not a python int. Finding this as the root cause of my bug is what was not simple: i didn't even mention that code part in my prompt because i didn't consider it.
33 replies · 20 reposts · 762 likes · 63.7K views
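The root cause Lucas describes is easy to reproduce. A minimal sketch of the pitfall (the `shape` tuple and variable names here are illustrative, not from his code):

```python
import math

import numpy as np

shape = (4, 8, 16)  # a tuple of plain Python ints

n = np.prod(shape)
print(type(n))             # a NumPy integer scalar (np.int64), not a builtin int
print(isinstance(n, int))  # False on Python 3: np.int64 does not subclass int

# Anything that strictly type-checks for int (serialization, C extension
# APIs, some config validators) will now misbehave. Two easy fixes:
print(isinstance(int(n), int))            # True: convert explicitly
print(isinstance(math.prod(shape), int))  # True: math.prod keeps builtin ints
```

`math.prod` (Python 3.8+) is the drop-in choice when the inputs are plain Python ints and you want the result to stay one.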
Soham reposted
Mistral AI for Developers @MistralDevs
Since we launched Voxtral Realtime, the community response has been remarkable. Today, we share the technical report, launch the Realtime playground in Mistral Studio, and release the model in Hugging Face Transformers. 🧵
Mistral AI for Developers tweet media
26 replies · 72 reposts · 555 likes · 92.6K views

Soham @sohamg121
@asharoraa mental note to not use WhatsApp as my non-work todo tracker
1 reply · 1 repost · 2 likes · 1.5K views

Soham reposted
Guillaume Lample @ NeurIPS 2024 @GuillaumeLample
🚀 We are hiring Research Interns! We are looking for Master's and PhD students (final year) with a strong background in AI / ML / NLP who want to work on cutting-edge AI systems alongside Mistral AI researchers.
📍 Location: Paris, London, or Palo Alto
⏳ Duration: 4-6 months
🎓 For Master's students: opportunity to continue with a PhD at Mistral after the internship (via the CIFRE program).
Links below:
33 replies · 88 reposts · 1K likes · 88.6K views

Soham @sohamg121
@agihippo how about you just carry a 1.5 kg thing in a backpack man, it's really not that troublesome lol
0 replies · 0 reposts · 1 like · 563 views

yi @agihippo
I think we should normalize issuing multiple corp laptops for ai researchers. Some people live in multiple residences or work from many diff desks. It's cumbersome to bring your laptop everywhere so sometimes we just don't bring it around. The added convenience of checking in on your job saves much more money than one more MBP. This should apply to all technical staff but even more so ai researchers because the impact of checking in on your jobs more frequently can have high ROI.
16 replies · 2 reposts · 106 likes · 37.3K views

Soham @sohamg121
@levelsio anyone claiming they know Google is going to win purely because they have "YouTube" data has no idea how careful Google is with content licensing and how shit most of YT is.
0 replies · 0 reposts · 0 likes · 9 views

@levelsio
So I bought over $1M in Google today. Kinda crazy but also not so crazy.

I've been the biggest Google hater for years, it was completely mismanaged, destroyed by politics and lack of any leadership, fumbled inventing Transformers etc. Then Sergey returned and suddenly Google is dominating not just in the AI benchmarks and leaderboards but in real usage. AI benchmarks can and are easily rigged. But me running an AI startup and always wanting to use the best models makes me conclude something basic now: it's really just Google and Elon Musk and the Chinese in the end who will probably win. The models I use are all by either Google, xAI, or the Chinese (ByteDance, Kling, Minimax).

As you know Google now has its own chips (TPUs), Google has the biggest data set in video (YouTube), images (Google Images) and generally the web (for LLMs), still one of the biggest general user bases (Google Search etc), and they finally have a real engineer being the de facto CEO now (Sergey Brin). Elon Musk with xAI you can't bet against cause he simply has the sheer willpower to get things done. The Chinese are similar, sheer willpower and they don't sleep and they really want to win, and companies like ByteDance (TikTok) have massive data sets in video too of course.

In my opinion everyone is still staring too much at LLMs, I've always been more interested in image models, video models and now the nascent 3d and world models, that's where it's going and where we'll be able to prompt entire worlds or apps or whatever, it's hard to imagine WHAT exactly. With my app Photo AI I try to be a little part of that journey there of course.

Now I can't invest in xAI, I'm a bit invested in the Chinese via the ICHN ETF, but of course Google anyone can invest in and so I think I should. I've reduced my Nvidia investments already months ago, as it was inevitable there'd be real competitors to their chips at some point, with Google's TPUs there are now.

I'm not an expert, and you should mostly just buy ETFs, and you shouldn't listen to me and this is not financial advice.
@levelsio tweet media
@levelsio

This is Sergey Brin's yacht. He got so bored of sitting on this $450M yacht that he had to get out and go create things again. The only true long-term satisfaction for man is to create, either things, or babies.

535 replies · 246 reposts · 6.7K likes · 2.8M views

Soham @sohamg121
@AnjneyMidha Will this actually attract the same level of talent that GovTech in Singapore did (with salaries that matched/beat private companies)?
0 replies · 0 reposts · 0 likes · 215 views

Anjney Midha @AnjneyMidha
Alumni from this program will go on to build trillion dollar companies in the coming decade
Scott Kupor @skupor

Your government needs YOU to transform the federal government through modern software development. If you’re up for a huge challenge, join 1,000 of the country’s best and brightest technologists in the inaugural class of @USTechForce. We are partnering with the top U.S. technology companies to take on this challenge. You’ll learn a ton, network across the most important government agencies and private sector companies, ultimately creating powerful career opportunities whether you want to continue in public service or join the private sector. I am grateful to @POTUS for ensuring that America remains the world’s technology leader. Go to TechForce.gov to apply today.

1 reply · 0 reposts · 42 likes · 15.5K views

Soham @sohamg121
@rasbt @haider1 LaMDA was decoder-only. Encoder-only models had their own place, and Google explored both encoder-decoder and decoder-only architectures. OpenAI _did_ run with it.
0 replies · 0 reposts · 0 likes · 460 views

Sebastian Raschka @rasbt
@haider1 I am not sure "OpenAI ran with it" is entirely correct. I remember there was quite some rivalry between Google's encoder approach and OpenAI's push for decoder-style models. Google tried to make encoder-style models work for many years, since the original architecture.
11 replies · 2 reposts · 257 likes · 29.2K views

Haider. @haider1
Sergey Brin admits Google messed up by under-investing in the transformer architecture it invented. Google was too scared to release chatbots that "say dumb things", so it under-invested in scaling compute: "we didn't take it very seriously... and OpenAI ran with it"
109 replies · 355 reposts · 5.1K likes · 1.3M views

Val @onetwoval
imagine if twitter had slack emotes
5 replies · 0 reposts · 19 likes · 3.1K views

Soham @sohamg121
@_arohan_ @Miles_Brundage What's shameful about using OSS models to bootstrap synthetic data? (I'm assuming that's what they mean, and not logit distillation)
0 replies · 0 reposts · 0 likes · 79 views

rohan anil @_arohan_
Distilling news and billion dollar equity packages 6 months ago, make it make sense. Either whatever is reported is complete nonsense or it’s very over.
1 reply · 2 reposts · 69 likes · 16.1K views

Soham reposted
Mistral AI @MistralAI
Introducing the Devstral 2 coding model family. Two sizes, both open source. Also, meet Mistral Vibe, a native CLI, enabling end-to-end automation. 🧵
174 replies · 458 reposts · 3.5K likes · 1.8M views

Nyanpasu @NyanpasuKA
we can safely say lmarena is saturated?
Arena.ai @arena

🚨 BREAKING: Text Leaderboard Update
🐳 Deepseek-v3.2 enters the leaderboard at #38, and Deepseek-v3.2-thinking lands at #41. For comparison, previous versions ranked higher:
🔹 v3.2 ranks #38 (–5 pts vs v3.1 and –14 pts vs v3.2-exp)
🔹 v3.2-thinking ranks #41 (–7 pts vs v3.1-thinking and –5 pts vs v3.2-exp-thinking)
Both models show their biggest gains in Legal by rank, with improvements of +28 points for v3.2 and +19 points for v3.2-thinking when compared to v3.1 predecessors. The largest drop appears in Healthcare, where v3.2-thinking falls by 25 points.
Where v3.2 performs strongest (among open models):
🔹 #1 in Math and Legal
🔹 Top 10 in Multi-Turn, Media, and Business
Where v3.2-thinking performs strongest (among open models):
🔹 #1 in Science
🔹 Top 5 in Legal
These updates reflect @deepseek_ai's ongoing work to expand and refine its open source model family.

7 replies · 0 reposts · 28 likes · 4.4K views

Soham @sohamg121
@docmilanfar There were more, also funny that they are exactly 1000 apart
Soham tweet media
0 replies · 0 reposts · 4 likes · 2.8K views

Soham @sohamg121
@aurielws this is giving me Gradient Canopy nostalgia
0 replies · 0 reposts · 1 like · 66 views

Auriel @aurielws
Drink names for today’s event courtesy of Gemini 3 💅 🍸
Auriel tweet media
6 replies · 1 repost · 17 likes · 781 views