Pavel Shtykovskiy

158 posts

Pavel Shtykovskiy

@framrus

Particle physics and astrophysics -» predicting ad clicks at Yandex -» spoken language understanding -» LLM-powered NPCs at https://t.co/A3vXLHYYGN

Berlin, Deutschland Присоединился Mayıs 2014

1.4K Подписки88 Подписчики

Pavel Shtykovskiy ретвитнул

Inworld AI@inworld_ai·21 Oca

Inworld TTS-1.5 releases today. The #1 TTS on Artificial Analysis now offers realtime latency under 250ms and optimized expression and stability for user engagement, and costs half a cent per minute. Some voice models are fast, some are expressive, some are affordable. We outperform them all across the board. Production-grade realtime latency: <250ms latency for Max model, <130ms for Mini (P90 first audio) - 4x faster than before. Voice agents now respond before users notice any delay. Engagement-optimized quality: 30% more expressive to serve a wider range of personalities and 40% lower word error rates for fewer hallucinations, word cutoffs, and audio artifacts. Built for consumer-scale: Radically affordable with enhanced multilingual support (15 languages including Hindi) and enhanced voice cloning, now via API. On-prem options now available for enterprises.

English

105

491

285.2K

Pavel Shtykovskiy ретвитнул

Inworld AI@inworld_ai·6 Kas

Our TTS Max model just debuted at #1 on the @ArtificialAnlys leaderboard. And at $10/million characters, it’s also the most cost-efficient commercial TTS model available. Excited to keep making state-of-the-art voice more accessible. Check it out at inworld.ai/tts or through our partners @pipecat_ai and @livekit.

Artificial Analysis@ArtificialAnlys

Inworld TTS 1 Max is the new leader on the Artificial Analysis Speech Arena Leaderboard, surpassing MiniMax’s Speech-02 series and OpenAI’s TTS-1 series The Artificial Analysis Speech Arena ranks leading Text to Speech models based on human preferences. In the arena, users compare two pieces of generated speech side by side and select their preferred output without knowing which models created them. The speech arena includes prompts across four real-world categories of prompts: Customer Service, Knowledge Sharing, Digital Assistants, and Entertainment. Inworld TTS 1 Max and Inworld TTS 1 both support 12 languages including English, Spanish, French, Korean, and Chinese, and voice cloning from 2-15 seconds of audio. Inworld TTS 1 processes ~153 characters per second of generation time on average, with the larger model, Inworld TTS 1 Max processing ~69 characters on average. Both models also support voice tags, allowing users to add emotion, delivery style, and non-verbal sounds, such as “whispering”, “cough”, and “surprised”. Both TTS-1 and TTS-1-Max are transformer-based, autoregressive models employing LLaMA-3.2-1B and LLaMA-3.1-8B respectively as their SpeechLM backbones. See the leading models in the Speech Arena, and listen to sample clips below 🎧

English

140

16.1K

Pavel Shtykovskiy ретвитнул

Chieh-Hsin (Jesse) Lai@JCJesseLai·29 Eki

Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core ideas that shaped diffusion modeling and explains how today’s models work, why they work, and where they’re heading. 🧵You’ll find the link and a few highlights in the thread. We’d love to hear your thoughts and join some discussions! ⚡ Stay tuned for our markdown version, where you can drop your comments!

English

493

2.4K

842K

Pavel Shtykovskiy ретвитнул

Nathan Lambert@natolambert·19 Ağu

Just signed a book deal for The RLHF Book, excited to make improvements to it this fall and get physical copies in your hands soon :) (rlhfbook dot com)

English

457

47.1K

Pavel Shtykovskiy ретвитнул

The NetHack Learning Environment@NetHack_LE·23 Tem

1 43.6 Grok-4-Wiz-AI-Cha died in The Dungeons of Doom on level 1. Killed by a housecat.

Davide Paglieri@PaglieriDavide

LLMs acing math olympiads? Cute. But BALROG is where agents fight dragons (and actual Balrogs)🐉😈 And today, Grok-4 (@grok) takes the gold 🥇 Welcome to the podium, champion!

English

4.5K

Pavel Shtykovskiy ретвитнул

Kevin Patrick Murphy@sirbayes·9 Ara

I am happy to announce that the first draft of my RL tutorial is now available. arxiv.org/abs/2412.05265

English

726

4.4K

320.5K

Pavel Shtykovskiy ретвитнул

Aleksey Tikhonov@altsoph·8 Ara

Earlier, we with @framrus developed a humor generation method that gives human-level results on blind tests. Now, we with @SaveTheRbtz are launching HUMOR-ARENA (humor.ph34r.me), generated humor labeling site with the models ranking, and the top of generated jokes. Blog-post: altsoph.medium.com/humor-arena-7e…

English

2.3K

Pavel Shtykovskiy ретвитнул

Simons Institute for the Theory of Computing@SimonsInstitute·5 Kas

simons.berkeley.edu/events/specula…

Simons Institute for the Theory of Computing tweet media

ZXX

155

26.8K

Pavel Shtykovskiy@framrus·1 May

@abacaj What happens with eval loss when train loss sharply decreases on the beginning of epochs 2 and 3? I saw multiple times how it jumps up on 2nd/3rd epochs start.. Do people keep training in such cases because it's good for final metrics?

English

318

anton@abacaj·1 May

This is exactly why I don't really mess with PEFT / Lora for fine tuning... even though full fine tune is a longer process gives you a better domain model

English

190

53.3K

Pavel Shtykovskiy ретвитнул

Chelsea Finn@chelseabfinn·17 Nis

Want to learn about meta-learning & few-shot learning? All of the latest lecture videos for Stanford CS330 are now online! youtube.com/playlist?list=… New topics in Fall '22 include: - self-supervised pre-training - large scale meta-optimization - domain adaptation & generalization

English

185

918

150.9K

Pavel Shtykovskiy ретвитнул

NeurIPS Conference@NeurIPSConf·13 Oca

You can now watch the recorded material from #NeurIPS2022 online without registration at: slideslive.com/neurips-2022

English

215

774

140.5K

Pavel Shtykovskiy ретвитнул

Karol Hausman@hausman_k·31 Eki

Our 2021 CS330 (cs330.stanford.edu/fall2021) lectures are online: youtube.com/playlist?list=… It was a pleasure to co-teach this class with @chelseabfinn. Topics incl. meta-learning, MTL, few-shot learning, deep RL (incl. multi-task, meta, goal-conditioned, hierarchical and offline RL)

English

419

Pavel Shtykovskiy ретвитнул

Sebastien Bubeck@SebastienBubeck·30 Haz

The video of my talk @EPFL_en today on Transformers and how to make sense of them is online! youtube.com/watch?v=brmidg…

YouTube

English

486

Pavel Shtykovskiy@framrus·2 Mar

@yaroslavvb @wtpayne2 @yandex @YandexAI But I agree with you that Yandex officials need to express their position stronger, otherwise it could be too late.

English

Pavel Shtykovskiy@framrus·2 Mar

@yaroslavvb @wtpayne2 @yandex @YandexAI This is not true any more. People are scared and not without a reason. Also what is scary is that gov position has high support in masses due to brain washing (not in Y., mostly among not well educated people).

English

Yaroslav Bulatov@yaroslavvb·2 Mar

Yandex is a key tool in shaping the alternative reality that allows Ukraine war to continue with popular support. Many people are associated with @yandex or @YandexAI and remain silent on the issue. Silence is complicity. meduza.io/news/2022/03/0…

English

Pavel Shtykovskiy ретвитнул

Soumith Chintala@soumithchintala·4 Şub

Fun read on why MLOps is still somewhat broken -- the engineers who build them are not users. In ML Frameworks, the authors were ML scientists -- (Py)Torch, Theano, Caffe, MXNet, Keras, Chainer, TF, etc. and that helped in design requirements accurately being in your head.

Yaroslav Bulatov@yaroslavvb

Bananas and ML infrastructure: I've asked around about cloud workflows, and most of the feedback had unhappiness with cloud tooling. This prompted a discussion in @chipro's MLops community -- why are MLops frameworks so bad? (1/9)

English

250

Pavel Shtykovskiy ретвитнул

Yaroslav Bulatov@yaroslavvb·4 Şub

English

365

Pavel Shtykovskiy@framrus·24 Oca

Nice blog post on distributed multi-GPU training of large models lilianweng.github.io/lil-log/2021/0…

English

Pavel Shtykovskiy ретвитнул

Sheldon Axler@AxlerLinear·27 Kas

Today the videos that I made to accompany my book Linear Algebra Done Right surpassed two million minutes of total viewing on YouTube. Those videos are freely available from the links at linear.axler.net/LADRvideos.html. #LinearAlgebra

English

496

2.7K

Pavel Shtykovskiy ретвитнул

Sebastien Bubeck@SebastienBubeck·3 Kas

Just watched an incredible talk by @AlexGDimakis at the Simons Institute, highly recommended. Their Iterative Layer Optimization technique to solve inverse problems with GANs make a LOT of sense! The empirical results on the famous blurred Obama face speak for themselves! 1/4

English

444

Открыть

@ArtificialAnlys @pipecat_ai @livekit @DrYangSong @gimdong58085414 @mittu1204 @StefanoErmon @SaveTheRbtz