
Maxim Makatchev
437 posts

@maxipesfix
founder of https://t.co/yK3uD96q4s AI's next UI. conversational AI blog: https://t.co/fACup81SvP


.@maxipesfix forked the open source audio Smart Turn model and added video! Smart Turn is a "turn detection" model, used in a conversational agent to decide when the agent should respond. The model, training data, and training code are all completely open source. When we built the first version of Smart Turn, enabling this kind of extension and collaboration is exactly why we wanted to make everything open source. Maxim's blog post is super useful to read if you're interested in training multimodal models. It describes the design choices and technical details (3D ResNet, late fusion, two-stage training, inference runs on GPU in ~100ms). And all the code is available in the GitHub repo. Really great work.









@amir_harati Unfortunately, the choice of having your largest fonts nearly as small as X's smallest fonts leaves me out as a potential user.

Imagine you’re Chuck Schumer watching this









feeling really bad for the Meta OS team

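Will "keep a distance" from shops that use image-generation AI: an "anti-AI declaration" by Numazu City's officially recognized VTuber stirs controversy; the city says its "views differ" itmedia.co.jp/aiplus/article…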




