Maxim Makatchev

437 posts

@maxipesfix

founder of https://t.co/yK3uD96q4s AI's next UI. conversational AI blog: https://t.co/fACup81SvP

San FranOsaka · Joined July 2014
1.3K Following · 368 Followers
Maxim Makatchev @maxipesfix
@pipecat_ai is our inspiration, as is the upstream Smart Turn repo, so the shout-out from @kwindla is much appreciated!
kwindla @kwindla

.@maxipesfix forked the open source audio Smart Turn model and added video! Smart Turn is a "turn detection" model, used in a conversational agent to decide when the agent should respond. The model, training data, and training code are all completely open source. When we built the first version of Smart Turn, enabling this kind of extension and collaboration was exactly why we wanted to make everything open source. Maxim's blog post is super useful to read if you're interested in training multimodal models. It describes the design choices and technical details (3D ResNet, late fusion, two-stage training, inference runs on GPU in ~100ms). And all the code is available in the GitHub repo. Really great work.

Maxim Makatchev retweeted
Amir Harati @amir_harati
I built a tool that researches ideas so you don't waste time building bad ones. Launching early access today (docs coming soon). Quick research + real citations. Free credits for all, bonus for early birds. Sign up, DM/email me your email. Link in the next tweet.⬇️
Maxim Makatchev @maxipesfix
Last week, just a day before Gemini 3 was released, @susuROBO helped run a 1-hour version of the @pipecat_ai x @GeminiApp hackathon at Osaka College of High Technology @osaka_hightech 大阪ハイテクノロジー専門学校. We used Pipecat's SmallWebRTC Prebuilt repo as a starting point (thanks @kwindla and @aconchillo!), which allowed even freshmen to finish the hour running a multimodal voice agent on their laptops. Ironically but not surprisingly, what they built in under an hour was in a few ways more advanced than the Alexa they got as prizes.
Maxim Makatchev @maxipesfix
A realization that dawned on me: you can only sell a product to the kind of people you can get to know in person. If the only people you meet are SEs and PMs, you'll likely only be able to sell your developer time, or perhaps a productivity tool.
Amir Harati @amir_harati
@maxipesfix It was a relatively small task :) and I do get complaints, but you are right about the priorities here.
Amir Harati @amir_harati
Some UI/UX elements (including font size) have been updated. I will add some help and update the landing page next.
Maxim Makatchev @maxipesfix

@amir_harati Unfortunately, the choice of having your largest fonts nearly as small as X's smallest fonts leaves me out as a potential user.

Maxim Makatchev @maxipesfix
@RockZhang Let me put it out there that Notta is the single most used and useful service for a foreign entrepreneur in Japan.
Maxim Makatchev @maxipesfix
Well that was a fun day and night of programming: Integrated Gemini Live Multimodal for voice convo, Gemini Flash for gesture recognition, and AIROID avatar into a @pipecat_ai pipeline for non-verbal user gesture recognition/response: youtu.be/0yOsHS209ww @AITinkerers
Maxim Makatchev @maxipesfix
Jump-starting an ecosystem takes a confluence of efforts from many directions. We timed the first AI Salon Kansai to coincide with @EDCON_Official and Global Startup Expo last Thursday. The result was a mix of local and international builders that reminded me of what was great about the meetups I joined in SF.
Maxim Makatchev @maxipesfix
Come to think of it, receiving some correspondence in Esperanto would be a nice touch.