Alex

393 posts

Alex

Alex

@theAlexFerrari

infovore/LLMs whisperer/"his mind teeming"

The Hague, The Netherlands Katılım Mayıs 2020
891 Takip Edilen243 Takipçiler
Alex
Alex@theAlexFerrari·
@thekitze I don't know man, don't expect the same level of insights
English
0
0
0
905
kitze 🛠️ tinkerer.club
wow, 3 years my life in a zip file time to nuke my chatgpt chats, history, and memory it's been a fun few years but i'm going fully local for chat cloud models for coding only from now on
kitze 🛠️ tinkerer.club tweet media
English
31
5
307
27.8K
Alex
Alex@theAlexFerrari·
What does this mean in the end? For daily tasks, keep on using MiniMax-M2.7. When you need analysis that require deeper thinking (and you need to trust the results) use Gemini 3 Flash (and Pro when the quality of the results is pivotal)
English
0
0
0
39
Alex
Alex@theAlexFerrari·
Gemini 3 Flash is a worse agent than MM-M2.7 and anyone that used Gemini CLI can attest to that. Until Gemini closes that gap, their very good models will have to eat sand of cheaper less intelligent but more effective Chinese counterparts.
Alex tweet media
English
1
0
0
59
Alex
Alex@theAlexFerrari·
If you look at the Artificial Analysis Intelligence Index, MiniMax-M2.7 is many points above Gemini 3 Flash. Yet, the more I use MiniMax-M2.7 the less this seems to match my day-to-day experience 🧵
Alex tweet media
English
1
0
0
61
Winter
Winter@WinterArc2125·
Most people don’t realize this: You get 1,500 free daily requests to Gemma 4 31B on @GoogleAIStudio. That’s plenty of free inference (imo). And you can route it into @NousResearch Hermes Agent via Vercel’s AI Gateway: 1. Create an API key on Google AI Studio 2. Add it under BYOK (Google) in Vercel AI Gateway 3. Create a Vercel Gateway API key 4. In Hermes → select “Vercel AI Gateway” + your Google model Now all your Google model requests route through your free AI Studio quota. Basically: free 31B model access inside your agent stack. (Tradeoff: not as private as running locally)
English
40
125
1.8K
122.6K
Alex
Alex@theAlexFerrari·
If you are getting crappy markdown tables from Hermes Agent in Discord/Telegram, ask it to render the tables inside triple-backtick code blocks and you'll get properly rendered tables like in the screenshot
Alex tweet mediaAlex tweet media
English
0
0
2
347
Alex
Alex@theAlexFerrari·
@ivanfioravanti Ah ok, thank you, that makes sense. I love the writing style (and reasoning capabilities) of gemini. I so wished Google would allow us to use the subscription outside of its ecosystem (like OpenAI does) @OfficialLoganK
English
0
0
0
17
Alex
Alex@theAlexFerrari·
@ivanfioravanti what is your preferred way to "keep generating ton of tokens"?
English
2
0
1
54
Ivan Fioravanti ᯅ
Ivan Fioravanti ᯅ@ivanfioravanti·
One things is sure, Gemini Ultra usage limits are very large, because I keep generating ton of tokens and I never get any issue.
English
6
0
27
2.9K