Mateusz Mirkowski

3.8K posts

@llmdevguy

Autonomous agents, agentic engineering
Building & testing agentic systems
Exploring local LLMs

Remote work evangelist

Joined March 2013

150 Following 1.8K Followers

Pinned Tweet

Mateusz Mirkowski@llmdevguy·Apr 27

x.com/i/article/2048…

Mateusz Mirkowski@llmdevguy·Jul 16

@ivanfioravanti @MiniMax_AI @Zai_org Come to Poland finally! 😇

Ivan Fioravanti ᯅ@ivanfioravanti·Jul 16

Planning (at least) three weeks in China in September with my wife and young daughter, who speaks Chinese! ✈️ I hope to be able to visit at least the @MiniMax_AI and @Zai_org labs while there 🙏

Mateusz Mirkowski@llmdevguy·Jul 13

The 5h limits are temporarily gone in GPT. Please stay this way forever. 🙏

Mateusz Mirkowski@llmdevguy·Jul 11

WTF, ChatGPT, is this a joke? I am on a paid plan...

Mateusz Mirkowski@llmdevguy·Jul 10

@jun_song The GPT results look very GLM- or Kimi-style. :D

Jun Song@jun_song·Jul 9

Fable Ultracode vs GPT 5.6 Sol Ultra: frontend design comparison test. First website is from Fable. Fable took 5min vs GPT took 30min. Which one do you think is better?

Mateusz Mirkowski@llmdevguy·Jul 10

🔥First impressions of GPT-5.6:
Sol:
- Eats limits way too fast
- Much better for security checking than 5.5
- Way better for design
Terra:
- Slightly better than 5.5
- Good balance between results and costs
Luna:
- Cheap and fast for easy tasks

Mateusz Mirkowski@llmdevguy·Jul 9

@NielsRogge Something went wrong. ;(

Niels Rogge@NielsRogge·Jul 9

"Grok 3 will be made open source in about 6 months." - Elon Musk, August 2025

Elon Musk@elonmusk

The @xAI Grok 2.5 model, which was our best model last year, is now open source. Grok 3 will be made open source in about 6 months. huggingface.co/xai-org/grok-2

Mateusz Mirkowski@llmdevguy·Jul 9

@jun_song No difference 😛

Jun Song@jun_song·Jul 9

GPT-5.5-Extra high vs GPT-5.6-Sol-Ultra: frontend design comparison test. Same prompt: "Build a website about local AI". GPT-5.5 took 10min, and 5.6 took 30min. Do you see the big improvement?

Mateusz Mirkowski@llmdevguy·Jul 9

@outsource_ hilarious

Eric ⚡️ Building...@outsource_·Jul 9

Can't make this up: CLAUDE RESET USAGE after GPT 5.6 SOL dropped LOL

Mateusz Mirkowski@llmdevguy·Jul 8

Bye bye, open-source models. 😢

Mateusz Mirkowski@llmdevguy·Jun 26

Orinth - the new local LLM king.

Mateusz Mirkowski@llmdevguy·Jun 22

Fugu is not a new model. It’s more like a proxy to many different models, such as GPT 5.5 or Opus 4.8.

Mateusz Mirkowski@llmdevguy

Remember, Fugu is a dangerous fish. 😄

Mateusz Mirkowski@llmdevguy·Jun 22

Remember, Fugu is a dangerous fish. 😄

Mateusz Mirkowski@llmdevguy·Jun 19

@loktar00 I was expecting 12-15 t/s. At least 25 would be good I think.

Loktar 🇺🇸@loktar00·Jun 18

GLM 5.2 Q2_xxxs on 6x 3090s with 256 GB DDR4. After an hour this is what we got (the prompt was an infinite miner; apparently it forgot the enemies, treasures, etc.). Ended up at 12.5 tk/s when all was said and done. Will try to offload to the 5090s and BC-250s next just to see.

Mateusz Mirkowski@llmdevguy·Jun 18

🚀You end the day with 19% of your limit left. You wake up in the morning, and the limit has reset. Thanks @thsottiaux ! 😇

Mateusz Mirkowski@llmdevguy·Jun 17

@AfzalBuilds That's great!

Muhammad Afzal@AfzalBuilds·Jun 17

@llmdevguy I recently developed an invoice generator MVP using GLM 5.2, and the results are insane. I recorded a complete session while testing everything, which is available on my profile.

Mateusz Mirkowski@llmdevguy·Jun 17

🌓I was supposed to write a post today about how great Kimi K2.7-Coder is, but then I started testing GLM 5.2. I've changed my mind. The difference between GLM 5.2 and the previous 5.1 feels bigger than the difference between K2.6 and K2.7. Which one is better? For now, it seems like GLM has a slight edge, but I need to run more tests.

Mateusz Mirkowski@llmdevguy·Jun 16

Cool, Codex. But please do not use my limits if this occurs...

Mateusz Mirkowski@llmdevguy·Jun 15

I would add just one thing. Try to buy a motherboard with two x8 PCIe slots. Cheap ones can give you one x16 slot and one x4 slot, which will greatly reduce speed with two GPUs.

Lotto@LottoLabs

Keep it simple if you want to learn local AI:
1) Build the cheapest rig that can house 1 used 3090
2) Who cares about RAM, CPU, etc.; buy 32GB of DDR4, get a decent mid-range Ryzen/Intel
3) Download qwen 3.6 27b
You’ll be into the rig like $1500 at most. You won’t be able to upgrade much (maybe another 3090), but it’s a great starting point. You don’t feel locked into learning about big rigs and upgrade paths. If you like it, then you can still keep and run the 3090 rig (trust me, it’s useful). And you can build a new separate rig if you need more VRAM, on any platform, after you have some experience and know what type of hardware you actually need, not what people recommend online!

Mateusz Mirkowski@llmdevguy·Jun 15

🙏Please release a model that’s at least half as good as this meme.

Alexander Knigge@AlexanderKnigge

oh my god it's happening, @MistralAI has officially confirmed the upcoming release of Le Chaton Fat
- 30T MoE with 256 experts
- 1M context window
- multimodal and multilingual
- outperforms Fable 5 on every benchmark

Mateusz Mirkowski@llmdevguy·Jun 15

@jun_song For now it's a meme, unfortunately.

Jun Song@jun_song·Jun 15

Fine-tuned Super-Chaton Fat:
- 500T with 1B MoE
- 1B context
- 104.2% on SWE bench
- 98.2% on cute cat bench

Alexander Knigge@AlexanderKnigge
