Luigi Maselli

4.7K posts

Luigi Maselli

@grigi0

techno-geek, webdev, hackerpreneur.

Earth Katılım Kasım 2009

270 Takip Edilen499 Takipçiler

Luigi Maselli@grigi0·3 May

@lu_zero_ @antirez it mostly depends on the LLM model and the quality of data in the context provided by tools/pi-extensions youtube.com/watch?v=A_fvWh… For simple tool usage, you don't need a 10K context to explain how to use tools

YouTube

English

Luca Barbato@lu_zero_·2 May

@grigi0 @antirez but it is as good?

English

antirez@antirez·1 May

Look at this. Also opencode uses freaking 11k tokens of system prompt. Even at decent pre-fill of ~130 t/s it means waiting 84 seconds to start a session. What's the point? :D The pi agent is a lot saner here. Moreover, one could say, let's cache on disk very long common KV cache chunks, no? Hash it with all the parameters and put a sensible TTL if not used. But also: only cache it if you see it repeated N times across different sessions.

English

346

44.8K

Luigi Maselli@grigi0·1 May

Artix linux > #archlinux still strong on distrowatch youtube.com/watch?v=NGla_1…

YouTube

Luigi Maselli@grigi0

Un Linux senza #systemd è possibile #dinit #artixlinux

English

127

Luigi Maselli@grigi0·19 Nis

@mudler_it @lu_zero_ huggingface.co/mudler/Qwen3.6…

QME

Luigi Maselli@grigi0·19 Nis

Tested Qwen3.6-35B-A3B-APEX-I-Compact, almost as good as Qwen3.6-35B-A3B-UD-Q4_K_M but it takes 4Gb less. //cc @mudler_it @lu_zero_

English

134

Luigi Maselli@grigi0·17 Nis

@UnslothAI @Alibaba_Qwen @opencode Just beware chat template for thinking reddit.com/r/LocalLLaMA/c…

English

Luigi Maselli@grigi0·17 Nis

@UnslothAI Yes Qwen3.6 35B A3B UD Q4 K M is the new champ also in my local bench! Amazing! 🎊 @Alibaba_Qwen @opencode

English

Luigi Maselli@grigi0·17 Nis

Qwen3.6 35b a3b perform worse than Qwen3.5 in my benchmark, but it should be something related to the quantization or tool calling //cc @UnslothAI gemma4 26b a4b still better for me #localai #llm

English

150

Luigi Maselli@grigi0·9 Nis

There isn't aclear winner but qwen is faster grigio.org/are-local-llms…

English

Luigi Maselli@grigi0·9 Nis

Are Local LLMs good enough for Vibe Coding? Gemma4-26B-A4B vs Qwen3.5-35B-A3B

English

100

Luigi Maselli@grigi0·4 Nis

nothink is the best tradeoff between accuracy and speed

English

Luigi Maselli@grigi0·4 Nis

Gemma 4 26B A4B is currently the best tradeoff in agentic benchmark for GPU poor people, specially the nothink version

English

Luigi Maselli@grigi0·4 Nis

Create your personal benchmark with llm-eval-simple, specially for local llm github.com/grigio/llm-eva…

English

Luigi Maselli@grigi0·1 Nis

Create your personal LLM benchmark with Opencode Benchmark Dashboard github.com/grigio/opencod…

English

Luigi Maselli@grigi0·1 Nis

Currently qwen-3.6-plus-free is the best model you can run on #opencode for quality and speed.

English

117

Luigi Maselli@grigi0·29 Mar

@lu_zero_ Purtroppo ci sono progetti che uso che dipendono strettamente da systemd github.com/cockpit-projec… Eccezioni a parte, #dinit funziona bene sia per processi di sistema che per sessione utente

Italiano