frzlt

886 posts

frzlt

@DenysYaroshenko

0.5x dev

Katılım Mart 2017

65 Takip Edilen4 Takipçiler

frzlt@DenysYaroshenko·15h

@creepydotorg @grok It means pineapple

English

322

Creepy.org@creepydotorg·19h

Hey @grok what does this tattoo actually mean?

English

220

1.1K

1.3M

frzlt@DenysYaroshenko·21h

@MiaAI_lab what hardware are you using ?

English

Mia@MiaAI_lab·22h

Running agentic coding benchmarks on DeepSeek-v4-Flash and Step-3.7-Flash. Will post results soon.

English

1.9K

frzlt@DenysYaroshenko·1d

@davideciffa What is decent ?

English

365

mrciffa@davideciffa·1d

Go local. You only need a 16GB card to get started with decent models!

Julien Chaumond@julien_c

run local models TODAY

English

13.7K

frzlt@DenysYaroshenko·2d

@Hunchoquavo153 God of lightning got tased by an electroshock, the worst scene

English

103

Ratatouille@Hunchoquavo153·3d

One of the top scenes in MCU 🎬 Thor: Ragnarok

English

111

728

29.1K

606.7K

frzlt@DenysYaroshenko·3d

@LottoLabs How dumb is it ? It seems it is dumber even gemma 12b how it feel on practice ?

English

292

Lotto@LottoLabs·3d

Okay so Gemma diffusion is cool but we need proper llama support this gemmadiffusion pr is cli only What am I supposed to do with this

English

4.9K

frzlt@DenysYaroshenko·3d

@LottoLabs What if there wil be no open weights anymore thats why they rebrand

English

809

Lotto@LottoLabs·3d

What if Qwen 3.7 27b is so good they just drop it as Qwen 4 The new rebrand today, the long wait between max and plus and the open weight models? Maybe we get qwen 4 27b 👀

English

206

14.7K

frzlt@DenysYaroshenko·3d

@TeksEdge It is even less intellegent than gemma 12b though

English

609

David Hendrickson@TeksEdge·3d

🚨 Oh 💩! HUGE!! Google just jumped on the text-diffusion train with DiffusionGemma (Gemma 4 family)!! 🔥 26B total → only ~3.8B active params ⚡ Up to 4x faster token output (1000+ tok/s on high-end GPUs) 🛠️ Block-parallel generation = built-in self-correction & better code/markdown editing 📦 Fully open source (Apache 2.0) — available NOW on HF 🏆 Localmaxxers win again!! No new hardware needed — runs great on your 24GB+ setups. Who’s trying it first? 👇

Google AI Developers@googleaidevs

DiffusionGemma, our experimental open model released under an Apache 2.0 license, explores text diffusion, an exceptionally fast approach to text generation. Here’s how DiffusionGemma accelerates development: + Faster token output: By shifting the bottleneck from memory bandwidth to raw compute, the model generates up to 4x faster token output on dedicated GPUs + Accessible hardware footprint: Activates just 3.8B parameters during inference, fitting comfortably within 24GB-VRAM high-end consumer GPUs when quantized + Novel workflows: Parallel token generation enables self-correction, making it ideal for code infilling, in-line editing, and non-linear structures DiffusionGemma prioritizes speed over raw quality and accelerates best on compute-bound hardware (like @NVIDIAAI GPUs). Standard @GoogleGemma 4 remains recommended for production quality and memory-bound devices.

English

126

14.9K

frzlt@DenysYaroshenko·3d

@GoogleDeepMind @googlegemma It's even weaker than 12b though

English

Google DeepMind@GoogleDeepMind·4d

DiffusionGemma is our new experimental open model with up to 4x faster output on dedicated GPUs. Instead of predicting word-by-word, it generates entire blocks of text simultaneously. This lets the model self-correct and format complex markdown in real time.

English

108

261

2.4K

180.8K

frzlt@DenysYaroshenko·3d

@LottoLabs wow finally, I was waiting for it so long

English

246

Lotto@LottoLabs·4d

What is unsloth cooking up

Unsloth AI@UnslothAI

Google releases DiffusionGemma.✨ The new 26B-A4B diffusion text model runs locally on 18GB RAM. It supports high-speed text generation, thinking, image, video and 256K context. Run and train via Unsloth Studio. GGUF: huggingface.co/unsloth/diffus… Guide: unsloth.ai/docs/models/di…

English

166

16.5K

frzlt@DenysYaroshenko·4d

@yacineMTB ok, is this that bad for developers ?

English

422

kache@yacineMTB·4d

very proud of my little brother. he plays basketball in socal and works 6 hours and spent 10k$ in tokens in the past 9 days, saving his company an order of magnitude more than his salary

English

327

15.5K

frzlt@DenysYaroshenko·4d

@witcheer qwen 3.7 27b please

English

witcheer@witcheer·4d

May 2026 was a massive month for local AI runtimes: - llama.cpp: merged MTP speculative decoding - MLX: 4x faster performance on M5 chips - vLLM 0.21: stabilised DeepSeek V4 on Blackwell - Ollama: added Codex App support - LM Studio: shipped stable MTP

English

frzlt@DenysYaroshenko·5d

@justinmk Add screenshots in repo page

English

181

justinmk@justinmk·5d

guh.nvim vs octo.nvim in case you haven't moved to codeberg 🙄 github.com/justinmk/guh.n…

English

10.4K

frzlt@DenysYaroshenko·6 Haz

@yacineMTB damn true

English

kache@yacineMTB·6 Haz

the amount of time i spend cleaning up LLM code is greater or equal to the amount of time it would have taken me to write it myself

English

231

114.2K

frzlt@DenysYaroshenko·6 Haz

@AiBreakfast it was all the time there, qwen is there

English

AI Breakfast@AiBreakfast·5 Haz

The most underrated thing in AI right now is that “good enough” local intelligence has arrived. Gemma 4 12B on a 16GB laptop covers everything everything normal users need. Unlimited, free forever, and completely offline.

English

496

29.5K

frzlt@DenysYaroshenko·5 Haz

@LottoLabs It's sad we won't get another qwen 27b this year

English

Lotto@LottoLabs·4 Haz

Google really seen this and said let’s launch 12b

Lotto@LottoLabs

It’s kinda sad we knowing we won’t get another Gemma model this year

English

frzlt@DenysYaroshenko·4 Haz

@bijanbowen x.com/atomic_chat_hq…

atomic.chat@atomic_chat_hq

New Google Gemma 4 12B claims near-26B performance - we tested both! We ran both models locally on one RTX 4090 and gave each the same task: write a self-contained HTML5 canvas animation with real physics in one file without libraries. Three scenes - a Galton board, two blocks colliding off a wall, and a chaotic triple pendulum Outputs: Gemma 4 26B-A4B: 15 GB VRAM usage, 6.9k tokens, 138 tok/s Gemma 4 12B: 9 GB VRAM usage, 8.9k tokens, 80 tok/s Same Gemma 4 family, but the 26B-A4B won every scene and ran ~1.7x faster - on just 4B active params. The 12B stayed very close though, on almost half the VRAM - which makes it the ideal model for a 16 GB laptop

QME

BijanBowen@bijanbowen·3 Haz

Gemma 4 12b is a freak of nature at coding

English

1.2K

143.9K

frzlt@DenysYaroshenko·1 Haz

@mweinbach Coding 54 😂

English

Max Weinbach@mweinbach·1 Haz

Nemotron 3 Ultra model! 550B parameter, near state of the art open weight (on par with GLM-5.1 and Kimi K2.6) We shall see how it is in practice

English

234

15.7K

frzlt@DenysYaroshenko·31 May

@TimJayas Qwen 27b electricity only

English

Tim Jayas@TimJayas·30 May

DeepSeek just EMBARRASSED Claude Opus 4.7 Just switched to DeepSeek V4 Pro for a few days Cost: DeepSeek V4 Pro = $2.02 Claude Opus 4.7 = $265.21 Same quality for most of the medium tasks with no noticeable difference in output I know it’s hosted in China with cheap electricity but at this point western labs are getting cooked on price

English

628

68K

frzlt@DenysYaroshenko·29 May

@no_casuls @grok що за гра

Українська

🖕😎🖕Nocasuls™ ☝️🤓@no_casuls·29 May

От шо яхтклуби вміють так це напхати секреток, вже годину шарюся по 4 екранах міста і все якусь хуйню знаходжу

Українська

1.4K

frzlt@DenysYaroshenko·26 May

@Vivek4real_ It seems like my 3090 worth every peny

English

Vivek Sen@Vivek4real_·24 May

BREAKING: MICROSOFT JUST ANNOUNCED TO BAN ITS OWN ENGINEERS FROM USING AI DUE TO THE COST OF USING IT. VP OF NVIDIA SAID, “THE COST OF AI FOR MY TEAM WAS MORE THAN HUMANS” “AI CAN COST MORE THAN HUMAN WORKERS NOW”

English

1.1K

6.8K

48.7K

17.4M

Keşfet

@creepydotorg @grok @MiaAI_lab @davideciffa @Hunchoquavo153 @LottoLabs @TeksEdge @GoogleDeepMind