Stellan Haglund
@Stellanhaglund

800 posts

Entrepreneur and code ninja. https://t.co/GyK8fQSFZe

Joined March 2009
213 Following · 251 Followers
Ivan Fioravanti ᯅ@ivanfioravanti·
Playing with my Italian wine classification and Qwen3.5-0.8B fine-tuning with MLX on Apple Silicon! From 10% to 72% accuracy after the first 100 iters with batch size 12. Let's see what I can reach with it!
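For anyone wanting to try a setup like this, here is a minimal sketch of preparing the training data. The wine examples and file layout are invented for illustration; mlx_lm's LoRA trainer accepts JSONL `"prompt"`/`"completion"` pairs as one of its supported formats, and the CLI flags in the comment may differ by version:

```python
import json
from pathlib import Path

# Hypothetical wine-classification examples in the "prompt"/"completion"
# JSONL format that mlx_lm's LoRA trainer accepts.
examples = [
    {"prompt": "Classify the grape of this Italian wine: Barolo, Piedmont, tannic red.",
     "completion": "Nebbiolo"},
    {"prompt": "Classify the grape of this Italian wine: Brunello di Montalcino, Tuscany.",
     "completion": "Sangiovese"},
    {"prompt": "Classify the grape of this Italian wine: Soave Classico, Veneto, white.",
     "completion": "Garganega"},
]

data_dir = Path("data")
data_dir.mkdir(exist_ok=True)
with open(data_dir / "train.jsonl", "w", encoding="utf-8") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")

# Training would then be launched with something like (flag names may
# differ by mlx_lm version; a valid.jsonl is also expected alongside):
#   python -m mlx_lm.lora --model <model> --train --data data \
#       --batch-size 12 --iters 100
```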
Mariusz Kurman@mkurman88·
Qwen 3.5 4B !!!! 🌶️🌶️🌶️
Elon Musk@elonmusk·
@tetsuoai Banger 🤣🤣 How dare they steal the stuff Anthropic stole from human coders??
tetsuo@tetsuoai·
I can't believe someone would just steal from Anthropic like this. The millions of man-hours Anthropic spent hand-writing code, text, art, books, etc. to generate enough data for training must be taken into consideration here. Where is the respect for IP?
Anthropic@AnthropicAI

We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.

Logan Kilpatrick@OfficialLoganK·
We just shipped "Batch mode" in the Gemini API with 50% discounts on our 2.5 models and the ability to enqueue billions of tokens at a time!
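For context, a hedged sketch of what an input file for a batch job might look like. The exact JSONL schema (`"key"`, `"request"`, `"contents"`) is an assumption based on the Gemini API docs and should be checked against the official reference before use:

```python
import json

# Sketch of a Gemini batch-mode input file: one JSON object per line,
# each pairing a user-chosen key with a GenerateContentRequest-shaped body.
# Field names here are assumptions, not verified against the live API.
prompts = ["Summarize CRDTs in one sentence.", "What is LoRA fine-tuning?"]

with open("batch_requests.jsonl", "w", encoding="utf-8") as f:
    for i, p in enumerate(prompts):
        row = {
            "key": f"request-{i}",
            "request": {"contents": [{"parts": [{"text": p}]}]},
        }
        f.write(json.dumps(row) + "\n")

# The file would then be uploaded and a batch created against a 2.5 model,
# at roughly 50% of the interactive per-token price.
```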
Stellan Haglund@Stellanhaglund·
@_philschmid @multimodalart A bit out of context, but I’ve been trying to find out whether it’s possible to use a fine-tuned model with the batch prediction API?
Philipp Schmid@_philschmid·
Gemini 2.5 is production ready! We just launched 3 new Gemini models, with 2.5 Pro and Flash now generally available and a new Gemini 2.5 Flash Lite preview! 🧠⚡️🔦 Here is all you need to know:
🔦 New Gemini 2.5 Flash Lite (Preview) with Thinking, 1M context, only $0.1/$0.4, better than 2.0 Flash, and tool use.
🧠⚡ Gemini 2.5 Flash and 2.5 Pro are now Generally Available (GA) and production-ready.
💰 Updated pricing for 2.5 Flash: $0.30/1M input and $2.50/1M output tokens.
🤗 Start building with all 3 models today in AI Studio and via the Gemini API! New model IDs: `gemini-2.5-flash-lite-preview-06-17`, `gemini-2.5-flash`, `gemini-2.5-pro`
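A quick sanity check of what the quoted 2.5 Flash pricing means in practice:

```python
# Back-of-envelope cost for Gemini 2.5 Flash at the prices quoted above:
# $0.30 per 1M input tokens, $2.50 per 1M output tokens.
def flash_cost(input_tokens: int, output_tokens: int) -> float:
    return input_tokens / 1e6 * 0.30 + output_tokens / 1e6 * 2.50

# e.g. 2M input + 500K output tokens:
cost = flash_cost(2_000_000, 500_000)
# 2 * 0.30 + 0.5 * 2.50 = 0.60 + 1.25 = 1.85 dollars
```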
Stellan Haglund@Stellanhaglund·
@LingYang_PU That looks pretty sequential; isn’t the upside of diffusion that it predicts all the tokens at every step?
Victor M@victormustar·
You can really feel the A3B here 🚀 (hardware is a MacBook Pro M4 Max, 128GB, btw)
Victor M@victormustar·
Qwen/Qwen3-30B-A3B is an absolute game changer 🤯 ⬇️ Are you on a Mac? Then use the MLX weights: getting 100 tokens/sec here. It changes everything for real-world use cases.
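A rough sketch of why a 30B model feels this fast on a laptop, under the usual reading that "A3B" means about 3B of the 30B parameters are active per token (mixture-of-experts); the 4-bit memory figure is a back-of-envelope estimate, not a measurement:

```python
# Qwen3-30B-A3B: ~30B total parameters, but only ~3B active per token,
# so per-token compute is closer to a 3B dense model.
total_params = 30e9
active_params = 3e9
active_fraction = active_params / total_params  # 0.1

# Rough weight memory at 4-bit quantization (0.5 bytes/param),
# ignoring KV cache and activation overhead:
weights_gb = total_params * 0.5 / 1e9  # 15.0 GB, comfortable on 128GB
```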
Rody Davis@rodydavis·
I was able to create a CRDT in SQLite with pure C (no dependencies) custom loadable extensions! 🚨
uuid.c (from the official SQLite misc extensions)
hlc.c (Dart HLC ported to C)
crdt.c (Dart logic with triggers ported to C)
gist.github.com/rodydavis/d197…
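The trigger-based idea can be sketched in plain SQL as a last-writer-wins register; the schema and HLC encoding below are illustrative (the real extension is C, and this is not the gist's actual code). Hybrid logical clock timestamps are encoded as zero-padded strings so lexicographic order matches causal order:

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
CREATE TABLE kv (key TEXT PRIMARY KEY, value TEXT, hlc TEXT);
-- Reject any update carrying an older-or-equal HLC: last writer wins.
CREATE TRIGGER kv_lww BEFORE UPDATE ON kv
WHEN NEW.hlc <= OLD.hlc
BEGIN
    SELECT RAISE(IGNORE);
END;
""")

def put(key, value, wall_ms, counter, node):
    # Zero-padded "wall:counter:node" so string order == HLC order.
    hlc = f"{wall_ms:013d}:{counter:04d}:{node}"
    db.execute(
        "INSERT INTO kv (key, value, hlc) VALUES (?, ?, ?) "
        "ON CONFLICT(key) DO UPDATE SET value = excluded.value, hlc = excluded.hlc",
        (key, value, hlc),
    )

put("color", "red", 1000, 0, "a")
put("color", "blue", 2000, 0, "b")   # newer HLC: wins
put("color", "green", 1500, 0, "c")  # older HLC: silently ignored
value = db.execute("SELECT value FROM kv WHERE key = 'color'").fetchone()[0]
# value == "blue"
```

`RAISE(IGNORE)` in a BEFORE trigger skips the row change without raising an error, which is what makes concurrent replays of out-of-order writes converge.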
Steren@steren·
With @ollama 0.6.2, Gemma 3 27B now runs on 1 Cloud Run GPU. That's 27 billion parameters. The biggest Gemma 3 variant. The most capable open model you can run on a single GPU.
Logan Kilpatrick@OfficialLoganK·
We just updated the Gemini API docs to make native image generation 🖼️ even simpler to access and get started with across Python, JS, and cURL. Happy building : ) ai.google.dev/gemini-api/doc…
Piers Morgan@piersmorgan·
Blocking so many loons tonight. Is it just me or is X getting madder by the day?
anton@abacaj·
GRPO on llama3.2-1B: inject "Wait a second..." when the model is wrong, let it regenerate, then compute the loss
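The control flow being described can be sketched as follows. The stub model is a stand-in for llama3.2-1B, and a real GRPO setup would compute group-relative advantages over many sampled rollouts rather than a single 0/1 reward; this only shows the reflection-injection step:

```python
# If the first rollout is wrong, append a reflection cue and regenerate;
# the reward (and hence the loss) is then computed on the second attempt.
REFLECTION = " Wait a second..."

def stub_model(prompt: str) -> str:
    # Stand-in for the 1B model: pretends to self-correct after the cue.
    return "4" if REFLECTION in prompt else "5"

def rollout_with_reflection(prompt: str, gold: str):
    first = stub_model(prompt)
    if first == gold:
        return first, 1.0                           # reward on direct answer
    second = stub_model(prompt + first + REFLECTION)
    return second, 1.0 if second == gold else 0.0   # reward on the retry

answer, reward = rollout_with_reflection("2+2=", gold="4")
# answer == "4", reward == 1.0
```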
Stellan Haglund@Stellanhaglund·
@cerebras Like, is it possible to make sure the data doesn’t leave Europe when using the API?
Stellan Haglund@Stellanhaglund·
@cerebras what are the options for European customers with GDPR requirements?
Stellan Haglund@Stellanhaglund·
@cerebras I mean for inference; the data centers are located in the US, right?
Stellan Haglund@Stellanhaglund·
@elonmusk when something is community noted, why aren’t other posts containing the same argument also marked with the same note? I see posts all day saying stuff I already saw was community noted as false several days ago.