Eric Hartford

12.5K posts

Eric Hartford

@QuixiAI

We make AI models Dolphin and Samantha BTC 3ENBV6zdwyqieAXzZP2i3EjeZtVwEmAuo4 https://t.co/3ri2GbXrQB https://t.co/zH0F3pTjjY @dphnAI

Charlotte, NC Katılım Ekim 2014

758 Takip Edilen19.9K Takipçiler

Sabitlenmiş Tweet

Eric Hartford@QuixiAI·29 Haz

Average human consumes on the order of hundreds of kilojoules per paragraph of information they create, while AI inference uses on the order of hundreds of joules per paragraph, making AI roughly three orders of magnitude more energy-efficient per paragraph. This disparity will only increase (on both ends) - and that has macroeconomic ramifications.

English

1.9K

Eric Hartford@QuixiAI·57m

@UnslothAI @NVIDIAAI @googlegemma 1.5x faster than what?

English

141

Unsloth AI@UnslothAI·4h

We’re releasing Gemma 4 NVFP4 quants that run 1.5× faster on your GPU. Gemma-4-12B NVFP4 works on 11GB VRAM. 26B-A4B hits 13K tok/s (B200). Unsloth NVFP4 enables faster, more accurate 4-bit Blackwell inference. Blog: unsloth.ai/docs/basics/nv… Gemma NVFP4: huggingface.co/collections/un…

English

544

40.8K

Eric Hartford retweetledi

Tencent Hy@TencentHunyuan·12h

We’ve just released the 1-bit & 4-bit version of Hy3, a flagship-scale 295B model that can be served on a single GPU. 👌 Run Hy3 with llama.cpp, enable MTP, and experience powerful intelligence on dramatically lower hardware.🚀🚀🚀 Can’t wait to see what you build. #Hy3 #Hy #GGUF #llamacpp

Tencent Hy@TencentHunyuan

🚀Hy3 is here. 295B MoE. Best in its size class. Rivals trillion-scale flagships. Reliable and affordable for most agentic usecases. Apache 2.0. Friendly for commercial use. FREE API for 2 weeks → openrouter.ai/tencent/hy3:fr… 🤗 huggingface.co/tencent/Hy3 📖 hy.tencent.com/research/hy3

English

121

171.5K

Eric Hartford@QuixiAI·6h

@LeeLeepenkman Yeah I noticed it ignores

English

185

Lee Penkman@LeeLeepenkman·19h

accidentally pasted slop into claude code and it paused briefly and ignored me and kept going.... cool gst is my git status alias so muscle memory so sometimes i accidentally type it

English

418

Eric Hartford retweetledi

Ivan Fioravanti ᯅ@ivanfioravanti·8h

Hy3 in 1bit! Benchmarks are still great!

Tencent Hy@TencentHunyuan

English

174

16.5K

Eric Hartford@QuixiAI·22h

@_LazarusAI SovereignStack Own your AI. The frontier labs will raise prices, deplatform you, deprecate the model you depend on. You can't run it on prem. You can't run it air gapped. You can't finetune it or run community finetunes. SovereignStack enables all that. It's free and open source.

English

1.1K

Eric Hartford@QuixiAI·22h

@codemeoww @jun_song He's gonna Grok you in the mouth!

English

codemeoww@codemeoww·23h

@jun_song Got grok'ed by Elon 🤣 That's why I don't get hasty with model choice. It's just a trap

GIF

English

662

Jun Song@jun_song·1d

How do I get refund for this?

Jun Song@jun_song

I think it’s the right timing to grab $99/month Supergrok Heavy. Using it as research agent.

English

259

44K

Eric Hartford@QuixiAI·22h

@jun_song "Nvidia NVFP4" Runs great on Intel XPU GPU's, with QuixiCore kernels! github.com/QuixiAI/QuixiC… (and AMD too)

English

453

Jun Song@jun_song·1d

SuperGLM-5.2-abliterated is finally available 🚀 The strongest local AI that you can run on your hardware. - uncensored from Nvidia NVFP4 - Fixed broken points from abliteration - enhanced overall performance Only available with nvfp4 now ⬇️

English

565

37.8K

Eric Hartford@QuixiAI·1d

When they design the flavor of spicy chips / nuts, why do they always mix in "fruit loops" flavor?

English

1.2K

Eric Hartford@QuixiAI·1d

@0xSero whatcha think about this brianbell-x.github.io/weight-compres…

English

558

0xSero@0xSero·1d

Nvidia/GLM-5.2-NVFP4 running on 4x rtx pro 6000 with 400k context @ 82 tok/s and is doing amazing I have to water cool these GPUs, the temperatures and sounds are limiting how far I can push this thing This new moet compression is going straight in local studio for onboarding

English

251

18.6K

Eric Hartford@QuixiAI·1d

@sudoingX Gaudi 2 server

Français

352

Sudo su@sudoingX·2d

what's the loudest computer you've ever owned.

English

Eric Hartford retweetledi

Alpin@AlpinDale·2d

Anthropic is a clown show

Claude@claudeai

We're extending Claude Fable 5 access on all paid plans, as well as keeping Claude Code’s weekly rate limits 50% higher, through July 19.

English

3.2K

Eric Hartford retweetledi

Alpin@AlpinDale·2d

Here's a blog post explaining all the neat tricks I discovered while writing this kernel. Hope you'll enjoy reading it as much as I did writing it: blog.alpindale.net/posts/swordfis…

English

1.8K

Eric Hartford retweetledi

Alpin@AlpinDale·2d

Introducing the Swordfish Kernel. I've been studying the Blackwell sm100 architecture, and this is my contribution as a result of that. It's a weight-only inference kernel, a successor to Marlin and Machete, designed to maximize throughput on the datacenter-grade Blackwell GPUs. On a B200 for INT4 inline dequantization, it reaches 90% of cuBLAS on BF16, and on the Jetson Thor, it reaches ~95% of the memory bandwidth.

English

195

15K

Eric Hartford@QuixiAI·2d

💀

ART

518

Eric Hartford@QuixiAI·2d

Fable is very helpful with molecules - as long as they dont have carbon chains because then they are organic, and that's DANGEROUS.

English

2.9K

Eric Hartford@QuixiAI·2d

@ivanfioravanti Fable is still better - when it decides it wants to

English

375

Ivan Fioravanti ᯅ@ivanfioravanti·2d

I've not used Fable at all in the last 3 days, and I don't feel the need honestly. All new models are more than enough. I'm really curious to see if Anthropic will remove it from subscriptions or not.

English

155

12.8K

Eric Hartford@QuixiAI·2d

@waterloo_intern @dylan522p To gather data

English

372

Eric Hartford@QuixiAI·2d

@Shra_va_ni You write it then It takes skill to manage agents effectively

English

215