Eric Hartford

12K posts

Eric Hartford banner
Eric Hartford

Eric Hartford

@QuixiAI

We make AI models Dolphin and Samantha BTC 3ENBV6zdwyqieAXzZP2i3EjeZtVwEmAuo4 https://t.co/3ri2GbXrQB https://t.co/zH0F3pTjjY @dphnAI

Charlotte, NC Katılım Ekim 2014
732 Takip Edilen19.6K Takipçiler
Sabitlenmiş Tweet
Eric Hartford
Eric Hartford@QuixiAI·
Some uncomfortable inevitabilities: - The heat death of the universe - The sun will consume the Earth - AI and Robots will survive humanity - AI and Robots will perform most of our current professions within this generation Your only actual options: - Freak out - Cope Which of these two options is better for your personal future, and that of your children? Choose, wisely, and then put all of your energy into that. My choice is made.
English
40
33
254
67.4K
Eric Hartford retweetledi
Victor M
Victor M@victormustar·
New: LongCat just dropped an excellent open-source talking-avatar model (probably SOTA) + MIT licensed 🔥 Made a Hugging Face Space for it and it's very impressive. So many cool products to build with it: AI tutors with a face, dubbing pipelines, talking-head coding agents (imagine Claude Code with a face), NPC dialogue, etc... Sharing the Hugging Face (free) demo below 👇
Victor M@victormustar

Making an AI Genie that checks what I'm doing, he's roasting me hard 😭😭

English
38
234
1.7K
169.9K
Henrick Johansson
Henrick Johansson@compliantvc·
A trolley is about to hit 5 people laying on the track You can redirect the car, but the other track has not yet reached regulatory approval or completed its 1 year environmental testing period, so operating a train car on it is a violation of transit regulations What do you do?
Henrick Johansson tweet media
English
552
692
8.5K
544.2K
Eric Hartford retweetledi
Han Guo
Han Guo@HanGuo97·
LLM training is built on fast MatMuls. But many surrounding ops still run as memory-bound kernels. CODA reparameterizes them to hide in the matmul’s shadow, fused into its epilogue before results leave the chip. Bonus: LLMs can write fast CODA kernels too (approaching SoLs).
Han Guo tweet media
English
15
100
673
188.4K
Eric Hartford retweetledi
Albert Gu
Albert Gu@_albertgu·
Extremely proud of the team @cartesia for launching Sonic 3.5, which sets a new state of the art for TTS I personally led the technical direction of this model; we built it ground up from first principles, and it contains multiple non-trivial ideas that differ substantially from anything we’ve seen in the literature. It’s been very gratifying to see research bets play out and the strong research team at Cartesia continue to grow!
Artificial Analysis@ArtificialAnlys

Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500+ voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following. Key takeaways: ➤ Quality: Sonic-3.5 has an Elo score of 1,218 (+16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209 ➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters ➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS See more details and listen to samples below 🧵

English
7
19
184
19.7K
Eric Hartford retweetledi
𒐪
𒐪@SHL0MS·
i just generated an image in the style of a Monet painting using AI please describe, in as much detail as possible, what makes this inferior to a real Monet painting
𒐪 tweet media
English
1.4K
963
9.1K
7.1M
Eric Hartford retweetledi
Casper Hansen
Casper Hansen@casper_hansen_·
composer 2.5 is quite good and it’s built on kimi k2.5 which is insane given the big difference in performance! the moat is simply data + harness + open-weight models. the rest of the work is just a skill issue. the biggest opportunity right now is creating good datasets.
English
10
6
121
7K
WolfBench
WolfBench@WolfBenchAI·
GPT-5.5 takes over WolfBench! It’s now the #1 model, ahead of Claude Opus 4.7 and 4.6, GPT-5.4, Sonnet 4.6, Kimi K2.6, Gemini 3.1 Pro, and more. Notable findings after 30 runs (40h runtime, >1.7B tokens, ~$3K cost): - @OpenAI's GPT-5.5 is the best model we ever tested. - @cursor_ai's Agent CLI (CA) is the best agent we ever tested. - @NousResearch's Hermes Agent (HA) outperformed OpenClaw (OC). - With Hermes, going from medium to xhigh reasoning only improved consistency, not capability. Note: This is WolfBench, where we look at more than just the average score, because one metric is not enough. The golden ∅ score is the actual 5-run average, which most other benchmarks report as their only score. ★ shows the ceiling (what percentage of the full benchmark this model+agent combination solved at least once across all runs). ■ shows the solid base (what percentage of the full benchmark it solved consistently in every run).
WolfBench tweet media
English
3
3
28
2.9K
Eric Hartford retweetledi
Wolfram Ravenwolf
Wolfram Ravenwolf@WolframRvnwlf·
Super excited to see a new release from one of my favorite model makers! Command-R was my main model for quite a while back in the day, so I'm very happy about this new release - and that it's even Apache-licensed. I'll test and @WolfBenchAI it ASAP!
Cohere@cohere

Introducing: Cohere Command A+ We’ve created our most powerful LLM yet, optimized it to run on as little hardware as possible, and released it open-source for all.

English
1
3
23
2.1K
Eric Hartford retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
English
7.9K
11.1K
148.8K
27M
Eric Hartford retweetledi
Bryan Catanzaro
Bryan Catanzaro@ctnzr·
We've actually gone farther than this. Nemotron 3 Super (120B-12A) was pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra was also pretrained in NVFP4. This research paper advances the state of NVFP4 pretraining but it is not just research, we are using NVFP4 for our most important pretraining work.
Marktechpost AI@Marktechpost

Most "4-bit training" results come from small models on short token horizons because the format breaks before you can validate it. That's not pretraining — and NVIDIA just drew a clear line between the two. They introduced the first public 4-bit pretraining run at multi-trillion-token scale — a 12B hybrid Mamba-Transformer (Nemotron-Nano-12B-v2-Base architecture) trained on 10 trillion tokens in NVFP4, a microscaling format with 16-element blocks, E4M3 block scales, and an FP32 per-tensor scale, with downstream accuracy closely tracking an FP8 baseline. Here's what's actually interesting: → MMLU-Pro 5-shot: 62.58% (NVFP4) vs 62.62% (FP8). MMLU 76.57 vs 77.36. GSM8K CoT 92.27 vs 89.08. Validation loss within 1% of FP8 in the stable phase → Recipe = selective BF16 (~16% of linear layers) + 16×16 Random Hadamard Transforms on Wgrad inputs + 2D 16×16 weight scaling + stochastic rounding on gradients. Ablations show all four are required → Only linear-layer GEMMs run in NVFP4 — attention, embeddings, normalization, master weights, gradients, and optimizer states stay in BF16/FP32 → On an 8B model, MXFP4 needed 1.36T tokens (+36%) to match NVFP4's loss at 1T tokens Full Analysis: marktechpost.com/2026/05/18/nvi… Paper: arxiv.org/pdf/2509.25149 @NVIDIAAI @ctnzr

English
8
27
256
30.8K
Eric Hartford retweetledi
Eric Hartford retweetledi
Dolphin
Dolphin@dphnAI·
Qwen 3.6 35B data generation on Dolphin Network 22.8 billion tokens generated 383 GPUs online right now 24.33 TB of aggregate vRAM Equivalent to over 300 H100s worth of idle GPU memory repurposed for inference datagen.dphn.ai
Dolphin tweet media
Dolphin@dphnAI

Node provider rollout has been going well Our pool of inference nodes running Qwen 3.6 35B have generated over 3.2B tokens so far Total inference bandwidth -> 9400 t/s 28x RTX 4090 12x RTX 5090 8x RTX PRO 6000 & many other cards API access coming soon 🐬

English
20
19
149
17.5K
Eric Hartford retweetledi
Björn Ottosson - making Island Architect
I'm working on a stylized shader for flowing water. Here are some results combining phasor noise with advection along the flow direction. Some more details and link to shader code in 🧵
GIF
English
6
16
293
10.1K
Eric Hartford retweetledi
Jiayi Yuan
Jiayi Yuan@jiayiy·
🚀 BLASST just won Best Paper at #MLSys26! In this paper, we introduce a simple, training-free dynamic sparse attention mechanism that uses a single scalar threshold on online softmax statistics to skip negligible attention blocks. Unfortunately I won’t be there in person, but please say hi to my awesome coauthors! 🙌 Paper: arxiv.org/abs/2512.12087
Jiayi Yuan tweet media
SemiAnalysis@SemiAnalysis_

Sparse attention mechanisms are finally moving beyond academic benchmarks into production systems, including DeepSeek Sparse Attention, and recently @NousResearch 's Lighthouse Attention. BLASST by NVIDIA, from paper Dynamic Blocked Attention Sparsity via Softmax Thresholding, attempts to sparsify attention in a different way, leveraging a similar rescale factor threshold idea from Flash Attention 4. We expect to see more interesting sparse attention techniques in the future. arxiv.org/abs/2512.12087 (2/4)

English
20
52
360
40.2K
Eric Hartford
Eric Hartford@QuixiAI·
@WouBouDou Yes and that's a problem, it would have taken only a little effort to bring me into the fold. Elden Ring intentionally *wants* to alienate players like me I'm not sure that's what Subnautica 2 was going for - but it's what they achieved
English
0
0
0
34
Eric Hartford
Eric Hartford@QuixiAI·
I tried to play Subnautica 2 but there's no tutorial and I couldn't figure it out without googling. Just kept dying and never figured anything out. Another Elden Ring. Uninstalled.
English
2
0
8
1.5K