JimZ

5.2K posts

@moonares

Train deep neural nets since 2015. Weight lifting. Tesla talk and FSD analysis.

North Carolina · Joined July 2010

59 Following · 3K Followers

Pinned Tweet

JimZ@moonares·18 Nov

Incredible experience. Every dollar spent for this trip well worth it. #Starship #SpaceX

English

1.6K

142.3K

JimZ@moonares·9 Mar

@bcherny How to do this in terminal? Or desktop only

English

1.8K

Boris Cherny@bcherny·9 Mar

New in Claude Code: Code Review. A team of agents runs a deep review on every PR. We built it for ourselves first. Code output per Anthropic engineer is up 200% this year and reviews were the bottleneck Personally, I’ve been using it for a few weeks and have found it catches many real bugs that I would not have noticed otherwise

Claude@claudeai

Introducing Code Review, a new feature for Claude Code. When a PR opens, Claude dispatches a team of agents to hunt for bugs.

English

459

506

7.4K

1.2M

JimZ@moonares·3 Mar

@ivanfioravanti Wishful thinking. Compute is everything. It's a fight-for-capital game, as there are already 4-5 performant labs, and they mostly focus on the large model variants.

English

304

Ivan Fioravanti ᯅ@ivanfioravanti·3 Mar

I bet the former Qwen leading team will create a new lab and even stronger models 🚀

English

163

8.3K

JimZ@moonares·3 Mar

@bcherny Language support?

Français

10.5K

Boris Cherny@bcherny·3 Mar

🎶 I've been using voice mode to write much of my CLI code this last week Can't wait to hear what you think.

Thariq@trq212

Voice mode is rolling out now in Claude Code. It’s live for ~5% of users today, and will be ramping through the coming weeks. You'll see a note on the welcome screen once you have access. /voice to toggle it on!

English

271

182

3.5K

657.5K

JimZ@moonares·18 Feb

@JustinLin610 Speed difference on H200 arch?

English

299

Junyang Lin@JustinLin610·18 Feb

bf16 too big? here it is the fp8 version that u been expecting for! try to enjoy the benefits of our size and sparsity on ur device!

Qwen@Alibaba_Qwen

🚀 Qwen3.5-397B-A17B-FP8 weights are now open! It took some time to adapt the inference frameworks, but here we are: ✅ SGLang support is merged 🔄 vLLM PR submitted → github.com/vllm-project/v… Check the model card for example code. vLLM support landing in the next couple of days! Hugging Face: huggingface.co/Qwen/Qwen3.5-3… ModelScope: modelscope.cn/models/Qwen/Qw…

English

268

19.8K

JimZ@moonares·15 Feb

For Claude Code: on the $20/month Pro plan it is a helpful assistant; on the $200/month Max plan it is a slave army

English

409

JimZ@moonares·13 Feb

@TaraBull Damn

English

151

TaraBull@TaraBull·13 Feb

Lord of the Rings trilogy combined is 9 hrs 18 mins and it could have been over in seconds

English

1.2K

1.7K

21.8K

9.1M

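JimZ retweeted

Lonely@Lonely__MH·8 Feb

@novoreorx One thought: first evolve into a super-individual, then lead Agents to build a super-team. The former determines your ceiling; the latter determines your ability to scale. The two complement each other: you are the soul, and the Agents are your avatars. ✌️

Chinese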

2.1K

JimZ@moonares·26 Jan

@elonmusk @davidasinclair 8000~10000, mixed up with weight training

English

661

Elon Musk@elonmusk·26 Jan

@davidasinclair What’s the least number of steps needed? That’s what I think most people would like to know.

English

1.3K

311

15.4K

1.2M

David Sinclair@davidasinclair·25 Jan

7,000 steps/day is linked to ~50–70% lower mortality. You move! jamanetwork.com/journals/jaman…

English

237

396

6.4K

1.4M

JimZ@moonares·9 Dec

@MistralAI Had high hopes for Ministral3-3B, ran some benchmarks, and the results fell short, especially weak in tool-calling. Also surprisingly weak in vision benchmarks, some quite bad.

English

Mistral AI@MistralAI·2 Dec

Introducing the Mistral 3 family of models: Frontier intelligence at all sizes. Apache 2.0. Details in 🧵

English

171

806

5.4K

1.3M

JimZ@moonares·28 Oct

@thinkymachines @UnslothAI can we have OPD support with unsloth? @danielhanchen

English

492

Thinking Machines@thinkymachines·27 Oct

Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other approaches for a fraction of the cost. thinkingmachines.ai/blog/on-policy…

English

404

2.8K

1.9M

JimZ@moonares·27 Oct

@NashvilleFSD Real world

English

JimZ@moonares·27 Oct

2nd day with FSD v14, encountered a fallen-off tire on the highway

English

309

JimZ@moonares·23 Oct

Anyone affected by the FAIR layoffs, feel free to DM. We are a Fortune 500 company on the East Coast, currently growing our AI tech division and in need of all sorts of expertise across AI/ML, post-training, data, eval, infra and SW.

English

801

JimZ@moonares·23 Oct

@tydsh My company on the East Coast is hiring AI researchers & engineers, welcome to DM

English

65.4K

Yuandong Tian@tydsh·23 Oct

Several of my team members + myself are impacted by this layoff today. Welcome to connect :)

English

473

268

6.4K

4.4M

JimZ@moonares·21 Oct

@chuckcook No, I refuse to believe it.

English

Chuck Cook@ChuckCook·21 Oct

@moonares 5

344

Chuck Cook@ChuckCook·21 Oct

Here is my very first Unprotected Left Turn Video. youtu.be/vH3zQhzucjY

YouTube

Sawyer Merritt@SawyerMerritt

Tesla FSD Beta was released 5 years ago today. The video below is from the very first night of public testing. Watching FSD V14.1.3 rolling out wide today, so much has improved over the past 5 years. What a time to be alive.

English

383

58.9K

JimZ@moonares·17 Oct

@chuckcook @Tesla_AI LLM has ARC, HLE. FSD has Chuck's unprotected left

English

168

Chuck Cook@ChuckCook·17 Oct

Here is a highlight clip from this morning's dedicated FSD Supervised v14.1.2 Unprotected Left Turns video. I'm so impressed with what @Tesla_AI has done on this very complex traffic pattern. Safety critical systems are NOT easy to deploy to the masses. Don't underestimate the importance of this core @Tesla capability.

English

1.1K

55.7K

JimZ@moonares·15 Oct

@aelluswamy Auto fart?

Italiano

414

Ashok Elluswamy@aelluswamy·15 Oct

FSD v14.1.2, going to early access today, will debut a much awaited feature 🏎️💨

English

1.1K

695

6.8K

1.7M

JimZ@moonares·7 Oct

@SawyerMerritt Sounds like 14.3 before Christmas

English

381

Sawyer Merritt@SawyerMerritt·7 Oct

Elon: “FSD V14.2 rolls out in a few weeks and then 14.3 a few weeks later, depending on safety testing. There is so much change that we are carefully confirming each one.”

Elon Musk@elonmusk

Many more Tesla self-driving V14 improvements still to come, but wide rollout of V14.1 has begun. 14.2 rolls out in a few weeks and then 14.3 a few weeks later, depending on safety testing. There is so much change that we are carefully confirming each one.

English

1.5K

110.8K

JimZ@moonares·7 Oct

This is interesting. Assuming 10 GB of the 12 GB is for the model, and since deployed models usually run at INT4 (half a byte per parameter), this is a roughly 20B-parameter model. Taking LLMs as an example, 20B is the size where a model starts to get a taste of frontier capability. With AI5, Tesla can definitely push the model size another 2X~5X.

Whole Mars Catalog@wholemars

FSD 14.1 is about 12 GB
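
The back-of-envelope estimate in the tweet above can be checked with a few lines of Python. This is only a sketch under the tweet's own guesses: that 10 GB of the 12 GB package is model weights and that the weights are stored at 4 bits each. The function name, the decimal definition of a gigabyte, and the absence of any quantization overhead are assumptions added here for illustration, not known facts about FSD.

```python
GB = 1e9  # assumption: decimal gigabytes (10^9 bytes)


def params_from_weight_bytes(weight_bytes: float, bits_per_param: int) -> float:
    """Estimate parameter count from storage size, ignoring quantization overhead."""
    return weight_bytes * 8 / bits_per_param


# Tweet's guess: 10 GB of weights stored at INT4 (4 bits per parameter).
params = params_from_weight_bytes(10 * GB, bits_per_param=4)
print(f"{params / 1e9:.0f}B parameters")  # prints "20B parameters"

# The 2X~5X headroom suggested for AI5, expressed in billions of parameters.
scaled_billions = [params * factor / 1e9 for factor in (2, 5)]
print(scaled_billions)
```

Under the same assumptions, the 2X~5X range corresponds to roughly 40B to 100B parameters.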

English

992
