JimZ

5.2K posts

JimZ banner
JimZ

JimZ

@moonares

Train deep neural nets since 2015. Weight lifting. Tesla talk and FSD analysis.

North Carolina Katılım Temmuz 2010
59 Takip Edilen3K Takipçiler
Sabitlenmiş Tweet
JimZ
JimZ@moonares·
Incredible experience. Every dollar spent for this trip well worth it. #Starship #SpaceX
JimZ tweet media
English
37
73
1.6K
142.3K
JimZ
JimZ@moonares·
@bcherny How to do this in terminal? Or desktop only
English
0
0
9
1.8K
Boris Cherny
Boris Cherny@bcherny·
New in Claude Code: Code Review. A team of agents runs a deep review on every PR. We built it for ourselves first. Code output per Anthropic engineer is up 200% this year and reviews were the bottleneck Personally, I’ve been using it for a few weeks and have found it catches many real bugs that I would not have noticed otherwise
Claude@claudeai

Introducing Code Review, a new feature for Claude Code. When a PR opens, Claude dispatches a team of agents to hunt for bugs.

English
459
506
7.4K
1.2M
JimZ
JimZ@moonares·
@ivanfioravanti wishful thinking. Compute is everything. It's fighting for capital game as there are already 4-5 perfomrant labs now while they mostly focus on large variation models
English
0
0
2
304
Ivan Fioravanti ᯅ
Ivan Fioravanti ᯅ@ivanfioravanti·
I bet the former Qwen leading team will create a new lab and even stronger models 🚀
English
17
7
163
8.3K
JimZ
JimZ@moonares·
@bcherny Language support?
Français
2
0
3
10.5K
Junyang Lin
Junyang Lin@JustinLin610·
bf16 too big? here it is the fp8 version that u been expecting for! try to enjoy the benefits of our size and sparsity on ur device!
Qwen@Alibaba_Qwen

🚀 Qwen3.5-397B-A17B-FP8 weights are now open! It took some time to adapt the inference frameworks, but here we are: ✅ SGLang support is merged 🔄 vLLM PR submitted → github.com/vllm-project/v… Check the model card for example code. vLLM support landing in the next couple of days! Hugging Face: huggingface.co/Qwen/Qwen3.5-3… ModelScope: modelscope.cn/models/Qwen/Qw…

English
14
7
268
19.8K
JimZ
JimZ@moonares·
For Claude code, $20/month pro plan it is a helpful assistant, $200/month MAX plan, it is a slave army
English
2
0
3
409
TaraBull
TaraBull@TaraBull·
Lord of the Rings trilogy combined is 9 hrs 18 mins and it could have been over in seconds
English
1.2K
1.7K
21.8K
9.1M
JimZ retweetledi
Lonely
Lonely@Lonely__MH·
@novoreorx 一点见解: 先进化为超级个体,再统领 Agent 组建超级团队。前者决定了你的天花板,后者决定了你的规模化能力。二者相辅相成:自己是灵魂,Agent 是分身。✌️
中文
0
1
2
2.1K
Elon Musk
Elon Musk@elonmusk·
@davidasinclair What’s the least number of steps needed? That’s what I think most people would like to know.
English
1.3K
311
15.4K
1.2M
JimZ
JimZ@moonares·
@MistralAI Had some high hopes for Ministral3-3B, did some benchmarks, and the reults fell short, especially week in tool-calling. Also surprisingly weak in vision benchmarks, some quite bad.
English
0
0
0
59
Mistral AI
Mistral AI@MistralAI·
Introducing the Mistral 3 family of models: Frontier intelligence at all sizes. Apache 2.0. Details in 🧵
Mistral AI tweet media
English
171
806
5.4K
1.3M
Thinking Machines
Thinking Machines@thinkymachines·
Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other approaches for a fraction of the cost. thinkingmachines.ai/blog/on-policy…
Thinking Machines tweet media
English
61
404
2.8K
1.9M
JimZ
JimZ@moonares·
2nd day with FSD v14, met a fallen off tire on highway
English
0
0
0
309
JimZ
JimZ@moonares·
Anyone who was affected by the FAIR layoffs, feel free to DM. We are a fortune 500 company on east coast currently growing our AI tech devision needing all sorts of expertise across AI/ML, post training, data, eval, infra and SW.
English
2
0
0
801
JimZ
JimZ@moonares·
@tydsh My company on east coast is hiring AI reserachers & engineers, welcome to DM
English
0
0
1
65.4K
Yuandong Tian
Yuandong Tian@tydsh·
Several of my team members + myself are impacted by this layoff today. Welcome to connect :)
English
473
268
6.4K
4.4M
JimZ
JimZ@moonares·
@chuckcook No, I refuse to believe it.
English
0
0
0
83
Chuck Cook
Chuck Cook@ChuckCook·
Here is a highlight clip from this morning's dedicated FSD Supervised v14.1.2 Unprotected Left Turns video. I'm so impressed with what @Tesla_AI has done on this very complex traffic pattern. Safety critical systems are NOT easy to deploy to the masses. Don't underestimate the importance of this core @Tesla capability.
English
55
90
1.1K
55.7K
Ashok Elluswamy
Ashok Elluswamy@aelluswamy·
FSD v14.1.2, going to early access today, will debut a much awaited feature 🏎️💨
English
1.1K
695
6.8K
1.7M
Sawyer Merritt
Sawyer Merritt@SawyerMerritt·
Elon: “FSD V14.2 rolls out in a few weeks and then 14.3 a few weeks later, depending on safety testing. There is so much change that we are carefully confirming each one.”
Elon Musk@elonmusk

Many more Tesla self-driving V14 improvements still to come, but wide rollout of V14.1 has begun. 14.2 rolls out in a few weeks and then 14.3 a few weeks later, depending on safety testing. There is so much change that we are carefully confirming each one.

English
51
60
1.5K
110.8K
JimZ
JimZ@moonares·
This is interesting. Assuming 10GB of the 12GB is for the model, usually the model deployed is at INT4 , so this is a 20B model. Taking LLM as an example, 20B is the size that starts to having a taste of frontier model. With AI5, Tesla can definitely push the model size to another 2X~5X.
Whole Mars Catalog@wholemars

FSD 14.1 is about 12 GB

English
1
0
1
992