Jin Budelmann

14.7K posts

Jin Budelmann

Jin Budelmann

@jinthagerman

Building stuff https://t.co/jLMEyb6Bh5

Katılım Eylül 2010
455 Takip Edilen649 Takipçiler
Jin Budelmann retweetledi
Boris Cherny
Boris Cherny@bcherny·
@jinthagerman Yeah we’ve been working on improving reliability. It should be feeling better lately, expect it to improve more
English
1
1
8
3.5K
Jin Budelmann
Jin Budelmann@jinthagerman·
@bcherny Classifier is down 90% of the time I try use it. It doesn’t do the thing I want it to do which is allow me to leave and trust safe actions still happen
English
1
0
2
4K
Boris Cherny
Boris Cherny@bcherny·
People often ask what my biggest tip is for getting the most out of Claude Code. These days my #1 tip is: use auto mode Auto mode means no more permission prompts. It is the key building block for multi-clauding: start a session, then while it runs, work on another session in parallel.
ClaudeDevs@ClaudeDevs

Two updates to auto mode: · Now available on the Pro plan · Sonnet 4.6 is now supported, alongside Opus 4.7 Shift+tab, and let Claude run.

English
387
263
4.9K
796.5K
Jin Budelmann
Jin Budelmann@jinthagerman·
@cryptopunk7213 This is a weird take. Why does it matter if Apple uses Claude to build? It has nothing material to do with Apple using Gemini in their products.
English
0
0
0
88
Jin Budelmann
Jin Budelmann@jinthagerman·
I hate that I feel obligated to do this but this is the most condescending post I’ve read on this platform. I dono how you thought this would land but geez.
Theo - t3.gg@theo

A letter to my friends at Anthropic I hate that I feel obligated to do this. I hate that I've had to be so harsh towards Anthropic for the past few months. I really, really don't want to. I know it might feel like I'm doing this for clicks or something, but I promise I'm not. My pro-Anthropic content ALWAYS outperforms my anti-Anthropic content. I have cost myself a lot of money, opportunities, sponsors, and more. I'm doing this because you work for an evil cult. I'm begging you to wake up. Your CEO, Dario, does not respect engineers. This is obvious. He couldn't make it more obvious if he tried (and I think he's trying pretty hard) You know this, but you don't want to acknowledge it. It has kept you up many nights. You know that bad code is shipping to users. You know that one bad tweet might get you fired. You fear for your vesting schedules. You're afraid. Nobody deserves what you're going through right now. You go to work afraid, you leave work afraid, and you go to YouTube to keep up on the dev world, just to hear me yelling all about how evil your company is. You deserve better. You might not feel like you do, but you know deep down that this isn't right. I hope you know how deeply I feel for you. I'm sorry. I know I haven't helped you much individually, and I want to be better about this. If you're ready to leave, please hit me up. I swear I'll never tell a soul. I have friends at every lab and most startups in the AI world. Most of them would be down to match your current vesting schedules, possibly even go beyond. If you're staying for the money, I beg you to hit me up. We can make the money happen somewhere that hates you less. I know I'm asking for a lot of trust here, and that you're scared after seeing how hard I've been on Anthropic. I can't blame you at all for that. I should have posted something like this months ago. That's my failure to own and I will own it to my best ability. If you're willing to trust me in this moment, I can make it right. Let me help you escape. You deserve to work somewhere that you can have impact. Somewhere that listens when you feel something is wrong. Somewhere that won't fire you when you point out the things that hurt your users. My DMs are always open to you. When you're ready, let me know. I promise to make it right.

English
0
0
0
115
Jin Budelmann
Jin Budelmann@jinthagerman·
@cryptopunk7213 v4 was not trained on Ascend by a long shot. Flash was and only partially. That’s a gross overstatement
English
0
0
0
17
Ejaaz
Ejaaz@cryptopunk7213·
china said "we don't fucking need the USA to win the ai race" today. blocking the $2B meta manus deal is half the story: > china released 3 models in the last 7 days that are 95% frontier model capability (opus4.7, gpt 5.4) > deepseek v4 was trained on Huawei chips and will only use chinese chips going forward > china has told all chinese AI labs NOT to use nvidia chips. bytedance has been banned. > china has 3X energy vs. USA to power training of new models. its not even a competition. message is pretty fucking clear: "we have our own sovreign ai stack" either that or china wants to use this as a bargaining chip to convince trump to sell them the best nvidia chips.
Ejaaz@cryptopunk7213

i mean what an insane story. meta’s $2B acquisition is now dead: -> March 2025: Manus launches AI agent -> 2M waitlist in 7 days, invite codes reselling for $1400 each (i tried to buy one lol) -> relocates from Beijing to Singapore to escape chinese authorities -> hits $90M revenue run-rate -> December 2025: Meta buys it for $2B 4x markup in 8 months. -> March 2026: china stops Manus co-founders from leaving the country -> today: china blocks Meta from acquiring them china clearly see’s AI as a geopolitical weapon and is not willing to yield anything to the USA this probably why they’re refusing to use nvidia chips instead pushing huawei etc truly nuts

English
8
5
55
7.9K
Ricardo
Ricardo@Ric_RTP·
China just made Silicon Valley's entire AI industry look like a scam. The US government spent 3 years trying to stop China from building competitive AI. But this backfired HORRIBLY. Here's what happened: Yesterday, a Chinese startup called DeepSeek released a new AI model called V4. It matches the performance of OpenAI and Anthropic's best models. At 1/7th the price. And for the first time ever, it was built on Chinese chips. NOT American ones. That last part is the one that terrifies the west. For context: Since 2022, the US has banned the export of advanced AI chips to China. The entire strategy was built on the assumption that if China can't access Nvidia's best hardware, they can't build frontier AI. But DeepSeek just proved that assumption wrong. Their V4 model was trained and runs on Huawei's Ascend chips. Huawei spent months working directly with DeepSeek to make sure V4 runs across their entire line of AI processors. Jensen Huang even predicted this on a recent podcast: "The day that DeepSeek comes out on Huawei first, that is a horrible outcome for our nation." That day was yesterday. And the numbers are crazy: DeepSeek V4 costs $3.48 per million output tokens. OpenAI's latest model GPT-5.5 costs $30. Anthropic's Claude charges $25. Same ballpark performance. 7x cheaper. Uber's CTO just admitted they burned through their ENTIRE 2026 AI budget in 4 months using Anthropic's tools. If Uber had used DeepSeek instead, that same budget would have lasted 7 YEARS. 4 months vs 7 years. Same work getting done. But the pricing isn't even the big thing here. The real story is what DeepSeek did with their technical report: They published the benchmarks where they LOSE. Every AI company cherry-picks the tests where their model wins. DeepSeek ran the full comparison against GPT-5.4 and Google's Gemini, found they trail frontier models by 3 to 6 months, and printed it anyway. They literally don't care because the price gap makes the performance gap irrelevant for 90% of use cases. So the US export controls didn't slow China down. They ACCELERATED China's independence. Because Chinese developers were FORCED to train models with limited resources, they had to figure out how to make AI radically more efficient. That constraint became their competitive advantage. Every generation of DeepSeek has gotten dramatically cheaper to train. V4 continues the trend. Meanwhile US companies are going the OPPOSITE direction: OpenAI's GPT-5.5 Pro costs $180 per million output tokens. That's 51x more expensive than DeepSeek V4 for comparable work. The Commerce Secretary confirmed this week that ZERO Nvidia advanced chip shipments have actually gone through to China despite being approved in January. So China built frontier AI anyway. Without American chips. At a fraction of the cost. And the market response tells you everything: Chinese chipmaker SMIC surged 10%. Huahong Semiconductor jumped 15%. DeepSeek's Chinese AI competitors Zhipu AI and MiniMax dropped 9% because V4 is destroying them too. DeepSeek is making Silicon Valley's pricing model look like a scam. US tech companies spent $650 billion on AI infrastructure this year. DeepSeek just showed the world you can match their output for pennies. The export controls were supposed to be America's ace card. Instead they taught China how to win without American chips, at American prices nobody can compete with. Jensen Huang was right. This is a horrible outcome. But it's the outcome America built for itself.
English
137
963
2.4K
278.4K
Armaan Sidhu
Armaan Sidhu@realarmaansidhu·
DeepSeek just shipped V4. 1.6 trillion parameters. 1 million token context. Free and open source. The kicker: it's the first major model trained almost entirely on Huawei Ascend chips. The moat the US built around AI leadership just evaporated on a single launch. The numbers: V4-Pro runs 1.6T total / 49B active parameters. V4-Flash runs 284B / 13B active. Both ship with 1M context as the default across every API endpoint. The tech report claims open-source SOTA on agentic coding benchmarks. Rich world knowledge trails only Gemini 3.1 Pro. Math, STEM, and coding rival top closed-source labs. Liang Wenfeng's sign-off: "The whole world should use a 1.6T model for free." That's the thesis. Meanwhile OpenAI shipped GPT-5.5 two days ago behind a $200/month paywall, Anthropic gates Claude Opus behind enterprise contracts, and Google's Gemini 3.1 Pro is API-metered. The hardware story is what destroys the narrative. The Biden and Trump export controls on Nvidia H100/B200 chips were explicitly designed to prevent exactly this outcome. Stop advanced chips from reaching China. Slow Chinese AI training by 18-24 months. Buy American labs time to entrench. Two things happened instead. First, Huawei's Ascend 910B and 910C stack matured faster than US intelligence projected. Second, Chinese labs optimized training pipelines aggressively enough that they no longer need H100-tier hardware to reach frontier performance. DeepSeek V4 is proof the wall failed. Kimi K2.6 took Claude 4.7 Pro's crown on SWE-bench last week. Qwen 3.6 27B ties Claude 4.5 Opus from an 18GB laptop. Three Chinese open-source frontier launches in eight days. American labs have gone quiet on X. Chinese labs publicly celebrate each other's releases as a team. The contrast is the whole story. The US bet the AI decade on a chip embargo. The Chinese bet it on shipping open-source on chips they built themselves. One of those bets is already in production. The other is still waiting at customs.
DeepSeek@deepseek_ai

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: huggingface.co/deepseek-ai/De… 🤗 Open Weights: huggingface.co/collections/de… 1/n

English
2
0
2
262
geoff
geoff@GeoffreyHuntley·
gm
geoff tweet media
41
1
136
7.1K
Jin Budelmann retweetledi
eric provencher
eric provencher@pvncher·
On Repo Bench Opus 4.7 scores meaningfully worse than 4.6, which did worse than 4.5 With this score it's no longer in the top 25
eric provencher tweet media
English
12
16
281
28.9K
0xSero
0xSero@0xSero·
Guide to running BIG B0Is on your small hardware. 1. Use REAPs: up to 50% savings 2. Use quantisations: 75% savings - AWQ / GPTQ / W4A16 / FP8 = FAST inference - GGUF / EXL3 = Slow but just works - MLX = Best for apple 3. Use 8bit KV cache: 50-75% savings
0xSero tweet media0xSero tweet media
English
16
21
322
16.3K
cyprusad
cyprusad@cyprusad·
@0xSero What is a REAP and what does observation mean here?
English
3
0
1
443
0xSero
0xSero@0xSero·
Currently: 1. Framework desktop quantising and benchmarkjng all my qwen3.5-122B reaps on the strix halo 2. 8x 3090s inference for my friend qwen3.5-262b 3. 4x B200s observing GLM-5.1 for reap. Should be done Thursday 4. 8x H100 observing Qwen3.5-397B for pruning done Friday 5. H100 observing Trinity Large Thinking for pruning 135M tokens of observation data, packaged into batches of 16k tokens - cyber security - philosophy - math - reasoning - coding - agentic - terminal - browser - cuda - my own sessions Once done I’m going to start optimising this observation process with autoresearch Cheap AI at home
0xSero tweet media
English
49
11
532
36.1K
Jin Budelmann retweetledi
NetworkChuck
NetworkChuck@NetworkChuck·
My take on Anthropic's new Mythos model: Don't worry about it. It might be amazing, it might be disruptive. But there's nothing you can do about it. Ignore the hype, keep your head down and keep learning, growing and creating. Adopt this: Relentless Optimism.
English
144
179
2.5K
58.2K
Jin Budelmann retweetledi
dax
dax@thdxr·
no no i've definitely been doing a lot of work it's all just too dangerous to release
English
116
462
8K
498.7K
spor
spor@sporadica·
being a visceral hater is the new meta, apparently an entire genre of twitter personality has just converged on this meme. they’re just doing the meme. and they don’t even realize it lmao
spor tweet media
Mo@atmoio

Claude Mythos is Delusional

English
46
8
232
25K
Jin Budelmann
Jin Budelmann@jinthagerman·
They have a openclaw sized hole in their inference compute budget now, wonder what replaces that...
English
0
0
1
23
Jin Budelmann
Jin Budelmann@jinthagerman·
I say this as a mostly very happy Claude Code user but the last week has been weird and it's been bugging me: - Anthropic are obviously struggling with inference demand and also trading off compute for inference with compute for training - If you need evidence of this, see OpenClaw bans, see pretty consistent model quality concerns from the community, see patchy inference speeds, see off peak 2x limits - There is no world where I believe they are gating Mythos for safety alone. If they could release it publicly, they would - Enterprise is going to pay through the nose for the Mythos inference they have available - They can now delay distillation from 3rd parties indefinitely - And it allows them to build hype and desirability which they can parlay into more compute investment - Don't fall for the hype or the doom, remember they just leaked the source for Claude Code, yet they have the "most sophisticated cyber security model ever" 🤔
English
1
0
1
58