Herr Greenrush (e/acc)

11.8K posts

Herr Greenrush (e/acc) banner
Herr Greenrush (e/acc)

Herr Greenrush (e/acc)

@HerrGreenrush

Coder and GPU-poor AI alchemist. News and thoughts about technology. English/Deutsch.

Switzerland เข้าร่วม Ekim 2017
2.9K กำลังติดตาม687 ผู้ติดตาม
Herr Greenrush (e/acc) รีทวีตแล้ว
Romain Huet
Romain Huet@romainhuet·
We just launched Codex use cases! It’s a gallery of practical examples across coding and non-coding tasks, with real ways to use Codex. One thing I really like: if you have the app, you can open the starter prompt for each use case directly in Codex! developers.openai.com/codex/use-cases
English
56
203
1.9K
256.6K
Herr Greenrush (e/acc) รีทวีตแล้ว
Bloomberg
Bloomberg@business·
Physical Intelligence, a two-year-old AI robotics startup by ex-DeepMind staffers, is in talks to double its valuation to more than $11 billion bloomberg.com/news/articles/…
English
2
17
122
26.5K
Herr Greenrush (e/acc) รีทวีตแล้ว
Sam Altman
Sam Altman@sama·
The first steel beams went up this week at our Michigan Stargate site with Oracle and Related Digital
English
826
294
5.2K
738.3K
Herr Greenrush (e/acc) รีทวีตแล้ว
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
AI inference isn't a commodity. It's a managed experience. Labs that understand the interactivity lever operate at 60%+ margins. The rest race to zero. (5/5)
SemiAnalysis tweet media
English
5
11
140
17.5K
Herr Greenrush (e/acc) รีทวีตแล้ว
Epoch AI
Epoch AI@EpochAIResearch·
The total memory bandwidth of AI chips shipped since 2022 has reached 70 million terabytes per second, growing 4.1x per year. That's around 300,000x more data per second than global internet traffic.
Epoch AI tweet media
English
5
18
166
7.8K
Herr Greenrush (e/acc) รีทวีตแล้ว
Ben Bajarin
Ben Bajarin@BenBajarin·
From here on out, all silicon architecture designs will be designed with agentic AI in mind. Basically, all silicon becomes agentic workload-focused.
English
7
4
27
4.7K
Herr Greenrush (e/acc) รีทวีตแล้ว
Zephyr
Zephyr@zephyr_z9·
"HG Tech recently raised its projection of optical transceiver demand in China from 20 million in 2026 to 20 to 30 million. That's around 1/3 of global demand." Interesting
tphuang@tphuang

Glenn is framing this Reuters article correctly. The most important part to take-away from this article is its px as part of Atlas-350 & that ByteDance is ordering it. Since Reuters report of production # keep going up every time, I would seriously just ignore any # they provide. Remember all the stories of DeepSeek V4 coming out in a week? Where is it? There are already 7 Atlas-350 vendors during the launch last week & Reuters report its targeting shipment in 2H? This is entirely ludicrous. Now, I wouldn't compare Ascend-950PR to H200, since the former is a lower cost option using cheaper/lower end HBM designed for inference. PR - (Prefill & Recommendation). Given the lower yield of producing HBM3 type of HBM (stack 8 die on top of each other, if yield of 1 die 90% -> 8 dies yield 43%. If you layer 4 die instead, yield 66%). Take a look at current inference speed of Minimax or GLM on Macbooks using VRAM. You don't need HBM3 to run inference on these models. Reuters article also fundamentally doesn't understand what "premium version" is. Ascend-950DT (Decode & training) - hence the support for 2 TB/s interconnect speed, 4 TB/s 144GB HBM & connected together in Atlas-950 thru OXC/OCS - fully optical SuperNode that can be connected w/ 63 other ones to form 1 ZFLOP SuperCluster, enough for multiple trillion parameter training runs. There are many reason why HW is doing things this way. 950 die is clearly smaller than 910C due to lower yield of SMIC N+2 vs TSMC N7. Training is harder than inference from networking + software pov. See where optical network tech is now wrt Hisilicon & such. 8-layer HBM is harder than 4-layer HBM, so you start of by producing 4-layer HBM, which is more than sufficient for inference. Inference demand is also much higher than training due to the recent OpenClaw phenomenon. As for production #. since China is expected to have close to 100k wpm of 5-7nm capacity by EOY. Let's say 25k is used for AI chips. And you get 80 good die per wafer (+ 2 die per chip), do your own calculation. HG Tech recently raised its projection of optical transceiver demand in China from 20 million in 2026 to 20 to 30 million. That's around 1/3 of global demand. Again, do your own projections. Things shouldn't be difficult to figure out if you put your mind into it. And no, EUV is not a must. It's a good to have.

English
3
4
44
13.5K
Herr Greenrush (e/acc) รีทวีตแล้ว
AI at Meta
AI at Meta@AIatMeta·
We’re releasing SAM 3.1: a drop-in update to SAM 3 that introduces object multiplexing to significantly improve video processing efficiency without sacrificing accuracy. We’re sharing this update with the community to help make high-performance applications feasible on smaller, more accessible hardware. 🔗 Model Checkpoint: go.meta.me/8dd321 🔗 Codebase: go.meta.me/b0a9fb
AI at Meta tweet media
English
56
194
1.6K
154K
Herr Greenrush (e/acc) รีทวีตแล้ว
matthew sigel, recovering CFA
NVDA H100 rental prices hit 18-month high: $2.59/hour
matthew sigel, recovering CFA tweet media
Nederlands
22
75
678
137.5K
Herr Greenrush (e/acc) รีทวีตแล้ว
The AI Investor
The AI Investor@The_AI_Investor·
The fact that bad takes, or FUD, spread much faster on X than good ones says something about either the algorithm or human behavior. Take TurboQuant as an example. My post was early, but it got low engagement, while some other FUD posts got millions of views. I am not saying I should get higher engagement, but X should punish bad takes and low quality content instead of promoting it. @nikitabier
The AI Investor tweet mediaThe AI Investor tweet mediaThe AI Investor tweet media
English
3
2
37
3.1K
Herr Greenrush (e/acc) รีทวีตแล้ว
Theo Bearman
Theo Bearman@theobearman·
Per the Fortune article: After being contacted by Fortune, the company acknowledged that is developing and testing with early access customers a new model that it said represented a “step change” in AI capabilities, with significantly better performance in “reasoning, coding, and cybersecurity” than prior Anthropic models. Regardless of when 'Mythos' is publicly released, I look forward to reading Anthropic's discussion "in a system card or elsewhere" of "how that model’s capabilities and propensities affect or change the analysis in the Risk Report" within the next 30 days.
Theo Bearman tweet media
prinz@deredleritt3r

Anthropic has been testing a new model called "Mythos" with certain customers: - a "step change" in AI capabilities, including "dramatically higher scores" in coding, academic reasoning and cybersecurity - "currently far ahead of any other AI model in cyber capabilities” - part of a new "Capybara" series of models, which are larger and more intelligent than Opus - more expensive to run than Opus; not yet ready for general release

English
0
3
18
3K
Herr Greenrush (e/acc) รีทวีตแล้ว
M1
M1@M1Astra·
Claude Mythos Blog Post Saved before it was taken down. m1astra-mythos.pages.dev
English
127
229
2.3K
2.7M
Herr Greenrush (e/acc) รีทวีตแล้ว
Dan Nystedt
Dan Nystedt@dnystedt·
Micron inaugurated its new US$1.8 billion Tongluo fab campus in Taiwan on Thursday (3/26), media report, adding it expects volume shipments from the site to start in its fiscal year 2028 (CYQ3 2027) and will hire 1,000 new workers, raising its total in Taiwan to 15,000 by end-2026. 1/2 $MU #Powerchip #Taiwan #semiconductors #semiconductor
English
1
14
85
7.5K
Herr Greenrush (e/acc) รีทวีตแล้ว
Jukan
Jukan@jukan05·
* REUTERS: Chinese big tech companies, including ByteDance and Alibaba, are expected to place large-volume orders for Huawei’s 950PR. * REUTERS: Chinese big tech companies are reportedly satisfied that the 950PR offers better compatibility with Nvidia’s CUDA software ecosystem and faster response speeds compared with the 910C. * REUTERS: Huawei plans to ship about 750,000 units of the 950PR this year.
Jukan@jukan05

Wait, what? According to Reuters, the 950PR is said to come in two versions: one using DDR and the other using HBM.

English
7
56
334
48.2K
Herr Greenrush (e/acc) รีทวีตแล้ว
Jukan
Jukan@jukan05·
According to Korean media, purchasing departments at domestic semiconductor manufacturers such as Samsung and SK hynix are checking on a daily basis whether key materials such as helium can be procured and how prices are moving, as they make every effort to prevent production disruptions. Spot helium prices have already surged by more than 50%. Since these materials account for only a small portion of semiconductor production costs, they are not expected to have a direct impact on chip prices. However, the biggest concern is potential disruption to production. Accordingly, Samsung and SK hynix are said to be securing inventory by accepting market prices without resistance. (The Elec)
English
7
32
308
51.9K
Herr Greenrush (e/acc) รีทวีตแล้ว
Financial Times
The Pentagon has been blocked by a US court from punishing Anthropic over its refusal to allow unrestricted use of its technology in warfare, in a blow to the Trump administration in its row with the AI start-up. ft.trib.al/H73mt2F
Financial Times tweet media
English
35
307
788
41.4K
Herr Greenrush (e/acc) รีทวีตแล้ว
Bloomberg
Bloomberg@business·
Anthropic is considering going public as soon as in October, sources say, as the artificial intelligence company races with rival OpenAI to hold an IPO bloomberg.com/news/articles/…
English
24
112
532
158.3K
Herr Greenrush (e/acc) รีทวีตแล้ว
SemiVision👁️👁️
SemiVision👁️👁️@semivision_tw·
$LITE is currently building a new U.S.-based laser manufacturing facility in North Carolina to produce InP-based optical components for AI data centers, with $NVDA as a major customer. The project involves an investment of several hundred million dollars, with mass production targeted to begin around mid-2028. open.substack.com/pub/tspasemico…
SemiVision👁️👁️ tweet media
English
0
3
30
2.8K