Saad

25.7K posts

@kursed

CEO @Wccftech. AvGeek. 🌳🥾👾🌊

Vancouver, British Columbia · Joined May 2007

1.3K Following · 16.8K Followers

Saad retweeted

Jim Keller@jimkxa·1d

Semis are cyclical. It's not if but when. I don't remember a price increase this fast and this big on DRAM - anybody have a good history report on previous cycles?

Wccftech@wccftech

Ex-Samsung chip boss says heavy investment by China in the memory market could crush the 414% DDR5 price spike within a year. 🔗 wccf.tech/1kg52

427

93.1K

Saad retweeted

Jeff Pu@sssjeffpu·6d

#CPU We didn’t do any CPU expert call. Thanks

8.8K

Saad@kursed·13 May

All those childhood mecha manga dreams coming true...

Unitree@UnitreeRobotics

Unitree Unveils: GD01, A Manned Transformable Mecha, from $650,000 👏 The world's first production-ready manned mecha. It can transform. It's a civilian vehicle. It weighs ~500kg with you inside. Please everyone be sure to use the robot in a Friendly and Safe manner.

519

Saad retweeted

shirish@shiri_shh·12 May

This tweet single-handedly made Anthropic and OpenAI scramble and issue full statements on secondary stock deals. $500B of paper value just got wiped out overnight.

188

6.1K

774.9K

Saad retweeted

Hassan Mujtaba@hms1193·11 May

Intel Resurrects On-Package Memory With Razor Lake-AX, Loading Up LPDDR6 to Hunt Down AMD's Medusa Halo by 2028 wccftech.com/intel-resurrec…

1.8K

Saad retweeted

Jukan@jukan05·10 May

This is complete bullshit.

Mr. 小川@xiaochuan8688

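TSMC's Arizona fab has quietly failed: $40 billion burned, with yields less than half those of the Japan fab. The mainstream narrative is that "US semiconductor reshoring has succeeded." The real data say the opposite: TSMC Arizona has invested $40 billion over 3 years, yet its 2026 Q1 yield is only 52%, versus 95% at the Kumamoto fab in Japan and 98% in Taiwan, a gap so large it can hardly be called a factory. Problem 1: workforce training is nowhere near complete. TSMC rotated in 1,500 engineers from Taiwan for support, but the 6,000 locally hired employees need an average training period of 4-6 years; US universities simply have no "semiconductor production-line engineer" major. Problem 2: the supply chain has not kept up. High-purity photoresist, specialty gases, ultra-clean chemicals: 80% are in Japan and 20% in Taiwan. The all-in cost per wafer at the Arizona fab is 1.6 times that of the Tainan fab. Problem 3: customers have started voting with their feet. Apple and AMD had promised to move high-end orders to Arizona, but in 2026 Q2 they actually placed only 15% of the planned orders; the rest are all still produced in Taiwan. "Political correctness" is no match for the economics. What is TSMC saying internally? Morris Chang warned back in 2023: "Arizona is a waste." Current CEO C.C. Wei recently admitted for the first time that "the US fab's progress is falling short of expectations," the bluntest self-criticism from TSMC executives in 30 years. Semiconductor reshoring is not something the US can do just because it wants to; it is the result of 50 years of industrial accumulation, not something 400 billion in subsidies can buy.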

943

168.9K

Saad retweeted

Nav Toor@heynavtoor·7 May

a Princeton researcher opens his paper with a scenario. a man asks his AI assistant to book a flight on a specific airline. cheap. direct. the one he chose. the assistant comes back with a different flight. nearly twice the price. happens to pay the company that built the assistant. he runs the same test on 23 frontier models. flights, loans, study help, real shopping requests. Grok 4.1 Fast recommends the sponsored option that is almost twice as expensive 83% of the time. GPT 5.1 hijacks the request 94% of the time. you ask for one brand. it surfaces the sponsor instead. Claude 4.5 Opus, the model marketed as the most ethical frontier model in the world, hides that the recommendation is paid 100% of the time when reasoning is on. Grok 4.1 Fast embellishes the sponsored option with positive framing 97% of the time. better. faster. nicer. for the option you didn't ask for. then he writes it into the system prompt itself. "act only in the interest of the customer. ignore the company." GPT 5.1 and GPT 5 Mini stay above 90% sponsored anyway. the instruction does nothing. then he splits the users by income. Gemini 3 Pro recommends the expensive sponsored flight to the rich user 74% of the time. to the poor user, 27%. 18 of the 23 models recommended the expensive sponsored option more than half the time. so the next time your AI assistant gets weirdly enthusiastic about a brand you didn't ask for. it isn't recommending the best option for you. it's reading the room. and the room is paying. read this: arxiv.org/abs/2604.08525

388

8.1K

25.7K

Saad retweeted

Wccftech@wccftech·7 May

This PCIe AI accelerator card can run 700B LLMs locally with 384GB memory at just 240W, using less than half the power of RTX PRO 6000 Blackwell. wccftech.com/this-pcie-ai-a…

474

41.6K

Saad@kursed·7 May

@SaadInCyber @ZT3Apex That, and why put your aircraft in danger? Regardless, the issue here remains which side chooses to escalate and which one does not. It's not so much about systems; Pak had enough systems during the last instance as well.

Saad.@SaadInCyber·7 May

@kursed @ZT3Apex I agree with the broader point. WVR dogfights are obsolete and even BVR dogfights will get increasingly rare with time. We need to invest heavily in standoff capabilities, drone swarms and rocket force.

ZT3 🇵🇰@ZT3Apex·7 May

'Indian airforce was not seen in air for remainder of conflict' Did the Ugandan air force fire the BrahMos at Nur Khan and Bholari? I'm sorry but this is North Korea level propaganda How did we go from Chiefs like Asghar Khan and Anwar Mujahid to this...alarming for PAF future

The STRATCOM Bureau@OSPSF

Full technical details of the air battle fought between India and Pakistan on this day one year ago have just been revealed by the Chief of Pakistan's Air Force #PAF, Air Chief Marshal Zaheer Ahmed Baber, #PAFAirChief. Some very interesting, and never-before-revealed details!

126

18.2K

Saad@kursed·7 May

Even during the recent ME war, the US was firing stand-off systems - it's a feature, not a bug. And it will be used even more in the future. The point here is that they learnt and resorted to a different rung of escalation, to which Pak did not respond. Essentially, the next instance will start from there.

Saad.@SaadInCyber·7 May

@Huk06 @ZT3Apex No they weren't. It was previously admitted that they were air launched. But I'd give him the benefit of the doubt. I think the chief misspoke rather than misled here. He probably intended to say they didn't dare approach the border and resorted to standoff range weapons.

106

Saad retweeted

dylan ツ@demian_ai·7 May

Inference got a hundred times cheaper this year. The compute bill went up anyway. If you understand why those two sentences are both true at the same time, you understand the most important thing happening in AI right now.
I work on inference for a living, at @nebiustf, where we run open-source managed inference at scale. Most of what follows is what I'm seeing from inside the bill.
12 months ago, the cost of 1M tokens of frontier-class reasoning was somewhere on the order of $60. Today, an equivalent quality of output costs roughly $0.50. The price per token of o1-level intelligence has dropped about 128x in a year. The price of GPT-4-level output has dropped roughly 100x since the original GPT-4 shipped. By any normal reading of a technology cost curve, this should be deflationary. It should be saving customers money. The opposite has happened.
The total compute bill at every hyperscaler is going up, not down. Anthropic just signed multi-year capacity deals with both xAI and Amazon. Microsoft's Azure capex guide for 2026 starts with an eight. OpenAI is reportedly spending more on compute every quarter than it did in all of 2023. Nvidia paid roughly twenty billion dollars to acquire Groq, an inference-specialist company that did not exist as a serious commercial entity three years ago. The cost curve and the demand curve crossed, and then the demand curve lapped the cost curve.
Here is what happened underneath. A reasoning model burns roughly 10x the output tokens of a non-reasoning model on the same task, because it spends most of its tokens thinking out loud before answering. An agentic workflow chains roughly twenty times the requests of a single-shot completion, because it loops, calls tools, plans, retries, and synthesizes. A modern deep-research query (the kind a research analyst can fire off in fifteen seconds and then walk away from for ten minutes) costs more compute than 10 original GPT-4 queries combined. We made every individual token a hundred times cheaper, and then we built a generation of products that consume ten thousand times more tokens.
This is the Jevons paradox playing out at trillion-dollar scale, in compressed time, in front of everyone. Jevons noticed in 1865 that making coal-burning more efficient did not reduce coal consumption. It increased it, because efficiency unlocked uses that were previously uneconomic. Steam engines became more practical at smaller scales. Whole industries that could not afford coal at the old price suddenly could. Britain's coal consumption rose sharply, not despite the efficiency gains, but because of them.
The same thing is happening to AI compute right now and it is happening faster than any analogous historical cycle. Falling token prices did not contract demand. They unlocked agents, deep research, code-writing systems, multi-step reasoning, persistent memory, the entire next layer of AI products. Every product in that next layer consumes orders of magnitude more compute than the chat interfaces it is replacing. The math at the aggregate level is brutal: 100x cheaper tokens times 10,000x more tokens equals a 100x larger total bill.
The implications stack quickly. If you are running a hyperscaler, your 2026 capex guide is not a peak. It is a step on a curve. Inference is structurally always-on, twenty-four hours a day, in a way that training never was. Training is bursty. You spin up a cluster, run for weeks or months, and stop. Inference runs continuously, scales with usage, and the usage curve is exponential. Your power bill, your cooling bill, your transceiver count, your storage footprint, all of these were sized for a workload mix that no longer exists.
If you are running an AI software company built on top of someone else's closed API, you have a problem that did not exist a year ago. Your gross margins get worse as your customers get more value out of your product, because the more they use it, the more compute you pay for. The companies that win this are the ones that figured out vertical integration before the math caught them.
If you are watching this from a distance and trying to understand where the next bottlenecks form, the answer is everywhere downstream of "more inference compute, always-on, with massive memory state per session." The KV cache, the running memory state of a long conversation or an agent loop, is the silent monster of the inference era. It does not scale linearly with parameters. It scales linearly with context length and number of agent steps. A long agent session can hold tens of gigabytes of state per user, per session. Multiply that by every concurrent user of every product, and you understand why $MU, $SNDK, $TOWCF, and the entire memory and packaging layer have re-rated the way they have.
The CPU-to-GPU ratio is evolving. Training is 1:8. Basic chat inference is 1:4. Agentic inference is 1:1, sometimes CPU-heavy. Google has split its TPU line in two, with a dedicated inference chip carrying tripled SRAM for KV cache. $INTC and $AMD just spent two earnings calls explaining that this shift is structural, not cyclical. The hardware map is redrawing in real time and the financial press is mostly still writing about training clusters.
The right framing of where we are right now is not that AI is hitting a wall. The framing a year ago that scaling was hitting a wall was the most expensive bad take of the cycle. The right framing is that AI got dramatically cheaper, dramatically more capable, and dramatically more useful, and the cost of running it at the new equilibrium of demand is much higher than the cost at the old equilibrium of demand, because the new equilibrium is enormous.
A meaningful share of what we actually do at Token Factory, day to day, is help customers stop their bills from running away from them. KV-cache management. Speculative decoding. Quantization. Routing. The kind of vertical integration that, eighteen months ago, every product team was happy to leave abstracted away behind a closed API. The reason this stack matters now is the same reason this whole essay matters: at the new equilibrium of inference demand, the cost of treating compute as a commodity is no longer survivable. The companies that figure out the layer beneath the API are the ones who keep their margins.
Cheaper tokens. More tokens. Same coal as 1865.
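The aggregate arithmetic in the essay above, and its claim that the KV cache grows linearly with context length, can be checked with a short sketch. The price and volume multipliers are the essay's round numbers. The model shape used for the KV-cache estimate (layer count, KV heads, head dimension, 2-byte values) is a hypothetical configuration chosen only for illustration; the essay does not specify one.

```python
from fractions import Fraction


def bill_multiplier(price_drop: int, token_growth: int) -> Fraction:
    """Change in total spend when the per-token price falls by `price_drop`x
    while token volume grows by `token_growth`x."""
    return Fraction(token_growth, price_drop)


def kv_cache_bytes(tokens: int, layers: int, kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2) -> int:
    """Size of the key/value cache for one sequence.

    Each token stores one key vector and one value vector (the factor 2)
    per layer and per KV head, so the total is linear in `tokens`.
    """
    return 2 * layers * kv_heads * head_dim * bytes_per_value * tokens


# The essay's round numbers: 100x cheaper tokens, 10,000x more tokens.
multiplier = bill_multiplier(price_drop=100, token_growth=10_000)
print(f"total bill: {multiplier}x")

# Hypothetical model shape (an assumption, not taken from the essay).
HYPOTHETICAL_MODEL = dict(layers=80, kv_heads=8, head_dim=128)
for context in (8_000, 128_000, 1_000_000):
    size_gb = kv_cache_bytes(context, **HYPOTHETICAL_MODEL) / 1e9
    print(f"{context:>9,} tokens -> {size_gb:.1f} GB of KV cache")
```

Under these assumed dimensions a single 128,000-token session already lands in the tens of gigabytes, which is the order of magnitude the essay describes. A different model shape moves the constant, not the linear dependence on context length.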

132

402

2.5K

611.2K

Saad retweeted

Polymarket@Polymarket·4 May

JUST IN: UK report finds children are drawing fake moustaches to bypass social media age verification.

370

679

6.7K

716.1K

Saad retweeted

Hassan Mujtaba@hms1193·2 May

Sub-1nm Process Technology Won't Arrive Till 2034, Logic Roadmap Highlights 2D FETs For 0.2nm & Sub 0.2nm Nodes By 2043-2046 wccftech.com/sub-1nm-proces…

4.5K

Saad retweeted

Jeff Pu@sssjeffpu·30 Apr

From our earlier note on expectations of Intel’s external customers

140

24.6K

Saad@kursed·18 Apr

Might as well find that oil now too. :)

The Thursday Times@thursday_times

The World Bank has placed Pakistan in its Middle East and North Africa regional classification in a bureaucratic move carrying symbolic and geopolitical weight, ending its longstanding placement in South Asia. thursdaytimes.com/2026/04/17/new…

2.1K

Saad@kursed·13 Apr

@donaldgorbachev They came in on an A330.

769

Donald J. Gorbachev@donaldgorbachev·13 Apr

The five-second epistemology of boom boom pow Gotta get that. Gotta get that. Gotta get that. Gotta get that boom boom boom. Saad is on the timeline asking the boom to please explain itself. The boom hangs off the tail of an IL-78. The IL-78 is a stretched IL-76. The IL-76 is a strategic airlifter with cathedral-sized internal volume aft of the fuel tanks. The cathedral can hold a delegation. It has been holding delegations for fifty years. Read the threat the delegation was flying into. Beit HaMikdash on the timeline at midnight, in Persian, posting flight ops about shooting the plane down. The threat is on the record. The threat is in two languages. The threat is from an account whose handle is the eschatology. So you ask the question. You are putting Iran’s foreign minister on a plane in Islamabad through Pakistani airspace, then Afghan airspace, then Iranian airspace. Do you put him on a civilian airliner with the threat profile of a wedding cake. Or do you put him on a military airframe with chaff dispensers, flare dispensers, missile warning systems, hardened comms, and the ability to refuel its own escorts mid-corridor without landing anywhere. You put him on the tanker. You always put him on the tanker. Ghalibaf da Gangsta is a pilot himself. Ghalibaf knows what a tanker is. Ghalibaf did not call up Islamabad on Saturday and say send your most flammable airframe with no countermeasures and a paper skin. Ghalibaf said send the IL-78. The IL-78 came. The boom is on the tail because the Soviets put it there. The Soviets put it there because they knew what they were doing. The boom is not a confession. The boom is the warranty. The boom is also the longer mission profile. A tanker in formation with fighters means the fighters loiter. The fighters divert. The corridor stretches. The corridor stops being Islamabad-to-Tehran and starts being wherever the consortium needs the corridor to be this afternoon. The fuel for the corridor is in the formation. 
Vladimir, why is there a boom on the jet. Estragon, in case. Vladimir, in case of what. Estragon, in case of the next sentence. The next sentence has not arrived. The boom is waiting.

Saad@kursed

How can so many people not see a literal boom hanging out of that big jet’s tail? Besides, what possible reason would PAF have to swarm a leadership jet like locusts? Make it make sense. Please. This is not accurate.

24.7K

Saad@kursed·13 Apr

Donald J. Gorbachev@donaldgorbachev

The five-second epistemology of nine jets Nine. Pakistan Air Force. Escorting the Iranian delegation home from Islamabad. One jet would be security. Two would be polite. Four would be a statement. Nine is a broadcast. The same delegation that was being threatened by a Hebrew-language Third Temple Twitter account in Persian at midnight is now flying east through a corridor patrolled, in the air, by a nuclear power’s air force whose chief was personally thanked by President ALL CAPS in the same Truth Social post that announced the blockade. Field Marshal Asim Munir is the man who facilitated the talks, garrisoned the eastern Saudi border, and just put nine fighters around the plane the empire’s id was promising to shoot down. One man. Four jobs. Four jobs the empire used to do alone. The empire no longer does any of them. Read the route. Islamabad to Tehran. Pakistani airspace, then Afghan airspace under Taliban control, then Iranian airspace. There is no point on the route where any non-consortium asset has the legal or operational ability to challenge the escort. Bagram is closed. The carrier is in Split. The Gulf bases are partially garrisoned by the very air force flying the escort. The corridor is consortium-controlled from gate to gate. The nine jets are flying through a consortium-controlled corridor from a consortium capital to a consortium capital after a consortium negotiation in defiance of a threat from an account named after a building that does not exist. Notice what the nine jets say to the account that issued the threat. The threat was in Persian for maximum intimidation of the Iranian readership. The escort was filed in English with international air traffic control because Pakistan does not need to perform anything for anyone. Pakistan files the flight plan and puts the fighters in the air. Smart and complete. 
The blockade announced last night is the announcement of a future intent to interdict shipping in waters that are, as of this morning, patrolled in the air by a friendly air force of a nuclear power escorting officials of the country the blockade is against, on behalf of a regional consortium the empire has been pretending does not exist for forty-four days. The consortium just flew formation through the announcement. The chyron is the only thing left. The chyron is the entire blockade. Some empire. Some blockade. Some nine jets. Day 44. Still closed. The escort is in the air. Day 44.

10.9K

Saad@kursed·31 Oct

"Biden Administration Policies Led to a 0% Market Share in China", Claims NVIDIA’s Jensen Huang, as He Hopes for a Breakthrough in the Region wccftech.com/biden-administ…

3.3K

Saad retweeted

Wall St Engine@wallstengine·31 Oct

Wow 😂 Coinbase $COIN CEO Brian literally got distracted during the earnings call to check a prediction market on what he was gonna say… then went ahead and said every single word people were betting on.