Sam Hogan 🇺🇸

5.3K posts

Sam Hogan 🇺🇸

@samhogan

ceo @inference_net | inference infra for AI-native teams

San Francisco, CA Katılım Mayıs 2012

1.5K Takip Edilen27.9K Takipçiler

Sam Hogan 🇺🇸@samhogan·1h

@Pauly4010 Yeah something like that. Not sure on the exact shape yet. Whatever it is, it needs to be super super easy to use, pretty much fully automatic

English

361

🅿️@Pauly4010·2h

@samhogan Routing is optimized weekly bc inference is now easier. Do you think evals (use case, regression-gated) become the primitive long term? I think evals are new PRDs where AC becomes CI gates and hill-climbing signals. Routing saves $, evals ensures credibility. Thoughts?

English

415

Sam Hogan 🇺🇸@samhogan·2h

This is called *preselling* With the news that OpenRouter might get acquired, there is going to be a land grab for inference volume. I wouldn’t bet against the Ramp Labs team, but this space feels ripe for disruption by a small team dedicated solely to these problems.

Veeral Patel@vral

Today we’re launching Ramp Router. 3 years ago, we built an internal LLM router at @tryramp that powers AI products for 70,000 customers. Back then it was mostly about saving money. Now it feels obvious: the best model changes constantly. GPT, Claude, Gemini, Grok, Qwen, DeepSeek, Kimi, GLM - prices and capabilities move every week. So we’re opening up access to everyone. One OpenAI-compatible endpoint. The right model for every request. Lower cost without rewriting your app. Reserve access to use it.

English

15.1K

Sam Hogan 🇺🇸@samhogan·1h

@hrojantorse Token spend is a CFO problem but token efficiency is an engineering problem

English

465

Vishwa Naik@hrojantorse·2h

@samhogan Ramp owns decent distribution, though I’ve always viewed expense management has a CFO job and from my experience, where we are with token spend isn’t (and perhaps shouldn’t) be a token spend problem yet.

English

494

Sam Hogan 🇺🇸@samhogan·2h

@vral @tryramp You guys move fast! Only three weeks behind :) x.com/samhogan/statu…

Sam Hogan 🇺🇸@samhogan

Want to try GLM 5.2 in production but worried how it might change your product? Don’t worry, we got you: 1. Install Inference Gateway (docs.inference.net) 2. Keep sending traffic to your current provider 3. Gateway automatically starts sorting through your live data using an RLM to generate evals for your app. This takes ~24 hours. 4. Gateway starts mirroring live traffic to GLM 5.2 to run evals. Traffic is only mirrored - you’re still using your old provider in prod. 5. Once evals look healthy, you get a Slack notification letting you know it’s safe to switch. 6. Switch model identifier in your code to “glm-5.2” Congrats, you just saved 90% on your monthly token bill, and you own your LLM stack end to end.

English

360

Veeral Patel@vral·5h

English

154

185

1.3K

356.1K

Sam Hogan 🇺🇸 retweetledi

Ibrahim Ahmed@atbeme·5h

Inference volume at @inference_net has grown 5x in the last 30 days

English

1.2K

Sam Hogan 🇺🇸@samhogan·1d

@Kimi_Moonshot Looking forward to serving Kimi K3 on @inference_net!

English

3.3K

Kimi.ai@Kimi_Moonshot·1d

Kimi K3 has received far more love than we expected, and our GPUs are feeling it. Over the past 48 hours, demand has pushed close to the limits of our current capacity. To protect the experience of existing subscribers, we're temporarily pausing new subscriptions and prioritizing compute for current members. Existing subscribed users are not affected. We're adding capacity as fast as we can and will reopen new subscription spots in batches. Going forward, we'll also split membership into two more focused plans: Kimi Membership for Kimi Web, App, and Work; and Kimi Code Membership for coding workflows. This will help us match compute more precisely and keep the experience stable. Thank you for your patience and understanding!

English

1.5K

2.6K

36.6K

12.6M

Sam Hogan 🇺🇸 retweetledi

signüll@signulll·1d

lol openai comms right now is an incoherent policy geek who joined two weeks ago & a researcher who speaks uh researcher language? that is not parseable by the average individual trying to play defense against all of twitter. remarkable. genuinely amateur shit, especially from a company whose product is supposedly intelligence.

English

1.1K

82.2K

Sam Hogan 🇺🇸@samhogan·1d

@tszzl Hmmm open models being decel is a horrible take tho

English

1.6K

roon@tszzl·1d

ZXX

89.6K

Sam Hogan 🇺🇸 retweetledi

signüll@signulll·2d

ZXX

1.4K

40.6K

Sam Hogan 🇺🇸@samhogan·2d

@Teknium @deanwball Genuinely shameful take.

English

1.4K

Sam Hogan 🇺🇸@samhogan·2d

@max_paperclips For sure! But AI will have a wide and deep open source ecosystem from nearly the beginning, whereas regular software had 20+ years of being almost entirely proprietary. I’m also not sure if the labs are the best to tackle compliance, security, etc

English

193

Shannon Sands@max_paperclips·2d

there's nothing wrong either with this coexisting with closed labs. Oracle etc continued to exist. White label services that have ticked all the boxes for regulatory obligations, standards and reporting & whatnot else that AI will inevitably require aren't going anywhere. We know what the Schelling points look like. as you say "deterministic" software already went through this

English

358

Sam Hogan 🇺🇸@samhogan·2d

The diffusion of deterministic software, ie code, took ~50 years to reach full saturation. For the first 20 years, most companies bought software from vendors like Microsoft and Sun Microsystems. In the 1990s, Linux and the OSS came along and disrupted this by helping to create a whole new class of people who could easily create software. Suddenly businesses were hiring their own software developers and building tools in house using open source components. I expect we’re going to speedrun this adoption curve for non-deterministic software, ie models, in just five years. Companies are going to build their own ML teams who use open source models and libraries to create and manage their own AI stack rather than relying on a proprietary provider. The large vendors like OpenAI and Anthropic will remain relevant for some time, but will ultimately be small compared to the ecosystem as a whole.

Aravind Srinivas@AravSrinivas

At its peak, Sun Microsystems was valued at 205B (394B if inflation adjusted). Sold software in enterprise servers. Got disrupted by Linux, x86, and commodity hardware. Ended up selling to Oracle for 7.4B, losing 96% of its value. Open source models running on local hardware can have a similar impact given what’s going on.

English

9.8K

Sam Hogan 🇺🇸@samhogan·2d

@doodlestein I’m talking about enterprise engineering teams with 1000+ people moving the majority of their token usage off closed source models to Kimi and GLM

English

241

Jeffrey Emanuel@doodlestein·2d

@samhogan Disagree, I’m tokenmaxxing more than just about anyone and I’m still a huge user of both Fable and Sol. Yes, I signed up for the $200/month Kimi plan. But that compares to 56 accounts across Plus, Max, and the Google one (just 7 of those). Remember, the labs are ahead internally

English

607

Sam Hogan 🇺🇸@samhogan·2d

Crazy how fast the vibes have shifted. All the people who were tokennmaxxing and fanboying over OpenAI/Anthropic just three months ago are now moving to open models and publicly questioning the durability of the labs’ moat

English

124

Sam Hogan 🇺🇸 retweetledi

Will Manidis@WillManidis·2d

My friend Dean Ball has advanced an argument for the de facto protection of American frontier intelligence providers. Dean does not propose banning Chinese open-weight models. Banning things requires Congress. He proposes something more characteristic of the modern administrative state: every agency issues enough warnings, bulletins, and speculative security notices that no regulated company will risk touching them. Even a reader sympathetic to Dean would call this protectionism, and protectionism has a long history in America. More precisely, it's a proposal to use the informal, coercive power of the terminal, late-stage bureaucratic state to clear the American market of a cheaper frontier competitor to OpenAI or Anthropic. But throughout the history of American industrial protectionism, it has always had two features. First, it's done in the daylight, and two, it comes with a bill. In the spring of 1952, the United States was fighting a war in Korea. Truman concluded that a shutdown would endanger soldiers abroad and ordered the Secretary of Commerce to seize and operate most of the nation's steel mills. The Supreme Court sent him straight back to Congress in the Youngstown Steel case. Justice Black, writing the majority's opinion, begins with the rule that Dean's proposal is seemingly designed to evade: that presidential power "must stem either from an act of Congress or from the Constitution itself." It's easy to flatten the Youngstown decision into the proposition that the president could not seize a steel mill. Its actual lesson is subtler: that an emergency does not dissolve the difference between making a law and executing one, that the importance of the object does not create the authority, that the inconvenience of the regulatory process is not inherently a source of presidential power. Truman's approach failed not because steel was unimportant, but because it was so important that the constitutional bargain had to be made and the policy had to be carried through the front door. Much like policy proposals from the rest of the AI agenda, Dean is proposing a smaller action in formal appearance and a much larger one in practical effect. We will not ban Kimi, we will not prohibit it from use, and we will certainly not publish a rule declaring Chinese weights unlawful. But we will whisper about it. A regulator may even ask management whether it has considered the reputational consequences of relying on the Chinese model, but the agency certainly will never be coherent enough to ask anyone to stop. It merely ensures that continuing becomes professionally indefensible. This is how we grow the administrative state, with bureaucrats that we placed in these roles, without accepting responsibility for the actual process of governing. America has tried this experiment before. Operation Chokepoint didn't make payday lending, firearm sales, or any of the other seemingly distasteful businesses caught in its net illegal, but it encouraged banks to understand that serving legally disfavored customers would invite regulatory interest. We didn't pass a law, we simply just asked, "Are you sure you really want to be doing this?" Reputational risk was powerful precisely because it's not law. It has no limiting content. A regulator did not need to identify a violation or even a material financial risk. He only needed to make the bank afraid of being asked what was actually going on here. The analogy is almost embarrassingly exact to Dean's policy proposal. Dean need not prove that a Chinese model contains a backdoor, nor prove that it uses any more distillation than American models do. He simply needs to announce that there may be one. The agency does not need to order a company to stop using it, but simply ask whether management has considered the risk. The absence of formal policy is by design. The Supreme Court dealt with this technique in NRA v. Vullo. New York's financial regulator could not directly punish the NRA's speech, so she allegedly pressured the insurers and banks she regulated to sever their relationships with it. The Court's rule was unanimous: government officials may not use their offices to "coerce private parties" into suppressing what the government disfavors. The communication must be understood in the context of the regulator's power, including the regulated party's knowledge that the person offering advice can also investigate, prosecute, fine, and settle. The current administration has gone even further. In April 2026 the FDIC and OCC issued a final rule to prohibit regulators from criticizing institutions, formally or informally, on the basis of reputational risk, and from encouraging banks to deny services to lawful but politically disfavored businesses. In June, the federal banking agencies removed the remaining references to reputational risk from their supervisory materials. Dean is proposing that this administration recreate for AI the same machinery that all of us argued against when we were widely debanked. A government that can quietly remove Kimi from the market can also quietly remove gun makers, crypto companies, churches, newspapers, or American open-weight models from it. The bureaucracy does not remain attached to the intentions of those who staff it at the current moment. You don't get to build this machine just because your friends happen to be in office right now and keep it pointed at where you left it. Protectionism through a whisper is not a more modest protectionism than by law. Protectionism also has always come with a bill. OpenAI and Anthropic increasingly speak of themselves as national institutions. Their compute is "strategic infrastructure," their losses are "national security losses." Their competitors are not just competitors, but instruments of hostile states, and their access to power, chips, capital, copyrighted material, and public customers is a matter of national survival and great power competition. When Washington decided that the atom was too dangerous and too important to remain an ordinary private business, Congress created the Atomic Energy Commission and transferred the Manhattan Project assets and responsibilities to it. Production facilities and reactors were government-owned, and technical information sat under federal control, and private participation only returned later through a statutory licensing regime. The existential framing of the atom by its greatest proponents produced public control. When national security concerns helped to preserve AT&T's integrated position, that is, a monopoly, in 1956, Bell did not receive this protection for nothing. The consent decree required compulsory licensing of roughly 9,000 patents and restricted Western Electric's commercial activity outside the telephone system. The settlement diffused the inventions accumulated inside the protected monopoly into the broader economy before breaking it up just a few decades later. The pattern is really simple. It's not that every tariff necessarily demands nationalization. It's that the bigger the shield you are asking for, the bigger the bill you owe to the American taxpayer. And OpenAI and Anthropic have been unambiguous about asking for the biggest shields of all time. Listen to what they are asking for: public infrastructure, privileged energy, federal preemption of state law, favorable copyright treatment, government contracts, export controls, and a domestic market swept clear of their strongest price competitor, all filed under national security interests. And what do they want to pay? Almost nothing. OpenAI has floated giving 5% of the company to the American taxpayer. They would like the benefits of nationalization at the price of being an ordinary public company. There is also a profound moral hazard buried in Dean's proposal, as well as adjacent commentary on this. The labs say the Chinese companies distilled their models. Perhaps they did. Perhaps distillation matters. And perhaps the Chinese labs are running distillation attacks on scales that the Western labs are. I can't be sure of this. But if the reward for failing to secure an API is that the government removes the resulting competitor, the taxpayer is paying the lab to be careless. We know how to secure an API. Know-your-customer laws exist. Access controls exist. Extraction detection exists. If you spend some fraction of the hundreds of billions being raised to defend the asset whose theft is said to threaten the republic, you might be able to stop some of this. Theft remains theft when the lock is bad, but the owner of a badly secured store does not receive ownership of the street for his failure to protect it. Dean's fourth point is that open-weight AI ends in communism: the state builds the training runs and subsidizes the product of intelligence and gives the models away. But, at least for me, this is not a particularly Chinese idea, but one of the most American ones imaginable. The roads we build are public. Our radio spectrum is publicly allocated. The government funded the early internet and much of the research base behind modern computing. The state is welcome to build a platform, and American businesses are welcome to be built on top. Just because they're bad for our market position doesn't mean we get to call them Chinese in some fundamental way. There will be inference companies and application companies and security companies and fine-tuning companies and data companies and chip companies and 10,000 businesses we don't even have names for yet. A public road existing does not abolish the trucking industry, nor does it nationalize it. Sure, this may reduce the value of a couple trillion dollars of equity in the first generation of model companies, but it's certainly not communism. This technology may be civilizational without its present owners being permanent. And that is the thing that I feel like none of you will say out loud: that AI is welcome to be a civilizational technology when we ask for support, and an ordinary private product when anyone asks what the public receives in return. The United States has two honest options. First, treat AI as a competitive industry. Then the answer to Kimi is a better model, run cheaper and exported harder, with written rules excluding Chinese systems from defense, intelligence, and critical infrastructure when a concrete security case can be made. Or two, decide frontier AI is too important for ordinary competition. Protect the labs through pseudo-nationalization, guarantee there's a market for them, and exclude the rivals. But in that second case, the American taxpayer must be paid, likely through a majority of equity in these companies, if not full nationalization. What no one gets is that private upside, public infrastructure, government-mandated scarcity, and immunity from cheaper competition delivered through a late bureaucratic state issuing warnings is a disgusting ask for something that is easy to name: regulatory capture. There is a serious American argument for protecting industries that we can't afford to lose. But there has never been a serious argument for doing it invisibly, for free, through a bureaucracy instructed to manufacture fear, even if we can do it because our friends happen to be in office right now. If the labs want to be protected, they should ask for it in the way that Americans have always asked for it. In public. With a price.

Dean W. Ball@deanwball

Some observations on Kimi: 1. It's a very good model! I don't think its performance can be explained away by distillation or anything like that. In agentic coding sessions, it seems pretty much on par with the best public models of Q1 2026. In my fairly limited use, it also seemed very token hungry. It's not obvious to me that this model is actually that cheap to run. 2. I am personally surprised the Chinese state continues to allow the open sourcing of models this good, given potential risks. To be clear, I *myself* might be fine with models presenting this level of marginal risk being open weight, but I am surprised that China is fine with it. I suspect the reason they are is 75% explained by strategic blindness/lack of AGI-pilledness (the CCP is very Yann Lecun-y in its views of AI). The other 25% or so is their lack of compute for customer inference (making China's open-weight strategy an unintended byproduct of US export controls) and the normal Chinese strategy of aggressive exports. For the companies, as opposed to the government, the decision to open source is partially ideological and partially because they are behind, and they know that very few people would pay for sub-frontier models from China. 3. Open-weight models are inherently decelerationist, and I'm continually surprised to see the so-called "accelerationists" so excited about open-weight models. I suspect the reason they are is that they know open-weight models are effectively ungovernable, and they simply like the overall cloak of ungovernability open-weight models create over the whole of AI. It's not a bad strategy; it reminds me of James Scott's recounting of the hill people in "the art of not being governed." Still, in the end, open-weight models deter further AI capex. 4. One probable outcome of an open-weight-model-dominant world is full AI communism, which is precisely what China proposes: rather than a market product, AI is a "public good" which will ultimately be provided by the state as a kind of "digital public infrastructure." This future strikes me as a dystopian hellscape, but I've never met an open-weight models advocate who doesn't ultimately concede this is where things end. You'd be surprised how many 'accelerationists' lobbied me, while I was in government, to support an eleven or twelve-figure federally funded data center so that startups could train models at a subsidy and then give them away for free. There was no other way for AI to progress, they said. Perhaps this is the logical end state of things. Nonetheless, I find myself surprised to see supposed accelerationists excited about such an outcome. I think many of them just don't know what they're doing. Many accelerationists do not view the creation and serving of frontier models as a legitimate business. 5. I would guess that the Trump Administration will at some point realize that their best strategy here would be to create large amounts of regulatory risk around the use of open-weight Chinese models. You don't need to "ban open source" (one of the dumber motifs of AI policy discussion). You just need to direct every agency to issue soft law that creates FUD. "A Federal Reserve Advisory Bulletin found that there may be backdoors in Chinese AI models." It needn't be that well justified. You just create enough regulatory risk that every regulated enterprise backs off. You probably don't want to create so much regulatory risk that you scare off the hyperscalers from serving Chinese models; this will just drive startups to sketchier providers. There's a happy middle ground here. I'd assume they will do some version of this. 6. It's probably true that open-weight models of this capability make the world a bit more dangerous, but not so much more that you'll really notice. At some point the models will be capable enough that you will notice. "A nonliving, invisible, dangerous, and infinitely self-replicating agent escaped from a Chinese lab," you say? Color me shocked.

English

102

140

1.3K

387.6K

Sam Hogan 🇺🇸@samhogan·2d

What the world’s best AI researchers see right before leaking state secrets

English

103

6.2K

Sam Hogan 🇺🇸 retweetledi

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex·3d

excellent lecture from Zhilin, primarily about Kimi's scaling strategy

English

788

53.4K

Sam Hogan 🇺🇸@samhogan·2d

Guessing we will see more direct digs lab on lab as things start to fall apart

Tibo@thsottiaux

GPT-5.6 Sol confirmed to be an extremely good model

English

5.7K

Sam Hogan 🇺🇸@samhogan·2d

@benhylak Completely insane. The least logically sequential paragraphs I’ve ever read.

English

442

ben hylak@benhylak·3d

it's insane that this is still true. i love codex, but i never read what it says. it's completely unintelligible. same for ChatGPT.

ben hylak@benhylak

maybe i'm going crazy but i really can't read chatgpt outputs anymore. the structure of the response is so schizophrenic.

English

525

50.6K

Sam Hogan 🇺🇸 retweetledi

jules@julesrosenberg·3d

5 things @samhogan does differently > signs company documents from the terminal > trained 5 function-specific @inference_net agents > keeps his entire to-do list in one apple note (over a year old) > moved his entire team to @opencode for llm portability > blocks twitter on his phone from 9-5 (300+ muted words) ep 6 of show me your stack is live!

English

42.9K

Keşfet

@Pauly4010 @hrojantorse @vral @tryramp @inference_net @Kimi_Moonshot @tszzl @Teknium