Steven Martin

193 posts

@Colosteve2000

🚛 Logistics veteran 🤖 Building AI agents & automations 📈 Using AI for stock research 💻 Founder of FMCSA Helper

Joined October 2024

19 Following, 8 Followers

Steven Martin@Colosteve2000·28m

@intelfabs I think that when it's no longer the flavor of the month for CEOs and the boardroom, there will be a reckoning of some sort for AI. It's not going away by a long shot, but someone will need to start rationalizing the spending.

John Intel@intelfabs·37m

We are very far from AI replacing us, because enterprises can't quantify AI productivity (only 2% can). If enterprises don't quantify AI productivity, markets can't price it. If markets can't price it, capital stays stuck in infrastructure. If capital stays stuck in infrastructure, the AI rally stays narrow. Nothing changes until enterprises start measuring AI productivity instead of just talking about it.

Steven Martin@Colosteve2000·51m

@deforestpeg This is gold!!! Things like this should go viral, not inane questions asking whether you like Fable or 5.5 better!! Great post.

clay@deforestpeg·12h

everyone complains about AI api costs. almost nobody optimizes. i kept typing the same 5 fixes in replies so i built the thing that finds them in your actual logs.
the demo workload (synthetic, 30 days, every inefficiency labeled): $2,330 spend, $1,038 of it recoverable.
biggest single fix: a 6k-token system prompt billed at full price 24,000 times. one cache_control block serves it at 10% of the price. $378 back.
no llm anywhere in the analysis. every number traces to a formula, and it refuses to extrapolate monthly savings from 3 days of logs because that's marketing, not analysis.
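The cached-system-prompt arithmetic in this post can be roughly sketched as below. The token count, call count, and 10% cached rate come from the post. The input price of $3 per million tokens is an assumption, since the post does not state one, and the sketch ignores cache-write surcharges and cache misses. For that reason it is not expected to reproduce the post's $378 figure exactly.

```python
def caching_savings(prompt_tokens, calls, price_per_mtok, cached_fraction=0.10):
    """Return (full_cost, cached_cost, savings) in dollars for a repeated prompt prefix.

    price_per_mtok is an assumed input price per million tokens.
    cached_fraction is the share of the full price paid on a cache hit.
    """
    total_mtok = prompt_tokens * calls / 1_000_000
    full_cost = total_mtok * price_per_mtok
    cached_cost = full_cost * cached_fraction
    return full_cost, cached_cost, full_cost - cached_cost


# 6k-token system prompt sent 24,000 times, at an assumed $3.00 per million tokens.
full, cached, saved = caching_savings(6_000, 24_000, price_per_mtok=3.00)
print(f"full: ${full:.2f}, cached: ${cached:.2f}, saved: ${saved:.2f}")
# full: $432.00, cached: $43.20, saved: $388.80
```

Under this assumed price the saving is in the same range as the post's number; the gap would plausibly come from write costs or a different price, which the post does not specify.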

Steven Martin@Colosteve2000·1h

@haider1 You can control your costs using any LLM. Here is the magic solution: don't reach for it first; only use it when you need to.

Haider.@haider1·1h

google and openai are both capable of building a mythos-sized model and achieving mythos-level performance, but openai especially chose cost optimization instead and kept its max model smaller. mythos is probably at least 2x the size of gpt-5.5, maybe even 3x.

Steven Martin@Colosteve2000·1h

@Jason @Apple It will only be open source until it catches up!

@jason@Jason·3h

Given token economics, we really need @apple's new CEO to go all in on workstations that can run local, open source models. Ideally, with a router that can flip between local models and frontier models when the former gets stuck. And America needs an open source champion: we really should not be comfortable with the Chinese owning the open source LLM market to the extent they do.
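The router idea in this post could look something like the following minimal sketch. Everything here is hypothetical: the `Model` callable interface, the convention that a model returns `None` when it is "stuck", and the toy stand-in models are assumptions made for illustration, not any vendor's API.

```python
from typing import Callable, Optional, Tuple

# Hypothetical model interface: takes a prompt, returns an answer,
# or None when the model is "stuck" (an assumed convention).
Model = Callable[[str], Optional[str]]


def route(prompt: str, local: Model, frontier: Model,
          max_local_attempts: int = 2) -> Tuple[str, str]:
    """Try the local model first; escalate to the frontier model only if it gets stuck."""
    for _ in range(max_local_attempts):
        answer = local(prompt)
        if answer is not None:
            return "local", answer
    answer = frontier(prompt)
    if answer is None:
        raise RuntimeError("both models failed to answer")
    return "frontier", answer


# Stand-in models for illustration only.
def toy_local(prompt: str) -> Optional[str]:
    # Pretend the local model can only handle short prompts.
    return prompt.upper() if len(prompt) < 20 else None


def toy_frontier(prompt: str) -> Optional[str]:
    return f"frontier answer to: {prompt}"


print(route("short task", toy_local, toy_frontier)[0])
print(route("a much longer and harder task", toy_local, toy_frontier)[0])
```

In practice the hard part is deciding when the local model is stuck; a real router would need some check (tests failing, a validator, a confidence signal) in place of the `None` convention assumed here.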

Steven Martin@Colosteve2000·1h

@Beth_Kindig Nvidia would be crazy not to hedge and have a backup plan. Forget the talk about China: Taiwan is in the Ring of Fire, and large earthquakes have consistently happened there for a millennium. They should be hedging with Intel and Samsung, maybe even Rapidus.

Beth Kindig@Beth_Kindig·3h

Nvidia is reportedly testing Intel’s 18A for manufacturing a new design integrating four GPU dies into a single package. $NVDA $INTC $TSM $AMD

Steven Martin@Colosteve2000·1h

benzinga.com/news/fda/26/05…

Steven Martin@Colosteve2000·1h

$BIIB Money maker?? Let me tell you. BIIB is the stock most purchased by members of Congress in the last 30 days. Guess what happens on August 24? Biogen has an "imminent FDA decision" that day. I wonder what they know that we don't :P

Steven Martin@Colosteve2000·1h

@_JKNFT_ @TTrimoreau Sad thing is they give you more job security. Who else is going to fix that MVP release the boss loves?

JK@_JKNFT_·8h

@TTrimoreau "People who are building stuff in Claude code without proper coding knowledge are a threat to me." There, I fixed it for you.

Thomas Trimoreau@TTrimoreau·21h

People who are building stuff in Claude code without proper coding knowledge scare me.

Steven Martin@Colosteve2000·1h

@IanCutress @Index_shu Yes, indeed. How do you miss Rapidus?

𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠@IanCutress·17h

@Index_shu What about Rapidus ?!?

しゅー@インデックス投資🇺🇸@Index_shu·1d

This is the new group of semiconductor companies that represents Japan.

Kr$na@krishdotdev

It's not FAANG anymore. It's MANGO.

Steven Martin@Colosteve2000·1h

@SemiAnalysis_ If there was ever a no-shit-Sherlock moment, this would be it. SemiAnalysis: Subscriptions save you lots of money. Me: You didn't think I knew that?

SemiAnalysis@SemiAnalysis_·1d

Recently, we purchased one of each Anthropic/OpenAI subscription plan and randomly ran long horizon coding tasks until we exhausted the weekly limit. It's widely believed that a $200/month plan maxes out at ~$2000/month worth of tokens (assuming API pricing). However, we found that the subscriptions are actually far more generous. (2/4)

SemiAnalysis@SemiAnalysis_·1d

What's the better business model for an AI lab, subscription or API? (1/4)🧵

Steven Martin@Colosteve2000·1h

$25.17 vs. $833.34: I know which model I am trying first. And guess what? If it's not good enough, you can still spend your money on Fable or ChatGPT, and maybe less of it if you feed in what DeepSeek came up with.

Steven Martin@Colosteve2000·1h

I see a change bubbling up through the cracks. I see this talk about Fable's capability, but who can afford it? I see the calls for OpenAI to release GPT 5.6 even if it's "a little bit worse than Fable 5." Newsflash: GPT 5.5 is considerably more expensive. So you are now "hoping" the most expensive model of all drops its price? Guess what: a 300% price drop on output tokens puts it right around the cost of Fable 5 right now. Please come join the common users with Kimi 2.6 and DeepSeek V4. Make sure you take 10 minutes to tailor your prompts for each model. I think you will be surprised. I believe that most people could shift 75% or more of their usage to one of those two models. Tell me what you think or what you're doing to control costs.
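A rough way to put a number on the "75% of usage" suggestion is sketched below. It assumes, and this pairing is not stated in the posts, that the $25.17 and $833.34 figures from the post above are the cost of the same workload on a cheaper and a pricier model, and that the shifted share of work is done equally well on the cheaper model.

```python
def blended_cost(cheap_cost, expensive_cost, cheap_share):
    """Cost of a workload when cheap_share of it moves to the cheaper model."""
    return cheap_share * cheap_cost + (1 - cheap_share) * expensive_cost


# Assumed inputs: the two dollar figures quoted in the post above,
# with 75% of usage moved to the cheaper model.
cost = blended_cost(25.17, 833.34, 0.75)
saving_pct = 100 * (1 - cost / 833.34)
print(f"blended: ${cost:.2f}, saving: {saving_pct:.1f}%")
# blended: $227.21, saving: 72.7%
```

Under these assumptions the remaining 25% on the expensive model still dominates the bill, which is why the saving tracks the shifted share so closely.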

Steven Martin@Colosteve2000·2h

@Pablo01618 They just might, if Oracle asks! You don't think Larry Ellison has a few politicians on speed dial? I do.

Lisan Al Pablo@Pablo01618·13h

What if the fed came in and backstopped $ORCL? In our Wall Thesis $ORCL plays a pivotal role in the buildout and eats at every single substrate layer. As far as bang for your buck, that's where I would be punting if I were the federal government.

Steven Martin@Colosteve2000·3h

@WayWLeung Great post, Chuip! You nailed it. Slowly but surely, Lip-Bu Tan is winning them over. He is doing it exactly the way he did at Cadence. He is a steady leader who practices what he preaches. Wall Street has developed trust in him over the years.

CHUIP LEUNG@WayWLeung·15h

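$intc Bank of America's latest rating on Intel has changed dramatically (from hostile to Buy). When the three most stubborn and most hostile Wall Street investment banking giants change their stance on Intel, the trend toward a revaluation of Intel is unstoppable. Intel Foundry will carry Intel back to its position as king of semiconductors and make it a core link in the AI industry of the United States and the world. BofA's change of attitude shows once again that in the AI era there is really no need to put blind faith in analysts; your own research matters more. These analysts' bias, hostility, and lag all cause investors to lose the early advantage. Congratulations to everyone who invested in Intel early and has kept holding at a low cost (my average price is $18.64), earning rich returns as Intel rises again.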

Steven Martin@Colosteve2000·3h

@Bhavani_00007 None of the above. I use OpenCode Go and the ChatGPT Plus plan. That covers most of my needs. If I need more, I use the DeepSeek API directly.

Bhavy☄️@Bhavani_00007·15h

developers, which one are you choosing? $200 Claude Code or $100 Codex + $100 Claude Code

Steven Martin@Colosteve2000·3h

@imnotharsh 30-year partnership renewed with fresh ideas for the AI boom. Another win for Intel.

ImNotHarsh | 📈💸@imnotharsh·3h

JUST IN: $INTC Intel and $HPE Hewlett Packard Enterprise Company expand 30 year partnership to deliver Enterprise IT infrastructure across on-premises, hybrid, and edge environments
At HPE Discover 2026, where Intel serves as a Visionary sponsor, the long-standing alliance will emphasize integrated infrastructure solutions designed for enterprise modernization, practical artificial intelligence deployment, and secure hybrid operations from edge to cloud.
The primary focus will center on HPE ProLiant Compute Gen12 servers powered by Intel Xeon 6 processors. These platforms deliver measurable advancements in performance, including up to 2.5 times higher high-performance computing throughput in relevant workloads, alongside improved efficiency measured in performance per watt and per rack. The processors support CPU-based AI inferencing for practical enterprise use cases, such as small language models and high-performance LLM inference through validated vLLM integrations, without requiring accelerator hardware in every deployment.
Security enhancements will receive prominent attention. The solutions incorporate Intel technologies that provide quantum-resistant cryptography aligned with NIST and CNSA 2.0 standards, Intel Software Guard Extensions for confidential computing, and Trust Domain Extensions for virtual machine isolation. These features address compliance and data protection requirements in regulated and mission-critical environments.
Edge and telecommunications workloads will feature through HPE Edgeline systems combined with Intel Xeon 6 capabilities. The partnership will highlight reduced power consumption, smaller deployment footprints, and improved network performance for 5G core and distributed AI scenarios. This supports organizations managing growing data volumes outside traditional data centers.
Hybrid cloud and virtualization offerings will also be showcased. These include HPE GreenLake for flexible consumption-based models, validated Azure Local solutions, VMware trusted platforms, and modern VM management through HPE Morpheus VM Essentials on Intel architecture. Broader ecosystem elements, such as HPE Alletra storage and Juniper networking integrations, will demonstrate end-to-end optimized stacks for AI, analytics, and industry-specific applications.
Customer benefits will be framed around total cost of ownership reductions, accelerated time to value, sustainability improvements, and simplified compliance. Case studies and demonstrations in HPE Customer Innovation Centers, along with on-demand resources, will illustrate real-world outcomes in data center refresh, edge digitization, and AI readiness.
The messaging positions the partnership as a foundation for organizations seeking balanced, silicon-rooted performance and security while advancing practical AI adoption across core and distributed environments. Attendees can expect technical sessions, solution briefs, and interactive exhibits reinforcing these capabilities.

Steven Martin@Colosteve2000·6h

@usr_bin_roygbiv Not such a bargain then. So much for that idea.

Roy@usr_bin_roygbiv·6h

@Colosteve2000 not right now, under any circumstances. the cost to rent for 1 year is 80% of the cost to buy new.
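The rent-versus-buy point in this reply reduces to a simple break-even calculation, sketched here. The 80% figure is from the reply. The sketch assumes a flat rental rate and ignores power, hosting, financing, and resale value, none of which the reply quantifies.

```python
def breakeven_months(annual_rent_fraction):
    """Months of renting after which cumulative rent equals the purchase price.

    annual_rent_fraction is one year of rent as a fraction of the buy-new price.
    """
    return 12 / annual_rent_fraction


months = breakeven_months(0.80)
print(f"renting costs as much as buying after {months:.1f} months")
# renting costs as much as buying after 15.0 months
```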

Roy@usr_bin_roygbiv·17h

Hot take: They're not subsidized; their margins are insane. They are just absolutely raping api customers. Anyone who has used deepseek or hosted anything and done the math on hardware/power costs knows this.

Chubby♨️@kimmonismus

Subscription plans are massively subsidized. And by massively, I mean absurdly:
Claude Max 20x: $200/month, with usage reportedly worth around $8,000
ChatGPT Pro 20x: $200/month, with usage reportedly worth around $14,000

Steven Martin@Colosteve2000·6h

@demishassabis @bodonoghue85 Now imagine this 4x faster using Cerebras' technology and, even crazier, add @TileRT_AI. Obviously you aren't likely to need both, but imagine the speed!

Demis Hassabis@demishassabis·1d

Awesome to see this innovation in text diffusion. DiffusionGemma is lightning fast, 4x faster than other Gemma 4 models! Congrats to @bodonoghue85 and the team who worked so hard on this - excited to see what people build with it!

Google Gemma@googlegemma

Meet DiffusionGemma! An experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. Moving beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇

Steven Martin@Colosteve2000·6h

@LikesFriedRice @usr_bin_roygbiv Well, if you try to use DeepSeek the same way as GPT 5.5, yeah, you could have problems. Every model has its own idiosyncrasies.

PeterLikesFriedRice 🍚🥢@LikesFriedRice·6h

@usr_bin_roygbiv Isn't DeepSeek terrible at coding? I've tried v4 twice and regretted it both times. Went back to Codex 5.5

Steven Martin@Colosteve2000·6h

@albustime @usr_bin_roygbiv I ask this with the utmost respect: if you are privy to the actual costs, please share that knowledge with us. Even if it's a generalization, please consider it.

Akshobya@albustime·7h

@usr_bin_roygbiv LOL anyone who has maintained thousands of H100s at the Slurm level knows that your 3090-based numbers are not related to reality. The fact is that rate limits and surge-routing make subscriptions cheap, and this API comp is done by people without knowledge of these dynamics.
