Steven Martin

193 posts

Steven Martin banner
Steven Martin

Steven Martin

@Colosteve2000

🚛 Logistics veteran 🤖 Building AI agents & automations 📈 Using AI for stock research 💻 Founder of FMCSA Helper

参加日 Ekim 2024
19 フォロー中8 フォロワー
Steven Martin
Steven Martin@Colosteve2000·
@intelfabs I think when its not the flavor of the month anymore for CEO's and the boardroom. There will be a reckoning of some sort for AI. Its not going away by a longshot, but someone will need to start rationalizing the spending.
English
0
0
0
15
John Intel
John Intel@intelfabs·
We are very far from AI replacing us Because enterprises can’t quantify AI productivity (only 2% can) If enterprises don’t quantify AI productivity, markets can’t price it. If markets can’t price it, capital stays stuck in infrastructure. If capital stays stuck in infrastructure, the AI rally stays narrow. Nothing changes until enterprises start measuring AI productivity instead of just talking about it
John Intel tweet mediaJohn Intel tweet media
English
1
0
1
38
Steven Martin
Steven Martin@Colosteve2000·
@deforestpeg This is gold!!! Things like this should go viral. Not inane questions asking if you like Fable or 5.5 better than the other!! Great post.
English
0
0
1
6
clay
clay@deforestpeg·
everyone complains about AI api costs. almost nobody optimizes. i kept typing the same 5 fixes in replies so i built the thing that finds them in your actual logs the demo workload (synthetic, 30 days, every inefficiency labeled): $2,330 spend, $1,038 of it recoverable biggest single fix: a 6k-token system prompt billed at full price 24,000 times. one cache_control block serves it at 10% of the price $378 back no llm anywhere in the analysis. every number traces to a formula, and it refuses to extrapolate monthly savings from 3 days of logs because that's marketing, not analysis
clay tweet media
English
3
0
10
528
Steven Martin
Steven Martin@Colosteve2000·
@haider1 You can control your costs using any LLM. Here is the magic solution. Don't use it first only use it when you need to.
English
0
0
0
82
Haider.
Haider.@haider1·
google and openai are both capable of building a mythos-sized model and achieving mythos-level performance but openai especially chose cost optimization instead, and kept its max model smaller mythos is probably at least 2x the size of gpt-5.5 maybe even 3x
English
9
0
51
2.4K
@jason
@jason@Jason·
Given token economics, we really need @apple’s new ceo to go all in on workstations that can run local, open source models Ideally, with a router that can flip between local models and frontier models when the former gets stuck. And America needs an open source champion — we really should not be comfortable with the Chinese owning the open source LLM market to the extent they do
English
69
44
516
34.1K
Steven Martin
Steven Martin@Colosteve2000·
@Beth_Kindig Nvidia would be crazy to not hedge and have a backup plan. Forget the talk about China, Taiwan is in the ring of fire, large earthquakes have consistently happened here for a millennium. They should be hedging with Intel and Samsung, maybe even Rapidus.
English
0
0
0
146
Beth Kindig
Beth Kindig@Beth_Kindig·
Nvidia is reportedly testing Intel’s 18A for manufacturing a new design integrating four GPU dies into a single package. $NVDA $INTC $TSM $AMD
English
14
15
126
15.5K
Steven Martin
Steven Martin@Colosteve2000·
$BIIB Money maker?? Let me tell you. BIIB is the most purchased stock in the last 30 days by members of Congress. Guess what happens on August 24. Biogen has an "imminent FDA decision that day" I wonder what they know that we don't :P
English
1
0
0
14
Steven Martin
Steven Martin@Colosteve2000·
@_JKNFT_ @TTrimoreau Sad thing is they give you more job security. Who else is going to fix that MVP release the boss loves?
English
0
0
0
3
JK
JK@_JKNFT_·
@TTrimoreau "People who are building stuff in Claude code without proper coding knowledge are a threat to me." There, I fixed it for you.
English
1
0
0
45
Thomas Trimoreau
Thomas Trimoreau@TTrimoreau·
People who are building stuff in Claude code without proper coding knowledge scares me.
English
49
1
57
2.3K
Steven Martin
Steven Martin@Colosteve2000·
@SemiAnalysis_ If there was every a no shit sherlock moment this would be it. SemiAnalysis: Subscriptions save you lots of money. Me: you didn't think i know that?
English
0
0
1
7
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
Recently, we purchased one of each Anthropic/OpenAI subscription plan and randomly ran long horizon coding tasks until we exhausted the weekly limit. It's widely believed that a $200/month plan maxes out at ~$2000/month worth of tokens (assuming API pricing). However, we found that the subscriptions are actually far more generous. (2/4)
SemiAnalysis tweet media
English
155
492
5.4K
2.6M
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
What's the better business model for an AI lab, subscription or API? (1/4)🧵
SemiAnalysis tweet media
English
15
45
599
129.6K
Steven Martin
Steven Martin@Colosteve2000·
$25.17 cents vs $833.34 I know what model i am trying first. Guess what if its not good enough you can still spend your money on Fable or Chat GPT maybe less if you input what DeepSeek came up with.
English
0
0
0
24
Steven Martin
Steven Martin@Colosteve2000·
I see a change bumbling through the cracks. I see this talk about Fable capability, but who can afford it. I see the calls for Open Ai to release GPT 5.6 even if its "a little bit worse than Fable 5. Newsflash GPT 5.5 is considerably more expensive. So you are now "hoping" the most expensive model of all drop their price? Guess what a 300% price drop on output tokens puts it right around the cost of Fable 5 right now. Please come join the common users with Kimi 2.6 and DeepSeek V4. Make sure you take 10 minutes to tailor your prompts to for each model. I think you will be surprised. I believe that most people could reduce 75% of their usage or more to one of those 2 models. Tell me what you think or what your doing to control costs.
Steven Martin tweet mediaSteven Martin tweet media
English
1
0
0
71
Steven Martin
Steven Martin@Colosteve2000·
@Pablo01618 They just might, if Oracle asks! You don't think Larry Ellison has a few politicians on speed dial? I do.
English
0
0
1
18
Lisan Al Pablo
Lisan Al Pablo@Pablo01618·
What if the fed came in and backstopped $ORCL in our Wall Thesis $ORCL plays a pivotal roll in the buildout and eats at every single substrate layer, as far as bang for your buck thats where I would be punting if I was the federal government
English
0
0
8
430
Steven Martin
Steven Martin@Colosteve2000·
@WayWLeung Great post Chuip! You nailed it slowly but surely Lip-Bu Tan is winning them over. He is doing it the exactly the way he did at Cadence. He is a steady leader that practices what he preaches. Wall Street has developed a trust with him over the years
English
0
0
0
12
CHUIP LEUNG
CHUIP LEUNG@WayWLeung·
$intc 美银对Intel的最新评级大幅改变(从敌视改成买入),当三个最顽固最敌视Intel的Wall Street巨头投行改变了态度时,Intel重新估值的趋势已经势不可挡,Intel foundry会重新带Intel站立回去半导体王者地位,成为美国和世界AI行业核心的一环。 Boa的态度改变再一次说明了在AI年代,真的没有必要再去迷信分析师,自己的研究更加重要,这些分析师偏见和敌视以及滞后都会让投资者失去先机。 恭喜各位早早投资Intel的朋友,拿着低成本(我均价$18.64)一直持有,伴随着Intel的重新崛起而不断获得丰厚回报
中文
14
6
82
10.6K
Steven Martin
Steven Martin@Colosteve2000·
@Bhavani_00007 None of the above I use OpenCode Go and ChatGpt plus plan. That covers most of my needs. If I need more I directly use DeepSeek api
English
1
0
0
125
Bhavy☄️
Bhavy☄️@Bhavani_00007·
developers, which one are you choosing? $200 Claude Code or $100 Codex + $100 Claude Code
English
124
1
94
15.7K
Steven Martin
Steven Martin@Colosteve2000·
@imnotharsh 30 year partnership renewed with fresh ideas for the Ai boom. Another win for Intel.
English
0
0
1
109
ImNotHarsh | 📈💸
ImNotHarsh | 📈💸@imnotharsh·
JUST IN: $INTC Intel and $HPE Hewlett Packard Enterprise Company expand 30 year partnership to deliver Enterprise IT infrastructure across on‑premises, hybrid, and edge environments At HPE Discover 2026, where Intel serves as a Visionary sponsor, the long-standing alliance will emphasize integrated infrastructure solutions designed for enterprise modernization, practical artificial intelligence deployment, and secure hybrid operations from edge to cloud. The primary focus will center on HPE ProLiant Compute Gen12 servers powered by Intel Xeon 6 processors. These platforms deliver measurable advancements in performance, including up to 2.5 times higher high-performance computing throughput in relevant workloads, alongside improved efficiency measured in performance per watt and per rack. The processors support CPU-based AI inferencing for practical enterprise use cases, such as small language models and high-performance LLM inference through validated vLLM integrations, without requiring accelerator hardware in every deployment. Security enhancements will receive prominent attention. The solutions incorporate Intel technologies that provide quantum-resistant cryptography aligned with NIST and CNSA 2.0 standards, Intel Software Guard Extensions for confidential computing, and Trust Domain Extensions for virtual machine isolation. These features address compliance and data protection requirements in regulated and mission-critical environments. Edge and telecommunications workloads will feature through HPE Edgeline systems combined with Intel Xeon 6 capabilities. The partnership will highlight reduced power consumption, smaller deployment footprints, and improved network performance for 5G core and distributed AI scenarios. This supports organizations managing growing data volumes outside traditional data centers. Hybrid cloud and virtualization offerings will also be showcased. These include HPE GreenLake for flexible consumption-based models, validated Azure Local solutions, VMware trusted platforms, and modern VM management through HPE Morpheus VM Essentials on Intel architecture. Broader ecosystem elements, such as HPE Alletra storage and Juniper networking integrations, will demonstrate end-to-end optimized stacks for AI, analytics, and industry-specific applications. Customer benefits will be framed around total cost of ownership reductions, accelerated time to value, sustainability improvements, and simplified compliance. Case studies and demonstrations in HPE Customer Innovation Centers, along with on-demand resources, will illustrate real-world outcomes in data center refresh, edge digitization, and AI readiness. The messaging positions the partnership as a foundation for organizations seeking balanced, silicon-rooted performance and security while advancing practical AI adoption across core and distributed environments. Attendees can expect technical sessions, solution briefs, and interactive exhibits reinforcing these capabilities.
ImNotHarsh | 📈💸 tweet media
English
4
5
72
5.3K
Roy
Roy@usr_bin_roygbiv·
@Colosteve2000 not right now under any circumstances the cost to rent for 1 year is 80% of the cost to buy new
English
1
0
0
3.4K
Roy
Roy@usr_bin_roygbiv·
Hot take: They're not subsidized their margins are insane. They are just absolutely raping api customers. Anyone who has used deepseek or hosted anything and done the math on hardware/power costs knows this
Chubby♨️@kimmonismus

Subscription plans are massively subsidized. And by massively, I mean absurdly: Claude Max 20x: $200/month, with usage reportedly worth around $8,000 ChatGPT Pro 20x: $200/month, with usage reportedly worth around $14,000

English
100
90
2.7K
285.9K
Demis Hassabis
Demis Hassabis@demishassabis·
Awesome to see this innovation in text diffusion. DiffusionGemma is lightning fast, 4x faster than other Gemma 4 models! Congrats to @bodonoghue85 and the team who worked so hard on this - excited to see what people build with it!
Google Gemma@googlegemma

Meet DiffusionGemma! An experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. Moving beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇

English
62
94
1.4K
134.2K
Steven Martin
Steven Martin@Colosteve2000·
@albustime @usr_bin_roygbiv @albustime I ask this with the utmost respect. If you are privy to the actual costs please share that knowledge with us. Even if it’s a generalization please consider it.
English
0
0
0
14
Akshobya
Akshobya@albustime·
@usr_bin_roygbiv LOL anyone who has maintained thousands of H100's at the slurm level knows that your 3090-based numbers are not related to reality. The fact is rate limits and surge-routing make subscription cheap, and this API comp is done by people without knowledge of these dynamics
English
2
0
7
2K