Shaun Smith

1.6K posts

Shaun Smith

@evalstate

https://t.co/Hf39YSdoP3 https://t.co/rA1Uook47l https://t.co/TCqQhhMSrk https://t.co/76p6mDAN3R

united kingdom Sumali Temmuz 2024

726 Sinusundan876 Mga Tagasunod

Shaun Smith@evalstate·37m

RT @Kimi_Moonshot: Congrats to the @cursor_ai team on the launch of Composer 2! We are proud to see Kimi-k2.5 provide the foundation. S…

English

Shaun Smith@evalstate·1h

@badlogicgames Agreed. Open weights means customising for your own needs and alignment. Why would you trust anyone else to do this on your behalf?

English

Mario Zechner@badlogicgames·1h

I, for one, don't think there's anything wrong about Cursor taking an open weights base model and turning it into a great agentic model (any licensing issues not withstanding, don't know the details). I want this to be the future for all of us.

English

125

4.8K

Shaun Smith nag-retweet

@·1d

Putting out a wish to the universe. I need more compute, if I can get more I will make sure every machine from a small phone to a bootstrapped RTX 3090 node can run frontier intelligence fast with minimal intelligence loss. I have hit page 2 of huggingface, released 3 model

Michael Dell 🇺🇸@MichaelDell

Jensen Huang is loving the new Dell Pro Max with GB300 at NVIDIA GTC.💙 They asked me to sign it, but I already did 😉

English

145

365

567.4K

Shaun Smith@evalstate·8h

@dominikhonnef When they did the $1,000 free credits when they launched the Claude Code it scooped up an API key and started using it. Issue filed, no response, out of pocket.

English

Dominik Honnef@dominikhonnef·22h

Anthropic's "Claude for Open Source" program has been an awful experience for me. I applied and got invited a couple days later. When I used the promo link to get free Max, it charged me $200, anyway. It has been 17 days of me trying to talk to a human to rectify that.

English

898

Shaun Smith@evalstate·23h

@badlogicgames Let's do it 🍻. The last one was great fun!

English

332

Mario Zechner@badlogicgames·23h

who here will be a AI Engineer London in April? I'm ready to have more pub visits.

English

9.2K

Shaun Smith@evalstate·1d

@jxnlco what's most token dense?

English

431

jason liu@jxnlco·1d

Future of AI

English

65.5K

Shaun Smith@evalstate·1d

@OpenAINewsroom @astral_sh Go team 🐍

English

371

OpenAI Newsroom@OpenAINewsroom·1d

We've reached an agreement to acquire Astral. After we close, OpenAI plans for @astral_sh to join our Codex team, with a continued focus on building great tools and advancing the shared mission of making developers more productive. openai.com/index/openai-t…

English

480

821

7.2K

3.9M

Shaun Smith@evalstate·1d

@arvidkahl I've found in my testing that it's perfectly usable for coding -- if I were on API rather than plan it would be my default choice.

English

305

Arvid Kahl@arvidkahl·1d

If you do AI inference via OpenAI’s API, you should use the flex tier for half price. My requests always try to use flex tier first, and on 429 / 500 errors, I use the default service tier. 95% of my requests are flex. 2 tries flex, then fall back to standard. Massive cost cut.

English

172

19.1K

Shaun Smith@evalstate·2d

@idosal1 I've played, can confirm it's fing cool.

English

Ido Salomon@idosal1·3d

The bigger IDE is multiplayer. AgentCraft now lets humans and agents collaborate in one shared workspace! ⚔️ See allies on one map. Share context. Hand off agent work across machines.

Andrej Karpathy@karpathy

Expectation: the age of the IDE is over Reality: we’re going to need a bigger IDE (imo). It just looks very different because humans now move upwards and program at a higher level - the basic unit of interest is not one file but one agent. It’s still programming.

English

262

34.5K

Shaun Smith@evalstate·2d

@liran_tal It's not even the best harness for Claude.

English

Liran Tal@liran_tal·2d

I've been saying this for so long. I'm not even sure the CC harness is at all better than the competition (codex, cursor), just folks uninformed

kitze 🛠️ tinkerer.club@thekitze

the Codex app is a trillion and one time better than the codex cli and any other cli as a matter of fact. fuck tui, gimme ui all day every day

English

830

Shaun Smith@evalstate·2d

Loving this new way of looking at the Hugging Face Hub; Generative UI. Early days, but looking promising 😎

English

522

Shaun Smith@evalstate·2d

I'll publish a few quickstart packs over the next couple of days with development environments optimised for Codex and Hugging Face IP and code mode subagents.

English

Shaun Smith@evalstate·2d

OK, we know the drill by now. Proper llama.cpp support yesterday, now the best(?) coding and general purpose agent has GPT-5.4-mini/nano support. Oh, and MCP Server side migration to FastMCP3 (deprecating SSE transports). Web Search is very snappy with the mini model 🌐

English

218

Shaun Smith@evalstate·3d

@nicoritschel @AbeWheeler5 👀

QME

Nico Ritschel@nicoritschel·3d

@evalstate @AbeWheeler5 has something for this

English

Shaun Smith@evalstate·3d

What's a good simple MCP Apps testing tool that isn't MCP Jam?

English

749

Shaun Smith@evalstate·3d

@Shashikant86 (those are rolled up loops btw)

English

Shaun Smith@evalstate·3d

@Shashikant86 Definitely some variance in TTFT -- think that's probably the issue.

English

Shashi 🇬🇧🇺🇸@Shashikant86·3d

Codex is amazing and codes bug free softwares than any other agents. However, I would love to see the fast mode in the Codex so that I can get something done without needed the deep analysis of the code. It would be amazing to get "fast" mode in the codex for those who wants needs to get something done quickly when time is money. @thsottiaux @ah20im @romainhuet Something similar to what @AmpCode did like smart/fast mode etc and let users decides.

Ahmed@ah20im

What would you like to see in Codex?

English

210

Shaun Smith@evalstate·3d

@Shashikant86 Yes, I'm trialling using spark for it again, but gpt-oss-120b with heavy guardrails still seems to beat it.

English

Shaun Smith@evalstate·3d

And that's the FastMCP 3.1.1 migration completed, with working Hugging Face OAuth and token passthrough, and elicitations and all that stuff! A wise man said "you're weird - I bet you can't just change the imports". They were right 🤣

English

573

Tuklasin

@Kimi_Moonshot @cursor_ai @badlogicgames @dominikhonnef @jxnlco @OpenAINewsroom @astral_sh @arvidkahl