cole murray

8K posts

@_colemurray

ai/ml | cto | second time founder | former sr. sde @ amazon

San Francisco, CA · Joined February 2015

968 Following · 3.9K Followers

Pinned Tweet

cole murray@_colemurray·Oct 25

Advice given to someone asking about AI consulting:

I don't think an ML background is required to be successful in AI consulting, but it obviously helps. I think the biggest "skill" learned in ML is how to successfully run feedback loops in a system. In an ML system, this typically involves cleaning data, making model tweaks, running performance evals, etc. With LLMs, in nearly every case you won't be fine-tuning the model, but iterating on prompts is a very similar workflow.

I do think it would be helpful to at least get a high-level understanding of how the models "actually" work and become familiar with the basic terms, e.g. tokens, transformers, attention, and what happens on each input -> output iteration as the model is predicting. You don't need to know the underlying math (helpful though), but understanding what is happening is helpful.

Most of the AI consulting market leans more on full-stack / product development skills and less on ML. These aren't the most lucrative opportunities, but they are available in abundance.

Major areas now and over the next year:
- RAG: this is basically just glorified search lol. Useful in many contexts but severely overhyped.
- Agents: The models aren't quite there yet IMO for this to be useful, but in 2025 I think this will be a major theme and a HUGE area of interest/investment. Becoming good at this will be valuable.
- Evals: Performance evaluations are a relatively untapped market. Most AI products you see today are flying by the seat of their pants. Without eval metrics, you can't truly know if your prompt changes are improving the system. This is somewhat more difficult to sell as a consultant as it requires a more sophisticated buyer, but is worth a lot of money if you can do it well.
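The point about evals can be made concrete with a small sketch. What follows is only an illustration, assuming a fixed set of test cases and an exact-match metric; `prompt_v1` and `prompt_v2` are hypothetical stand-ins for an LLM call under two prompt versions, not part of any real library.

```python
from typing import Callable


def exact_match_rate(generate: Callable[[str], str],
                     cases: list[tuple[str, str]]) -> float:
    """Fraction of cases where the generated answer equals the expected one
    (compared case-insensitively, ignoring surrounding whitespace)."""
    hits = sum(
        1
        for question, expected in cases
        if generate(question).strip().lower() == expected.strip().lower()
    )
    return hits / len(cases)


# Hypothetical stand-ins: in practice each would call a model with a
# different prompt template. Here they are canned lookups.
def prompt_v1(question: str) -> str:
    answers = {"2+2": "4", "capital of France": "paris"}
    return answers.get(question, "unknown")


def prompt_v2(question: str) -> str:
    answers = {"2+2": "4", "capital of France": "Paris", "3*3": "9"}
    return answers.get(question, "unknown")


# A fixed eval set, reused unchanged for every prompt revision.
CASES = [
    ("2+2", "4"),
    ("capital of France", "Paris"),
    ("3*3", "9"),
    ("largest planet", "Jupiter"),
]

score_v1 = exact_match_rate(prompt_v1, CASES)
score_v2 = exact_match_rate(prompt_v2, CASES)
print(f"v1: {score_v1:.2f}  v2: {score_v2:.2f}")
```

Holding the case set fixed is the design choice that matters: the score difference between two prompt versions is only meaningful when both are measured against the same inputs.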

232

47K

cole murray@_colemurray·10h

if you are using supabase in your codebase, in most cases you can replace them with AWS RDS or Google Cloud SQL to improve your security and latency

Supabase@supabase

If you are using getUser() in your codebase, in most cases you can replace them with getClaims() to speed up your app and reduce DB loads.

991

cole murray@_colemurray·10h

amber alert as a payload delivery mechanism

Salina Mendoza@inababi

Just got an amber alert. Clicked the link and it brought me to this. I kid you not. Whose contract is this?

412

cole murray@_colemurray·12h

looks like this got released in 0.120.0 (0.120.0, 2026-04-11 02:54 UTC)

174

cole murray@_colemurray·12h

why is codex now suggesting me apps? this feels like malware lol

1.7K

cole murray@_colemurray·12h

imagine getting 720 sideflip no scoped by this

Eren Chen@ErenChenAI

720 degree Side Flip

423

cole murray@_colemurray·12h

@aliceisplaying x.7 series is haunted x.com/_colemurray/st…

cole murray@_colemurray

@BLUECOW009 something cursed with the x.7 versioning. 3.7 had the same cope

345

alice@aliceisplaying·1d

has anthropic ever made a model so misaligned as 4.7

207

19.6K

cole murray@_colemurray·12h

@JustJake Definitely Jira. Even the big ransomware gangs use Jira

108

Jake@JustJake·13h

You think the illuminati uses Slack? Jira? How do they plan their world domination sprints

5.5K

cole murray@_colemurray·15h

@bcardarella +1 x.com/_colemurray/st…

cole murray@_colemurray

claude queue times definitely feel extended. feel like most of my session time is just spent waiting now

Brian Cardarella@bcardarella·22h

Is anybody tracking Claude speed? It definitely feels like it is going slower nowadays.

1.3K

cole murray@_colemurray·15h

@needaubrey /goal build <> client and verify against the API

Aubrey Darwin Niederhoffer@needaubrey·18h

OpenAI has to catch up on connectors. i prefer 5.5 to Opus 4.7, but claude's better connectors/plugins mean I mostly use Claude. crazy to me that OpenAI isn't all in on this fight, since connectors aren't hard to build and i assume it costs them billions in enterprise revenue

6.8K

cole murray@_colemurray·16h

you either exit VC as a cultural icon or end up funding loophole gambling

Y Combinator@ycombinator

Totalis (@totalistrading) lets users parlay on anything. Combine multiple event markets into one trade across politics, crypto, stocks, sports, weather, and macro. Starting with parlays, expanding into structured products. Congrats on the launch, @ImTheBigP & @ericliujt! ycombinator.com/launches/QUr-t…

348

cole murray@_colemurray·17h

@kubedoll Use the app token or create a bot user

Cadence Agyirey 🇬🇭🏳‍⚧@kubedoll·18h

who's going to solve the agentic git identity problem

489

cole murray@_colemurray·19h

@simonw I’m more surprised if anyone doesn’t do this. yes, agents can reward hack the tests to pass, but writing them first *mostly* mitigates this

118

Simon Willison@simonw·20h

Do you have your coding agents include automated tests for the code that they write?

19.8K

cole murray@_colemurray·19h

@0xDesigner cc @LandseerEnga

0xDesigner@0xDesigner·21h

what are the best xcode/simulator mcps or tools to make agentic testing easier?

3.1K

cole murray@_colemurray·19h

@RhysSullivan in this thread, we learn about a tokenizer and how you could achieve exactly this if you wanted

328

Rhys@RhysSullivan·22h

why do LLMs generate long duplicate strings of text so slowly? consider an LLM outputting 100 x's: if you ask it to do that 10 times, it takes a linear amount of time to generate, when instead the LLM could just reference that symbol 10 times
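A rough sketch of the tokenizer point raised in this thread: a model emits one token per decoding step, so a repeated run costs fewer steps if the vocabulary contains a token covering a longer chunk of it. The greedy longest-match tokenizer and the two toy vocabularies below are assumptions for illustration only and do not reflect any particular model's tokenizer.

```python
def greedy_tokenize(text: str, vocab: list[str]) -> list[str]:
    """Greedy longest-match tokenization over a fixed toy vocabulary."""
    by_length = sorted(vocab, key=len, reverse=True)  # try longest pieces first
    tokens: list[str] = []
    i = 0
    while i < len(text):
        for piece in by_length:
            if text.startswith(piece, i):
                tokens.append(piece)
                i += len(piece)
                break
        else:
            # No vocabulary entry matches at this position.
            raise ValueError(f"no token covers position {i}")
    return tokens


text = "x" * 100

# Character-level vocabulary: one token (one decoding step) per character.
char_level = greedy_tokenize(text, ["x"])

# Vocabulary that also contains a single token for a run of ten x's.
with_run_token = greedy_tokenize(text, ["x", "x" * 10])

print(len(char_level), len(with_run_token))
```

With the run token available, the same string is covered by a tenth as many tokens, which is the sense in which a tokenizer "could achieve exactly this" at the cost of a larger vocabulary.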

cole murray@_colemurray·23h

@avimakesrobots told you background agents was the way!

Avi Peltz@avimakesrobots·1d

when the Linear tickets are actually well written, Superset makes me feel like I have a 6-person dev team

2.5K

cole murray@_colemurray·1d

@adam__isom a lot of Cloudflare's services have a very different execution model than you might otherwise expect. for example, there is no RDS/Cloud SQL equivalent. D1 is a SQLite db and you’ll run into scaling issues

adam ✧ ❥ ~@adam__isom·1d

is Cloudflare the way forward, over AWS/GCP? I'm new to it but intrigued, they seem to do things differently from the ground up/first-principles whatever please reply with your opinion/exp if you've used it

808

cole murray@_colemurray·1d

@ChadNauseam a lot of words when you could’ve already deployed OpenInspect github.com/ColeMurray/bac…

941

cole murray@_colemurray·1d

@jerryjliu0 do it anyway. use the agent’s access credentials

664

Jerry Liu@jerryjliu0·1d

I asked my eng team if I could ship code to prod They told me no 💀

Aaron Levie@levie

CEOs are uniquely prone to AI psychosis because they’re sufficiently distant from the last mile of work that still has to happen to generate most value with AI. So when they play with AI, they see the happy path results, often not considering the next 10 or 20 things that have to happen to get sustainable results from agents. “Look I made this awesome product prototype”. Yes but you didn’t have to review the code before it went into production and fix a bunch of issues. “Look I generated a contract”. Yes but you didn’t verify all the terms before it goes out to the counterparty and didn’t have to wire up all the past contracts to work with. The best thing you can do as a CEO is to use AI a *ton* to figure out the real implications of agents in the enterprise, and come out the other side with an appreciation for both the upside and the real work that goes into them.

139

33.4K

cole murray@_colemurray·1d

@jjackyliang it’s the new “unlimited vacation”. marketed as something desirable but actually worse for the employee

119

jacky@jjackyliang·1d

what's with all this obsession with "member of technical staff"???

cole murray@_colemurray·1d

@jonahseguin when you know the real reason

10.3K

jonah@jonahseguin·1d

Can someone please explain to me why we are still waiting until AFTER a package is published and distributed to take action? Why doesn’t npm scan packages with Socket or similar before allowing them to be distributed?

Socket@SocketSecurity

🚨 BREAKING: Active supply chain attack across npm, PyPI, and Crates.io. Socket detected TrapDoor, a crypto stealer campaign hitting 34 malicious packages and 384 versions and artifacts, with attackers repeatedly pushing new releases across ecosystems. TrapDoor targets #crypto, #DeFi, AI, and security developers, stealing wallets, SSH keys, cloud credentials, GitHub tokens, browser data, env vars, and API keys. Socket detected releases with a median detection time of 5 minutes, 27 seconds. The fastest detection occurred 58 seconds after publication.

975

148.2K
