cole murray

8K posts

cole murray banner
cole murray

cole murray

@_colemurray

ai/ml | cto | second time founder | former sr. sde @ amazon

San Francisco, CA Katılım Şubat 2015
968 Takip Edilen3.9K Takipçiler
Sabitlenmiş Tweet
cole murray
cole murray@_colemurray·
Advice given to someone asking about AI Consulting: I don't think an ML background is required to be successful in AI consulting, but obviously helps. I think the biggest "skill" learned in ML is how to successfully do feedback loops in a system. In an ML system, this typically involves cleaning data, making model tweaks, performance evals etc. In LLMs, in nearly every case you won't be fine-tuning the model, but iterating on prompts is a very similar workflow. I do think it would be helpful to at least get a high-level learning of how the models "actually" work and become familiar with the basic terms. e.g. tokens, transformers, attention, what happens on each input -> output iteration as the model is predicting. You don't need to know the underlying math (helpful though), but having the understanding of what is happening is helpful. Most of the AI consulting market is more on full-stack / product development skills and less ML. This isn't the most lucrative opportunities, but they are available in abundance. Major areas now and over the next year: - RAG: this is basically just glorified search lol. Useful in many contexts but severely overhyped - Agents: The models aren't quite there yet IMO for this to be useful, but in 2025 I think this will be a major theme and a HUGE area of interest/investment. Becoming good at this will be valuable. - Evals: Performance evaluations are a relatively untapped market. Most AI products you see today are flying by the seat of their pants. Without eval metrics, you can't truly know if your prompt changes are improving the system. This is somewhat more difficult to sell as a consultant as it requires a more sophisticated buyer, but is worth a lot of money if you can do it well
English
9
16
232
47K
cole murray
cole murray@_colemurray·
looks like this got released in 0.120.0 0.120.0 2026-04-11 02:54 UTC
English
0
0
0
174
cole murray
cole murray@_colemurray·
why is codex now suggesting me apps? this feels like malware lol
cole murray tweet media
English
8
0
16
1.7K
alice
alice@aliceisplaying·
has anthropic ever made a model so misaligned as 4.7
English
18
3
207
19.6K
cole murray
cole murray@_colemurray·
@JustJake Definitely Jira. Even the big ransomware gangs use Jira
English
0
0
0
108
Jake
Jake@JustJake·
You think the illuminati uses Slack? Jira? How do they plan their world domination sprints
English
21
0
73
5.5K
Brian Cardarella
Brian Cardarella@bcardarella·
Is anybody tracking Claude speed? It definitely feels like it is going slower nowadays.
English
3
0
8
1.3K
Aubrey Darwin Niederhoffer
OpenAI has to catch up on connectors. i prefer 5.5 to Opus 4.7, but claude's better connectors/plugins mean I mostly use Claude. crazy to me that OpenAI isn't all in on this fight, since connectors aren't hard to build and i assume it costs them billions in enterprise revenue
English
12
0
59
6.8K
cole murray
cole murray@_colemurray·
@simonw I’m more surprised if anyone doesn’t do this yes agents can reward hack the tests to pass, but writing them first *mostly* mitigates this
English
0
0
1
118
Simon Willison
Simon Willison@simonw·
Do you have your coding agents include automated tests for the code that they write?
English
77
3
57
19.8K
0xDesigner
0xDesigner@0xDesigner·
what are the best xcode/simulator mcps or tools to make agentic testing easier?
English
8
1
11
3.1K
cole murray
cole murray@_colemurray·
@RhysSullivan in this thread, we learn about a tokenizer and how you could achieve exactly this if you wanted
English
1
0
2
328
Rhys
Rhys@RhysSullivan·
why do LLMs generate long duplicate strings of text so slowly? consider an LLM outputting 100x's, if you ask it to do that 10 times it's linear amount of time to general, when instead the LLM could just reference that symbol 10 times
English
12
0
43
8K
Avi Peltz
Avi Peltz@avimakesrobots·
when the Linear tickets are actually well written, Superset makes me feel like I have a 6-person dev team
English
3
1
20
2.5K
cole murray
cole murray@_colemurray·
@adam__isom a lot of cloudflares services have a very different execution model than you otherwise might expect. example, there is no RDS/Cloud sql equivalent. D1 is a SQLite db and you’ll run into scaling issues
English
0
0
0
86
adam ✧ ❥ ~
adam ✧ ❥ ~@adam__isom·
is Cloudflare the way forward, over AWS/GCP? I'm new to it but intrigued, they seem to do things differently from the ground up/first-principles whatever please reply with your opinion/exp if you've used it
English
6
0
5
808
cole murray
cole murray@_colemurray·
@jjackyliang it’s the new “unlimited vacation”. marketed as something desirable but actually worse for the employee
English
0
0
1
119
jacky
jacky@jjackyliang·
what with all this obsession of "member of technical staff"???
English
3
0
10
3K
jonah
jonah@jonahseguin·
Can someone please explain to me why we are still waiting until AFTER a package is published and distributed to take action? Why doesn’t npm scan packages with Socket or similar before allowing them to be distributed?
Socket@SocketSecurity

🚨 BREAKING: Active supply chain attack across npm, PyPI, and Crates.​io. Socket detected TrapDoor, a crypto stealer campaign hitting 34 malicious packages and 384 versions and artifacts, with attackers repeatedly pushing new releases across ecosystems. TrapDoor targets #crypto, #DeFi, AI, and security developers, stealing wallets, SSH keys, cloud credentials, GitHub tokens, browser data, env vars, and API keys. Socket detected releases with a median detection time of 5 minutes, 27 seconds. The fastest detection occurred 58 seconds after publication.

English
38
39
975
148.2K