cole murray

7.4K posts

cole murray banner
cole murray

cole murray

@_colemurray

ai/ml | cto | second time founder | former sr. sde @ amazon

San Diego, CA Katılım Şubat 2015
954 Takip Edilen3.5K Takipçiler
Sabitlenmiş Tweet
cole murray
cole murray@_colemurray·
Advice given to someone asking about AI Consulting: I don't think an ML background is required to be successful in AI consulting, but obviously helps. I think the biggest "skill" learned in ML is how to successfully do feedback loops in a system. In an ML system, this typically involves cleaning data, making model tweaks, performance evals etc. In LLMs, in nearly every case you won't be fine-tuning the model, but iterating on prompts is a very similar workflow. I do think it would be helpful to at least get a high-level learning of how the models "actually" work and become familiar with the basic terms. e.g. tokens, transformers, attention, what happens on each input -> output iteration as the model is predicting. You don't need to know the underlying math (helpful though), but having the understanding of what is happening is helpful. Most of the AI consulting market is more on full-stack / product development skills and less ML. This isn't the most lucrative opportunities, but they are available in abundance. Major areas now and over the next year: - RAG: this is basically just glorified search lol. Useful in many contexts but severely overhyped - Agents: The models aren't quite there yet IMO for this to be useful, but in 2025 I think this will be a major theme and a HUGE area of interest/investment. Becoming good at this will be valuable. - Evals: Performance evaluations are a relatively untapped market. Most AI products you see today are flying by the seat of their pants. Without eval metrics, you can't truly know if your prompt changes are improving the system. This is somewhat more difficult to sell as a consultant as it requires a more sophisticated buyer, but is worth a lot of money if you can do it well
English
9
16
223
40.2K
cole murray
cole murray@_colemurray·
@49agents if the “debug sentry event agent” doesn’t have access to the sentry event, what does it do lol
English
0
0
0
16
49 Agents - Agentic Coding IDE
@_colemurray this is a valid concern but its mostly solvable by keeping your agent monitoring separate from the agent itself. if the monitoring tool can inject prompts into your agent, you have a bigger architecture problem. the fix is air-gapped notification channels, not tighter coupling
English
1
0
0
19
cole murray
cole murray@_colemurray·
this is a pretty fun novel attack vector to look out for as you wire the agent into sentry/datadog, a malicious actor could trigger a bug that causes a prompt injection
Shannon Code@Shannoncode

@euboid How do you prevent a targeted crash becoming the attack vector into the codebase? if the error injects instructions into codex, what's to prevent codex exfiltrating, or shoving some hidden payload into the codebase that might get missed by a reviewer in a pr review?

English
1
1
4
592
anirudh
anirudh@kamathematic·
"AI infra" would be nothing if AWS never open sourced Firecracker
English
14
5
292
31.4K
Aasir
Aasir@facelessfarms·
I got robbed by .@kalshi and ex-@cluely team for $100k Here's exactly what happened with receipts I ended up doing marketing for them via .@instinct_inc and the goal was simple: Post content on faceless pages, get paid per view at the agreed RPM. Listen to the guidelines and create the content So I went to work and delivered over 180 million views (majority coming from very viral videos) At the agreed rate of the campaign, that's over $100,000 owed Then the ghosting started .@elamri wouldn’t respond to my DMS for days after i told him about the payout Then suddenly mass rejections start coming through Videos that had been live for weeks generating millions of views, content that was performing exactly as intended, all rejected The reason (according to the kalshi marketing team): “We don’t approve gooner content” What were they referring to though? My video of one of the top 20 sportswomen in the world was deemed as “NSFW” by them and therefore not approved DESPITE the video having 5M+ views and directly promoting them I have 20+ screenshots of conversations of them trying to explain why and Dris refusing to pay me Dris’ own words: "they ideally just want sports content / memes (which was later direction)" Read that again. "LATER DIRECTION" ASIDE from not paying me for my content and calling it “gooner content” They're literally admitting the direction came after I'd already delivered After the views were already counted. After they realized how much they'd owe me Videos with female athletes apparently don't count as sports to them I asked Dris directly "so by this logic female sports doesn't exist?" His response: “tried mentioning this earlier will try to see if sent" It's been 4 weeks of ghosting now I’ve not been paid anything after generating 200M views I've tried handling this privately but it’s genuinely getting to a point Everyone I mentioned is at fault here because I've been ignored, ghosted, and had 100 excuses made on why I couldn't get paid To the Instinct and the Kalshi team: you know what you owe. It’s VERY scummy of you to think you can get away not paying marketers what you owe them Will never be doing business with these guys again
Aasir tweet mediaAasir tweet mediaAasir tweet mediaAasir tweet media
English
138
42
1.1K
220.5K
Matthew Rogers
Matthew Rogers@Matthewrogers·
@vxunderground can you imagine someone joins this to your wifi, and has a remote agent platform hacking your network 24x7 until it finds gold? We're in a new age of opec challenges, and I love it 😻
Matthew Rogers tweet media
English
2
0
1
62
Kye
Kye@kyeburchard·
weird day to get this dm
Kye tweet media
English
66
33
2.3K
246K
Gauri Gupta
Gauri Gupta@gauri__gupta·
can’t trust merging code without a Devin review anymore. Claude and Codex are great, but they don’t catch bugs the way Devin does. devin code review is OG. @cognition has built something special.
Gauri Gupta tweet media
English
10
9
113
11.6K
cole murray
cole murray@_colemurray·
in nearly every case (somewhat self selecting bias), the companies have tried very minimal AI rollout, but it isn’t sticking and there is opposition “a few engineers using cursor, but it’s not common” “Senior eng (vim btw) says AI is bad and they’re much better” part influencing, part guiding
English
0
0
1
71
cole murray
cole murray@_colemurray·
I see and get brought into companies to do exactly this; up-skilling of the org. the statement is valid and typically stems from: - skeptics in high positions holding the org back - lack of the “how” from a process as well as entire org adoption - fear of doing it incorrectly
Harry Stebbings@HarryStebbings

Spoke to a CRO of a hot Series B startup yesterday: “We don’t have the knowledge internally to implement AI and agents into our process.” Toast. You are toast. That is unacceptable. Everyone can learn. There is zero excuse for the above.

English
1
0
3
550
dex
dex@dexhorthy·
@garrytan interesting. You’ve seen no degradation in quality at high token usage?
English
13
0
60
7.6K
Garry Tan
Garry Tan@garrytan·
I underestimated how powerful Opus 4.6 with 1M tokens is. Even last year we were absolutely hitting context limit problems constantly. 1M tokens means you can do much more complex analysis entirely in context. Claude Code is so much better. This is the worst it will ever be.
English
183
87
2.2K
113.6K
cole murray
cole murray@_colemurray·
@thsottiaux I don’t like it on my personal commits, however… now maintaining an open source project, I do appreciate it, as it gives some level of understanding/confidence into which model made the commits. basically don’t have to wonder if some local LLM slop produced it
English
0
0
4
297
Tibo
Tibo@thsottiaux·
Do people like this? We don't do this for codex because it exists to help you and it's important that you remain the owner and accountable for your work without AI taking credit. At the same time it does mean that you can't trace how popular codex is among repos.
Yuchen Jin@Yuchenj_UW

I noticed something interesting: Claude Code auto-adds itself as a co-author on every git commit. Codex doesn’t. That’s why you see Claude everywhere on GitHub, but not Codex. I wonder why OpenAI is not doing that. Feels like an obvious branding strategy OpenAI is skipping.

English
562
68
2.4K
290K
Anthony
Anthony@kr0der·
i think if @DevinAI had clearer pricing a lot more people would try it, including me for the $20 plan it says you start with $20 of usage, with each ACU (Agent Compute Unit) being $2.25. as a user who's never tried Devin and wants to try, this is so confusing because does that mean i get ~10 prompts? 50 prompts? 2 prompts? who knows it'd be a lot less reluctant to try it out and upgrade later if it was more clear on the pricing. an ACU might make sense for an existing Devin user but to an outside user it doesn't help decide at all - i can't look at it and compare it to other software pricing mainly looking into it because of the Poke by @interaction integration but yeah imo it needs some clarity or something
Anthony tweet media
English
10
0
19
4.1K
James Zhou
James Zhou@jameszhou02·
btw their supabase storage bucket is publicly accessible via any signed url token 😭 exposes: > employee background checks > equity vesting schedules and grant amounts > performance reviews > session tokens for stripe, notion, etc > screenshots below 🧵 i also got access to their notion 😛
James Zhou tweet media
erin griffith@eringriffith

A detailed and brutal look at the tactics of buzzy AI compliance startup Delve "Delve built a machine designed to make clients complicit without their knowledge, to manufacture plausible deniability while producing exactly the opposite." substack.com/home/post/p-19…

English
104
102
2.1K
896.6K