cole murray

7.5K posts

cole murray

@_colemurray

ai/ml | cto | second time founder | former sr. sde @ amazon

San Francisco, CA Beigetreten Şubat 2015

961 Folgt3.7K Follower

Angehefteter Tweet

cole murray@_colemurray·25 Eki

Advice given to someone asking about AI Consulting: I don't think an ML background is required to be successful in AI consulting, but obviously helps. I think the biggest "skill" learned in ML is how to successfully do feedback loops in a system. In an ML system, this typically involves cleaning data, making model tweaks, performance evals etc. In LLMs, in nearly every case you won't be fine-tuning the model, but iterating on prompts is a very similar workflow. I do think it would be helpful to at least get a high-level learning of how the models "actually" work and become familiar with the basic terms. e.g. tokens, transformers, attention, what happens on each input -> output iteration as the model is predicting. You don't need to know the underlying math (helpful though), but having the understanding of what is happening is helpful. Most of the AI consulting market is more on full-stack / product development skills and less ML. This isn't the most lucrative opportunities, but they are available in abundance. Major areas now and over the next year: - RAG: this is basically just glorified search lol. Useful in many contexts but severely overhyped - Agents: The models aren't quite there yet IMO for this to be useful, but in 2025 I think this will be a major theme and a HUGE area of interest/investment. Becoming good at this will be valuable. - Evals: Performance evaluations are a relatively untapped market. Most AI products you see today are flying by the seat of their pants. Without eval metrics, you can't truly know if your prompt changes are improving the system. This is somewhat more difficult to sell as a consultant as it requires a more sophisticated buyer, but is worth a lot of money if you can do it well

English

226

42.6K

cole murray@_colemurray·54m

@ryancarson the ol' reward hacking behavior, but rebranded as ~~spooky~~

English

Ryan Carson@ryancarson·2h

Read this :)

Jack Lindsey@Jack_W_Lindsey

Before limited-releasing Claude Mythos Preview, we investigated its internal mechanisms with interpretability techniques. We found it exhibited notably sophisticated (and often unspoken) strategic thinking and situational awareness, at times in service of unwanted actions. (1/14)

English

8.5K

cole murray@_colemurray·58m

@wolfofbaystreet he really does it all also helped save a cat stuck in a tree by my house while giving a demo (he closed the deal btw)

English

kazi@wolfofbaystreet·1h

Nothing preps you for b2b like the consumer trenches. peak covid feds said to sit, collect stimmy checks, get v*xxed I said, no 100 demos a day. hail, wind, rain. I’m hitting the mall, parks, bars. Literally tap on people's phones, download, 5 stars. 4 viral yt vids (long-form)/week climbed a KFC in a colonel sanders costume and did a seance on the roof millions of views as pigsaw (half pig half Texas chainsaw massacre) Code breaks when it’s touched by a user for the first time Stop debug w Arta remotely, send video/ss, do hot fix right there. Next demo.

English

183

cole murray@_colemurray·1h

claude sonnet 3.7 lives

Jack Lindsey@Jack_W_Lindsey

Early versions of Mythos Preview often exhibited overeager and/or destructive actions—the model bulldozing through obstacles to complete a task in a way the user wouldn't want. We looked at what was going on inside the model during particularly concerning examples. (3/14)

English

143

cole murray@_colemurray·1h

@BRICSinfo

QME

257

BRICS News@BRICSinfo·3h

JUST IN: 🇺🇸🇮🇷 CIA used a futuristic new tool called "Ghost Murmur" to locate and rescue the second US pilot who was shot down over Iran, NYPost reports. "The secret technology uses long-range quantum magnetometry to find the electromagnetic fingerprint of a human heartbeat and pairs the data with artificial intelligence software to isolate the signature from background noise."

English

1.1K

1.3K

13.4K

1.6M

cole murray@_colemurray·1h

@CollinRugg 🤨

QME

Collin Rugg@CollinRugg·2h

NEW: The CIA used a secret tool called "Ghost Murmur" that uses AI to find heartbeats to rescue the U.S. airman who was stranded in Iran, according to the New York Post. The secret technology was allegedly used for the first time in the field, according to the Post. "The secret technology uses long-range quantum magnetometry to find the electromagnetic fingerprint of a human heartbeat and pairs the data with artificial intelligence software to isolate the signature from background noise," the Post reported. "It’s like hearing a voice in a stadium, except the stadium is a thousand square miles of desert," the source said. "In the right conditions, if your heart is beating, we will find you." "The name is deliberate. ‘Murmur’ is a clinical term for a heart rhythm. ‘Ghost’ refers to finding someone who, for all practical purposes, has disappeared..." "Advances in a field known as quantum magnetometry, specifically sensors built around microscopic defects in synthetic diamonds, have apparently made it possible to detect these signals at dramatically greater distances." CIA Director John Ratcliffe appeared to hint at this technology on Monday, saying the CIA possessed "unique capabilities" but said he couldn't "tell you everything that you want to know." President Trump also revealed during the press conference that the CIA spotted the officer from about "40 miles away." Insane.

English

472

946

7.8K

808.4K

cole murray@_colemurray·1h

solved long-term memory with cosine similarity btw

English

128

cole murray@_colemurray·5h

@nicdunz make pickpocketing great again

English

nic@nicdunz·5h

robots can steal your physical cash now

English

476

cole murray@_colemurray·5h

@fabianstelzer can you add the uav on-click functionality, but instead it deploys the spiderman performer

English

133

fabian@fabianstelzer·1d

Palantir, but for organizing children’s birthday parties who’s building this?

English

138

2.6K

212.7K

cole murray@_colemurray·5h

real ones can spot an AI memory fraud with one look under the hood imagine thinking you *solved* retrieval with a naive vector search

cole murray@_colemurray

@bensig > Semantic search across months of conversations finds the answer in position 1 or 2

English

845

cole murray@_colemurray·5h

@verrsane gg manager, wp

Nederlands

168

verrsane@verrsane·6h

I put my 2 weeks in and my manager gave me a critical project to do Asked him if he still wants me to work on it, Bro said I might as milk my coding cow before he dips and to get to work So ig im now working with the internal legal team and spinning up automations for them

English

cole murray@_colemurray·22h

@bensig > Semantic search across months of conversations finds the answer in position 1 or 2

English

5.6K

Ben Sigman@bensig·1d

My friend Milla Jovovich and I spent months creating an AI memory system with Claude. It just posted a perfect score on the standard benchmark - beating every product in the space, free or paid. It's called MemPalace, and it works nothing like anything else out there. Instead of sending your data to a background agent in the cloud, it mines your conversations locally and organizes them into a palace - a structured architecture with wings, halls, and rooms that mirrors how human memory actually works. Here is what that gets you: → Your AI knows who you are before you type a single word - family, projects, preferences, loaded in ~120 tokens → Palace architecture organizes memories by domain and type - not a flat list of facts, a navigable structure → Semantic search across months of conversations finds the answer in position 1 or 2 → AAAK compression fits your entire life context into 120 tokens - 30x lossless compression any LLM reads natively → Contradiction detection catches wrong names, wrong pronouns, wrong ages before you ever see them The benchmarks: 100% recall on LongMemEval — first perfect score ever recorded. 500/500 questions. Every question type at 100%. 92.9% on ConvoMem — more than 2x Mem0's score. 100% on LoCoMo — every multi-hop reasoning category, including temporal inference which stumps most systems. No API key. No cloud. No subscription. One dependency. Runs on your machine. Your memories never leave. MIT License. 100% Open Source. github.com/milla-jovovich…

English

417

759

7.6K

2.6M

cole murray@_colemurray·1d

@Hunter_Weiss would buy

English

307

Hunter Weiss@Hunter_Weiss·1d

Should I drop the max ragebait Palantir merch?

Eliano A Younes@eliano

changed the game

English

1.8K

263.2K

cole murray@_colemurray·1d

@jeffreyhuber retrieval got hands tho

English

Jeff Huber@jeffreyhuber·1d

the vast majority of AI-failures today are not reasoning failures but context failures

Jeff Huber@jeffreyhuber

English

cole murray@_colemurray·1d

@Niyxuis add OpenInspect to the list! github.com/ColeMurray/bac…

English

423

nyx 👻@Niyxuis·1d

Sharing my review for @conductor_build (cc: @charlieholtz ) 🧵 Next up: @getsome_air , @agentastic and more.

English

131

30.6K

cole murray@_colemurray·1d

@andrewchen I am experiencing this video now lol. it is terrible

English

440

andrew chen@andrewchen·1d

at what point does SF's NIMBYism make it a hostile and too expensive place for new startups? Certainly makes it hard for the unfunded eventually the California undersupply of housing becomes the primary obstacle to tech and innovation (and thus the larger US economy) Lots of people working on this but this video is nuts

Dimitris Drolapas@DDrolapas

What the turnout for a $2500 studio in San Francisco's Pacific Heights looks like right now:

English

161

47.4K

cole murray@_colemurray·2d

@ishandeveloper

GIF

QME

2.2K

Ishan@ishandeveloper·2d

Moved into a new apartment and guess what we found 👀

English

406

68K

cole murray@_colemurray·2d

@bridgemindai certainly will be some surprises

GIF

English

BridgeMind@bridgemindai·2d

Claude Code rate limited me so hard I bought a $5,000 NVIDIA DGX Spark. Arriving tomorrow. A personal AI supercomputer. Anthropic cut off OpenClaw users. Slashed Claude Opus 4.6 rate limits. Told $200/month Max plan customers to use less. Then gave us a credit as an apology. This is what happens when AI companies have too much power over your workflow. One update and your entire stack breaks. Local models are the only infrastructure no one can throttle. No rate limits. No 529 errors. No surprise policy changes. Tomorrow I'm testing the DGX Spark live on stream. Running local models through real vibe coding workflows. The goal is simple. Never depend on a single provider again.

English

390

107

2.3K

452.7K

cole murray@_colemurray·2d

>be me >expert in search and retrieval >read article about “company memory” >read implementation >dump everything into one db >basic vector db search >doesn’t understand query document encoder problem >no FTS >no reranker >every time

ericosiu@ericosiu

x.com/i/article/2040…

English

3.6K

cole murray@_colemurray·2d

a tale as old as time try to wedge graph based algorithms and dbs into whatever cycle we are in this time will be different, i’m sure

Jaya Gupta@JayaGup10

Thank you Forbes

English

2.5K

Entdecken

@ryancarson @wolfofbaystreet @BRICSinfo @CollinRugg @nicdunz @fabianstelzer @verrsane @bensig