
cole murray
7.5K posts

cole murray
@_colemurray
ai/ml | cto | second time founder | former sr. sde @ amazon
San Francisco, CA Beigetreten Şubat 2015
961 Folgt3.7K Follower
Angehefteter Tweet

Advice given to someone asking about AI Consulting:
I don't think an ML background is required to be successful in AI consulting, but obviously helps. I think the biggest "skill" learned in ML is how to successfully do feedback loops in a system.
In an ML system, this typically involves cleaning data, making model tweaks, performance evals etc. In LLMs, in nearly every case you won't be fine-tuning the model, but iterating on prompts is a very similar workflow.
I do think it would be helpful to at least get a high-level learning of how the models "actually" work and become familiar with the basic terms. e.g. tokens, transformers, attention, what happens on each input -> output iteration as the model is predicting. You don't need to know the underlying math (helpful though), but having the understanding of what is happening is helpful.
Most of the AI consulting market is more on full-stack / product development skills and less ML. This isn't the most lucrative opportunities, but they are available in abundance.
Major areas now and over the next year:
- RAG: this is basically just glorified search lol. Useful in many contexts but severely overhyped
- Agents: The models aren't quite there yet IMO for this to be useful, but in 2025 I think this will be a major theme and a HUGE area of interest/investment. Becoming good at this will be valuable.
- Evals: Performance evaluations are a relatively untapped market. Most AI products you see today are flying by the seat of their pants. Without eval metrics, you can't truly know if your prompt changes are improving the system. This is somewhat more difficult to sell as a consultant as it requires a more sophisticated buyer, but is worth a lot of money if you can do it well
English



@wolfofbaystreet he really does it all
also helped save a cat stuck in a tree by my house while giving a demo (he closed the deal btw)
English

Nothing preps you for b2b like the consumer trenches.
peak covid feds said to sit, collect stimmy checks, get v*xxed
I said, no
100 demos a day. hail, wind, rain.
I’m hitting the mall, parks, bars.
Literally tap on people's phones, download, 5 stars.
4 viral yt vids (long-form)/week
climbed a KFC in a colonel sanders costume and did a seance on the roof
millions of views as pigsaw (half pig half Texas chainsaw massacre)
Code breaks when it’s touched by a user for the first time
Stop debug w Arta remotely, send video/ss, do hot fix right there.
Next demo.
English


JUST IN: 🇺🇸🇮🇷 CIA used a futuristic new tool called "Ghost Murmur" to locate and rescue the second US pilot who was shot down over Iran, NYPost reports.
"The secret technology uses long-range quantum magnetometry to find the electromagnetic fingerprint of a human heartbeat and pairs the data with artificial intelligence software to isolate the signature from background noise."


English

NEW: The CIA used a secret tool called "Ghost Murmur" that uses AI to find heartbeats to rescue the U.S. airman who was stranded in Iran, according to the New York Post.
The secret technology was allegedly used for the first time in the field, according to the Post.
"The secret technology uses long-range quantum magnetometry to find the electromagnetic fingerprint of a human heartbeat and pairs the data with artificial intelligence software to isolate the signature from background noise," the Post reported.
"It’s like hearing a voice in a stadium, except the stadium is a thousand square miles of desert," the source said.
"In the right conditions, if your heart is beating, we will find you."
"The name is deliberate. ‘Murmur’ is a clinical term for a heart rhythm. ‘Ghost’ refers to finding someone who, for all practical purposes, has disappeared..."
"Advances in a field known as quantum magnetometry, specifically sensors built around microscopic defects in synthetic diamonds, have apparently made it possible to detect these signals at dramatically greater distances."
CIA Director John Ratcliffe appeared to hint at this technology on Monday, saying the CIA possessed "unique capabilities" but said he couldn't "tell you everything that you want to know."
President Trump also revealed during the press conference that the CIA spotted the officer from about "40 miles away."
Insane.
English

@fabianstelzer can you add the uav on-click functionality, but instead it deploys the spiderman performer
English

real ones can spot an AI memory fraud with one look under the hood
imagine thinking you *solved* retrieval with a naive vector search
cole murray@_colemurray
@bensig > Semantic search across months of conversations finds the answer in position 1 or 2
English

@bensig > Semantic search across months of conversations finds the answer in position 1 or 2

English

My friend Milla Jovovich and I spent months creating an AI memory system with Claude. It just posted a perfect score on the standard benchmark - beating every product in the space, free or paid.
It's called MemPalace, and it works nothing like anything else out there.
Instead of sending your data to a background agent in the cloud, it mines your conversations locally and organizes them into a palace - a structured architecture with wings, halls, and rooms that mirrors how human memory actually works.
Here is what that gets you:
→ Your AI knows who you are before you type a single word - family, projects, preferences, loaded in ~120 tokens
→ Palace architecture organizes memories by domain and type - not a flat list of facts, a navigable structure
→ Semantic search across months of conversations finds the answer in position 1 or 2
→ AAAK compression fits your entire life context into 120 tokens - 30x lossless compression any LLM reads natively
→ Contradiction detection catches wrong names, wrong pronouns, wrong ages before you ever see them
The benchmarks:
100% recall on LongMemEval — first perfect score ever recorded. 500/500 questions. Every question type at 100%.
92.9% on ConvoMem — more than 2x Mem0's score.
100% on LoCoMo — every multi-hop reasoning category, including temporal inference which stumps most systems.
No API key. No cloud. No subscription. One dependency. Runs on your machine. Your memories never leave.
MIT License. 100% Open Source.
github.com/milla-jovovich…

English


the vast majority of AI-failures today are not reasoning failures but context failures
Jeff Huber@jeffreyhuber
English

@Niyxuis add OpenInspect to the list!
github.com/ColeMurray/bac…
English

Sharing my review for @conductor_build (cc: @charlieholtz ) 🧵
Next up: @getsome_air , @agentastic and more.

English

@andrewchen I am experiencing this video now lol. it is terrible
English

at what point does SF's NIMBYism make it a hostile and too expensive place for new startups? Certainly makes it hard for the unfunded
eventually the California undersupply of housing becomes the primary obstacle to tech and innovation (and thus the larger US economy)
Lots of people working on this but this video is nuts
Dimitris Drolapas@DDrolapas
What the turnout for a $2500 studio in San Francisco's Pacific Heights looks like right now:
English

Claude Code rate limited me so hard I bought a $5,000 NVIDIA DGX Spark.
Arriving tomorrow. A personal AI supercomputer.
Anthropic cut off OpenClaw users.
Slashed Claude Opus 4.6 rate limits.
Told $200/month Max plan customers to use less.
Then gave us a credit as an apology.
This is what happens when AI companies have too much power over your workflow.
One update and your entire stack breaks.
Local models are the only infrastructure no one can throttle.
No rate limits. No 529 errors.
No surprise policy changes.
Tomorrow I'm testing the DGX Spark live on stream.
Running local models through real vibe coding workflows.
The goal is simple.
Never depend on a single provider again.

English

>be me
>expert in search and retrieval
>read article about “company memory”
>read implementation
>dump everything into one db
>basic vector db search
>doesn’t understand query document encoder problem
>no FTS
>no reranker
>every time
ericosiu@ericosiu
English

a tale as old as time
try to wedge graph based algorithms and dbs into whatever cycle we are in
this time will be different, i’m sure

Jaya Gupta@JayaGup10
Thank you Forbes
English










