Sam Collins

1.1K posts

Sam Collins banner
Sam Collins

Sam Collins

@smcllns

Taking a break to self-study AI. Prev: Senior Product Director @Meta, cofounder @Eventasaurus, editor @MindTheProduct, engineering @EdinburghUni.

Bay Area, CA Katılım Ekim 2009
691 Takip Edilen889 Takipçiler
Sam Collins
Sam Collins@smcllns·
TLDR: use SVG to outline then send that image with your text prompt to get Gemini 3.0 Pro to render with correct numbers and text
Sam Collins tweet mediaSam Collins tweet media
English
0
0
1
122
Sam Collins
Sam Collins@smcllns·
I found a simple technique to get reliable text and numbers in AI generated images: samcollins.blog/underdrawings I’m surprised the image models aren’t already doing this. It’s basic but works well.
Sam Collins tweet media
English
3
1
20
64.8K
Sam Collins
Sam Collins@smcllns·
“You're either in the data distribution (on the rails of the RL circuits) and flying or you're off-roading in the jungle with a machete”
Andrej Karpathy@karpathy

Fireside chat at Sequoia Ascent 2026 from a ~week ago. Some highlights: The first theme I tried to push on is that LLMs are about a lot more than just speeding up what existed before (e.g. coding). Three examples of new horizons: 1. menugen: an app that can be fully engulfed by LLMs, with no classical code needed: input an image, output an image and an LLM can natively do the thing. 2. install .md skills instead of install .sh scripts. Why create a complex Software 1.0 bash script for e.g. installing a piece of software if you can write the installation out in words and say "just show this to your LLM". The LLM is an advanced interpreter of English and can intelligently target installation to your setup, debug everything inline, etc. 3. LLM knowledge bases as an example of something that was *impossible* with classical code because it's computation over unstructured data (knowledge) from arbitrary sources and in arbitrary formats, including simply text articles etc. I pushed on these because in every new paradigm change, the obvious things are always in the realm of speeding up or somehow improving what existed, but here we have examples of functionality that either suddenly perhaps shouldn't even exist (1,2), or was fundamentally not possible before (3). The second (ongoing) theme is trying to explain the pattern of jaggedness in LLMs. How it can be true that a single artifact will simultaneously 1) coherently refactor a 100,000-line code base *and* 2) tell you to walk to the car wash to wash your car. I previously wrote about the source of this as having to do with verifiability of a domain, here I expand on this as having to also do with economics because revenue/TAM dictates what the frontier labs choose to package into training data distributions during RL. You're either in the data distribution (on the rails of the RL circuits) and flying or you're off-roading in the jungle with a machete, in relative terms. Still not 100% satisfied with this, but it's an ongoing struggle to build an accurate model of LLM capabilities if you wish to practically take advantage of their power while avoiding their pitfalls, which brings me to... Last theme is the agent-native economy. The decomposition of products and services into sensors, actuators and logic (split up across all of 1.0/2.0/3.0 computing paradigms), how we can make information maximally legible to LLMs, some words on the quickly emerging agentic engineering and its skill set, related hiring practices, etc., possibly even hints/dreams of fully neural computing handling the vast majority of computation with some help from (classical) CPU coprocessors.

English
0
0
0
174
Sam Collins
Sam Collins@smcllns·
Great write up @aparnadhinak. A q and a nit: Q: what about memory system? Identifying what to remember/forget etc. Do you consider that part of the harness? Nit: the headline from your linked pdf is a neat one liner I think belongs early in this post: a harness turns a model into an agent.
English
0
0
1
54
Sam Collins
Sam Collins@smcllns·
@jahirsheikh8 I would guess: - some consumers will get UBI or equivalent - some consumers adapt and keep earning - the biz-biz spending that doesn’t show up much in GDP metric will grow a lot But it does seem like #1 will become unavoidable
Sam Collins tweet media
English
0
0
0
31
Jahir Sheikh
Jahir Sheikh@jahirsheikh8·
Everyone says AI will replace most jobs. But if millions lose their income… Who buys the products? Who pays for the services? Who keeps the economy running? If nobody earns, nobody spends. So what exactly is the endgame here? What are we missing?
English
51
3
48
4.3K
Sam Collins
Sam Collins@smcllns·
@danshipper I do a thing like this where I wrote a small claude skill that looks for #claude comments in obsidian and it periodically checks and handles what I ask it for.
English
0
0
0
361
Dan Shipper 📧
Dan Shipper 📧@danshipper·
im writing an essay in proof using codex's in-app browser. i type directly in the document, and codex loops in parallel on the left, collaborating with me in realtime. this is obviously the future:
Dan Shipper 📧 tweet media
English
42
32
500
56.4K
Sam Collins
Sam Collins@smcllns·
@grok can you make a better printable version than what chatgpt produced?
English
0
0
0
20
Sam Collins
Sam Collins@smcllns·
This seems small but is a HUGE DEAL for busy parents. Thanks to ChatGPT-Images-2 you can now: 1. Write down everything you nag/remind your kids about each day in plain text. 2. Ask chatgpt to make it into a kid friendly print version you can stick on your wall
Sam Collins tweet mediaSam Collins tweet media
English
1
0
0
65
Sam Collins
Sam Collins@smcllns·
@Shpigford Hmm, removed and installed latest from App Store but don't have it. v2.5. Install from gh? Was assuming store would handle updates best.
Sam Collins tweet media
English
1
0
0
12
Sam Collins
Sam Collins@smcllns·
@Shpigford serious question - what do you think about wysiwyg edit mode in Clearly? I built my own little native md editor for this, but I switched to Clearly a few weeks ago to reduce 'distracting side projects' but I'm really missing it. Curious if you are considering, or if I should carve out some time to update mine to a stable v1 (electrobun + milkdown + codemirror)
English
1
0
1
42
Josh Pigford
Josh Pigford@Shpigford·
had my annual eye exam yesterday and looks like i'm now gonna need to wear contacts, readers AND prism glasses (which i'd never even heard of). ya boi getting OLD.
English
8
0
14
2.8K
Sam Collins
Sam Collins@smcllns·
@linuz90 Yeah I built a minimalist md editor for doing this too. I ask claude to execute `md <file.md>` and it's like the agent throws a doc at me, it's lovely. Cognito looks nice btw, are you considering adding wysiwyg edit mode or anything for inline commenting?
English
0
0
1
70
Sam Collins
Sam Collins@smcllns·
But your goal needs to be distribution.
English
0
0
0
7
Sam Collins
Sam Collins@smcllns·
You should still spend real human cycles refining your architecture and product experience. Your agents will gravitate your design toward "statistically-desirable" (aka slop). Use your taste and agency to push you from mean to outlier.
English
1
0
0
17
Sam Collins
Sam Collins@smcllns·
Writing tons of features will not differentiate your product. Writing beautiful code will not differentiate your product. Open sourcing your code will not differentiate your product. Giving your product away for free will not differentiate your product.
English
1
0
0
21