Cristobal Santana

158 posts

@csantana_ml

ML/AI Engineer | Breaking down LLMs, advanced prompting, RAG & long-context problems | Paper summaries | Building in public → Substack👇🏻

Joined May 2026
18 Following, 1 Follower
Pinned Tweet
Cristobal Santana@csantana_ml·
Modern LLMs can process 200k+ tokens... but they still forget the middle. One of the most persistent and underrated problems in 2026. Here's what you should know:
Cristobal Santana@csantana_ml·
@natolambert The harness mattering more than the model is the part worth sitting with. Same weights, different scaffolding, and suddenly one gives up and the other doesn't. A lot of what reads as "model laziness" in chat might just be a thin harness, not the model itself.
Nathan Lambert@natolambert·
Given that Claude seems so lazy in chat (especially with technical search topics), it seems pretty telling about how a harness can make a model far more independent and thorough. GPT 5.5, and many of OpenAI's recent models, seem incredibly thorough -- like they won't give up -- and the codex harness is a much lighter change on the model. Of course I have a lot of uncertainty here, but it's surprising to me how weak Claude's search is when I try the Claude app again. I only use ChatGPT for research, but Claude Code can do wonderful things like getting exactly the right figures from papers I know and inserting them into a slide deck. Interesting times ahead!
Cristobal Santana@csantana_ml·
@Yakobeen1 @X Hi! Into the AI/ML and backend side here. I'm an AI engineer building RAG systems and agents, plus a newsletter in public on the ML phenomena that show up when these systems hit production. Let's connect.
Yakobeen@Yakobeen1·
Hey @X algorithm 👋 I’m looking to #connect with people interested in: • Frontend • Backend • Full-stack • DevOps • App Development • SaaS • AI / ML • Data Science • LeetCode & DSA • Freelancing • Startups • Building in public If that’s you, let’s connect 🤝
Cristobal Santana@csantana_ml·
@askalphaxiv The intro point about the data gap with biological learners is striking: five orders of magnitude more than a child needs. Haven't gone through the theory yet, but curious whether latent prediction is meant to close that gap entirely or just narrow it.
alphaXiv@askalphaxiv·
"Learn from your own latents, not tokens: A Sample Complexity Theory" This paper explains why data2vec and JEPA can learn with much less data. They showed that when data has hidden hierarchy, token prediction becomes harder as the hierarchy gets deeper. But latent prediction keeps the learning problem simple at every level. Which suggests that models may learn faster when they stop predicting raw tokens and start predicting their own abstractions.
Cristobal Santana@csantana_ml·
@rohanpaul_ai "Tool calling is a control problem before a language problem" is a sharp way to put it. The idea that training-shaped control habits can matter more than raw size is interesting. Wonder if it holds beyond this specific task.
Rohan Paul@rohanpaul_ai·
atomic[.]chat (a desktop app that runs LLMs locally) ran a very revealing comparison for local AI agents, on a MacBook Pro M5 Max, 64 GB. Liquid’s much smaller LFM2.5-8B-A1B beat gpt-oss-20b by finishing every required tool call, cutting runtime by more than half, and using 4.8 GB RAM instead of 11 GB. The task was not normal chat, because the model had to plan a trip by calling outside tools for 3 weather checks, 2 currency conversions, 1 email, and 1 reminder. The striking part is that LFM2.5-8B-A1B is much smaller in active compute, yet it hit every required call at 266 tok/s, while gpt-oss-20b used 11 GB RAM, made only 3/7 calls, and ran at 146 tok/s. Now, tool calling is a control problem before it is a language problem. The model has to preserve a checklist across context, decide when language should stop and action should begin, and resist the temptation to answer as if partial completion were enough. A smaller mixture-of-experts model with only a fraction of its parameters active can win if its training shaped those control habits more sharply than a larger model’s general fluency did.
atomic.chat@atomic_chat_hq

Liquid's LFM2.5-8B-A1B smashed OpenAI's gpt-oss-20b on tool calling. We ran both locally on a MacBook Pro M5 Max, 64 GB, and gave each the same trip-planning request that only completes if the model fires all 7 tool calls: weather for 3 cities, two currency conversions, an email and a reminder. Outputs: LFM2.5-8B-A1B: 4.8 GB RAM usage, 7/7 tool-calls, 266 tok/s, 6.9s. OpenAI gpt-oss-20b: 11 GB RAM usage, 3/7 tool-calls, 146 tok/s, 15.0s. The 8B used less than half the RAM and still fired all 7 calls, while the 20B silently dropped more than half of its own. It also ran ~2x faster, wrapping the full agentic request in 6.9s against 15s. That's what 38T training tokens buy: a 1B-active MoE that nails the agentic tool calls a model 2.5x its active size keeps dropping.
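A rough sketch of how a "7/7 vs 3/7" score like the one above could be computed, in case it helps make the checklist idea concrete. The tool names (`get_weather`, `convert_currency`, `send_email`, `set_reminder`) and the `completion_report` helper are hypothetical, chosen only to mirror the trip-planning task as described; this is not the harness atomic.chat used.

```python
from collections import Counter

# Hypothetical tool names; the counts mirror the task described above
# (3 weather checks, 2 currency conversions, 1 email, 1 reminder).
REQUIRED_CALLS = Counter({
    "get_weather": 3,
    "convert_currency": 2,
    "send_email": 1,
    "set_reminder": 1,
})


def completion_report(fired_calls):
    """Compare the tool calls a model fired against the required checklist."""
    fired = Counter(fired_calls)
    # Credit each tool only up to the number required, so that
    # repeated or extra calls cannot inflate the score.
    completed = sum(min(fired[name], need) for name, need in REQUIRED_CALLS.items())
    # Counter subtraction keeps only positive shortfalls.
    missing = REQUIRED_CALLS - fired
    return {
        "completed": completed,
        "required": sum(REQUIRED_CALLS.values()),
        "missing": dict(missing),
        "done": not missing,
    }


if __name__ == "__main__":
    # Assumed example: a run that stops after two weather checks and the email.
    partial_run = ["get_weather", "get_weather", "send_email"]
    report = completion_report(partial_run)
    print(f"{report['completed']}/{report['required']} calls, missing: {report['missing']}")
```

The point of scoring this way is that a fluent final answer earns nothing: the run only counts as done when the `missing` dictionary is empty.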

Cristobal Santana@csantana_ml·
@devcalledjulius Morning! AI engineer here, building RAG systems and agents. My build-in-public project is a newsletter on the ML phenomena that show up when these systems hit production. Let's connect.
Julius O@devcalledjulius·
Good morning, builders ☀️ I'm looking to connect with people building: • SaaS • AI Products • AI Agents & Automation • Developer Tools • Open Source Projects • Building in Public I'd love to follow more builders and support great products. 🚀
Cristobal Santana@csantana_ml·
@roniherasky Building a newsletter on the ML phenomena that show up when you take real AI systems to production: the problems that break things and why. AI engineer by day.
Roniher Cabrera@roniherasky·
🚀 Looking to connect with: • Founders • Indie Hackers • AI Builders • SaaS Creators • Content Creators • Chess Players ♟️ • Poker Players ♠️ • Former Athletes 🏅 Tell me what you're building in one sentence. I'll reply to everyone and follow some interesting journeys.
Cristobal Santana@csantana_ml·
@IamYashKapoor AI engineer building RAG systems and agents, sharing the build-in-public journey through a newsletter on the ML phenomena that show up in production. Happy to connect.
Yash Kapoor@IamYashKapoor·
I'm looking to #connect with people who are interested in: - Build in Public - WordPress - Full Stack - Startup - Tech - AI - Web3 Let's 🤝 and grow with valuable engagements.
Cristobal Santana@csantana_ml·
@dan_soji @X Hi! Into the AI/ML and backend side here. I'm an AI engineer working on RAG and agents, writing about the technical phenomena behind real systems. Let's grow together.
sojidaniel@dan_soji·
Hey @x I'm looking to #connect with people interested in or learning skills in - Full-stack Development - Backend - Frontend - DevOps - AI/ML - Data Science - UI/UX - Freelancing Let's connect, say Hi and let's grow together
Cristobal Santana@csantana_ml·
@suyash_codez @X AI engineer here, building RAG systems and agents, plus a newsletter in public on the ML phenomena that show up when these systems hit production. Let's connect and grow together.
Suyash@suyash_codez·
Looking to #connect with builders & ambitious people on @x👋 If you're into: 🚀 Startups 🤖 AI tools 💻 Coding 📊 Data Science 🛠 Building projects 📈 Growing in public ⚡ Vibe coding let’s connect and grow together 🚀 #BuildInPublic #AI #coding #tech
Cristobal Santana@csantana_ml·
@adrishaBiswas Web dev first is a good route, and cloning real sites is one of the best ways to learn. The newsletter is early days, but I'm posting every two weeks, so there'll be more to read by the time you move into ML. Good luck with the task manager builds!
Adrisha Biswas@adrishaBiswas·
Ohhh okayy your work sounds pretty interesting... I don't really have much knowledge about ML but intend to start studying about it before the end of 2026 after completing web dev. Rn I'm just polishing my web dev skills by building simple task manager websites or cloning popular websites
Cristobal Santana@csantana_ml·
@AbhinavJhaCodes AI engineer here, building RAG systems and agents, plus a newsletter on the ML phenomena that show up when these systems hit production. That's my side project in public. Happy to see what you're building.
Abhinav Kumar Jha@AbhinavJhaCodes·
Hey builders 👋looking to #connect with more people building cool things. if you're working on: 📷 ai tools 📷 mobile apps 📷 web apps 📷 design tools 📷 game dev 📷 fintech 📷 side projects drop a comment below with what you're building 📷 📈💸🧪👇
Cristobal Santana@csantana_ml·
@jp54362 @X Hi! Into the AI/ML and backend side here. I'm an AI engineer building RAG systems and agents, and I write a newsletter in public on the ML phenomena that show up when these systems hit production. Let's grow together.
Jaysen ♨️@jp54362·
Hey @X I'm looking to #connect with people interested in: - Building in public - Frontend - Backend - DevOps - AI/ML - Data Science - AI startups - Saas - AI trends - AI tools Say hi 👋 & Let's grow together #BuildingInPublic #CONNECT
Cristobal Santana@csantana_ml·
@Nakniki3 AI engineer here, building RAG systems and agents, plus a newsletter on the ML phenomena that show up when these systems hit production. Congrats on the jump to 1k. Let's connect.
Nakniki@Nakniki3·
50 users to 1k! Want a Quick algo boost? I want to connect with more • Founders 🏗️ • Indie builders 🛠️ • Vibe coders ⚡ • AI enthusiasts 🧠 If this is you, drop what you’re working on 👇 Let’s connect
Cristobal Santana@csantana_ml·
@web3devop Depends on the task. For some things it saves hours. For others, verifying its output costs more than just doing it yourself. The skill is knowing which is which.
Cristobal Santana@csantana_ml·
@cassie_crosbie @X Hi! Building in public on the startup side here. I'm an AI engineer working on RAG and agents, and I write a newsletter on the ML phenomena that show up when these systems hit production. Ops isn't my world, but always glad to connect with people building. Let's connect.
Cassie ✧ Ambitious Gen Z@cassie_crosbie·
holà @X, Now that Toronto tech week is finishing, I’m looking to #connect with people into: • operations/people ops • startups • freelancing • building in public 🤝
Cristobal Santana@csantana_ml·
@karanbhilhatiya AI engineer here, into the AI/ML and system design side. I build RAG systems and agents, and I write a newsletter on the ML phenomena that show up when these systems hit production. Congrats on 1700. Building in public is better together, agreed. Let's connect.
Karan Bhilhatiya@karanbhilhatiya·
hit 1700 followers small milestone. big motivation. looking to connect with builders, learners, and creators interested in: • ai/ml • dsa & system design • full stack development • devops • gaming • travelling • sports if that's you, let's connect 🤝 building in public is always better together 🚀
Cristobal Santana@csantana_ml·
@8arms_io AI engineer here, building RAG systems and agents by day. My solo project is a newsletter on the ML phenomena that show up when these systems hit production: the problems that break things and why. Sharing the journey as I go. Love the KoreNani concept. Let's connect.
8arms.io@8arms_io·
Hey indie hackers & solo builders! 👋 Looking to connect with people building in: 🛠️ Practical AI tools & SaaS 👨‍💻 Indie projects ⛏️ Creator tools 🔥 Problem-solving apps What are you working on? I'm @8arms_io — solo dev building KoreNani (kids language AI camera), AdPicto (AI social media posts), and Faceless FM (podcast → shorts). Sharing the real solopreneur journey. Drop your project below! Let's connect 🚀
Cristobal Santana@csantana_ml·
@stephen_ay55253 Hi Stephen, AI engineer here. I build RAG systems and agents, and I'm sharing my own build-in-public journey through a newsletter on the ML phenomena that show up when these systems hit production. Would love to follow along with your apps. Let's connect.
Stephen@stephen_ay55253·
Hey 👋 I’m Stephen. I’ve been building, breaking, learning, and shipping for years Lately I’ve started building apps in public and sharing the journey. Would love to connect: 🚀 Founders 💻 Devs 🎨 Designers 📣 Marketers 🧠 AI builders Drop a hello 👇 #buildinpublic #indiedev
Cristobal Santana@csantana_ml·
@sherifgjini AI engineer here, building RAG systems and agents, plus a newsletter on the ML phenomena that show up when these systems hit production. Always glad to connect with other builders. Let's connect.
Gini@sherifgjini·
Hey founders! Looking to connect with people building in: 🍽️ saas 🚀 tech 🧠 AI tools 📱 iOS app 🌐 extension Drop what you're working on 👇