Leo

1.7K posts

@_leander30

AI/ML professional actively seeking opportunities

India · Joined September 2011
275 Following · 162 Followers
Pinned Tweet
Leo @_leander30
After 3 months of non-stop building, I’m back with a new daily posting series. I just shipped three production AI projects (RAG system, job application agent, and GitHub portfolio reviewer). Starting today, I’ll showcase one deep dive every day. First up: my grounded RAG Q&A system that beats OpenAI File Search & Vectara on RAGAS benchmarks. Live: helpmateai.xyz
Leo retweeted
NeilXbt @neil_xbt
ANDREJ KARPATHY COULD HAVE CHARGED $500 FOR THIS WALKTHROUGH. He put it on YouTube. Every way he personally uses LLMs in his own life. Thinking models. Deep research. File uploads. Python interpreter. Claude Artifacts. Not theory. Not benchmarks. The actual daily workflow of the person who built Tesla Autopilot and co-founded OpenAI. 2 hours walking through his personal LLM workflow. The gap between people who watch this week and those who save it for later is not 2 hours. It is everything those 2 hours quietly change about how you work for the rest of your career.
Leo retweeted
CyrilXBT @cyrilXBT
ANTHROPIC JUST PROVED MOST PEOPLE HAVE NO IDEA HOW TO PROMPT CLAUDE.

Their applied AI team dropped a 24 minute free workshop. Not a creator who reverse engineered it. Not a Reddit thread. ANTHROPIC. The people who wrote the weights.

And what they showed is uncomfortable. There are 6 elements to a properly structured Claude prompt. Most people are using 1. Maybe 2. That is not a skill issue. That is an information issue. And it has been quietly costing you every single day. The outputs that felt slightly off. The responses you had to rewrite 4 times. The prompts that worked once and never again. All of it traces back to the same 6 missing elements.

The people who watch this 24 minute workshop tonight will understand something about Claude that most daily users still do not know exists. The people who skip it will keep getting 30% of what the tool is actually capable of and wonder why the results never quite land.

I watched it twice. Then I built a Claude Skill that applies all 6 elements to every prompt automatically. No more thinking about structure. No more guessing what Claude needs. The framework runs in the background every single time.

Full breakdown and skill setup is below. Bookmark this now. Watch the workshop first. Then read the guide. This is the one that compounds.

Follow @cyrilXBT for the exact prompt architecture, Claude skills, and systems I use to get outputs most people do not believe came from one person working alone.
Leo @_leander30
It’s a good reminder that better RAG is not always about retrieving more. Sometimes it’s about retrieving more selectively and being more critical about the evidence before answering. #RAG #AI
Leo @_leander30
Self-RAG pushes on that by making retrieval and critique part of the generation process.
Instead of treating RAG as retrieve -> stuff context -> answer, the model learns to:
- retrieve when needed
- reflect on the evidence
- critique its own response
That’s a much more interesting framing than naive “top-k chunks + prompt”.
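Roughly, the loop looks like this. A minimal sketch of the retrieve → reflect → critique idea, not the paper’s trained reflection tokens; `llm(prompt)` and `retriever.search(query, k)` are hypothetical stand-ins for whatever model and index you use:

```python
# Minimal sketch of a Self-RAG-style loop (illustrative only, not the paper's
# trained reflection tokens). The model decides whether to retrieve, filters
# the evidence, drafts an answer, then critiques its own draft for support.
# `llm(prompt)` and `retriever.search(query, k)` are hypothetical stand-ins.

def self_rag_answer(query: str, retriever, llm, max_rounds: int = 2) -> str:
    # 1. Retrieve only when the model says the question needs external evidence.
    needs = llm(f"Does answering this need external documents? yes/no\n\n{query}")
    if needs.strip().lower().startswith("no"):
        return llm(f"Answer from your own knowledge:\n\n{query}")

    draft = ""
    for _ in range(max_rounds):
        # 2. Retrieve, then keep only passages the model judges relevant.
        passages = retriever.search(query, k=5)
        relevant = [
            p for p in passages
            if llm(f"Question: {query}\nPassage: {p}\nRelevant? yes/no")
            .strip().lower().startswith("yes")
        ]

        # 3. Draft an answer grounded only in the surviving evidence.
        context = "\n\n".join(relevant)
        draft = llm(f"Answer using only this context:\n{context}\n\nQ: {query}")

        # 4. Self-critique: is every claim actually supported by the context?
        verdict = llm(
            f"Context:\n{context}\n\nAnswer:\n{draft}\n\n"
            "Is every claim supported by the context? yes/no"
        )
        if verdict.strip().lower().startswith("yes"):
            return draft

        # Otherwise rewrite the query and retrieve again.
        query = llm(f"Rewrite this question to retrieve better evidence: {query}")
    return draft
```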
Leo @_leander30
Today I’m reading Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection.
Paper: arxiv.org/abs/2310.11511
One idea I really like: RAG shouldn’t always retrieve the same way for every query.
Leo @_leander30
This feels very relevant to what I’m building. My current retrieval already uses section synopses and summary-aware retrieval for broad questions, but RAPTOR’s recursive summary hierarchy is a really interesting extension of that idea. #RAG #LLM #PaperReview
Leo @_leander30
RAPTOR builds a hierarchy of summaries, so retrieval can happen at multiple levels of abstraction instead of only pulling nearby chunks.
That’s especially useful for broad questions like:
• What is this paper about?
• What are the key findings?
• How do these sections connect?
This is where naive RAG usually starts to break.
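As a rough sketch of the core idea (simplified: fixed-size grouping instead of the paper’s GMM-based soft clustering; `llm_summarize(text)` is a hypothetical summarisation call):

```python
# Rough sketch of a RAPTOR-style recursive summary tree (simplified: fixed-size
# grouping instead of the paper's GMM soft clustering). Leaf chunks and every
# level of summaries all get indexed, so retrieval can match at any level of
# abstraction. `llm_summarize(text)` is a hypothetical stand-in.

def build_summary_tree(chunks: list[str], llm_summarize, group_size: int = 4) -> list[str]:
    nodes = list(chunks)          # everything in this list gets embedded + indexed
    level = list(chunks)
    while len(level) > 1:
        parents = []
        for i in range(0, len(level), group_size):
            group = level[i:i + group_size]
            parents.append(llm_summarize("\n\n".join(group)))
        nodes.extend(parents)     # add this level's summaries to the index
        level = parents           # recurse: summarise the summaries
    # Broad queries tend to hit high-level summaries, narrow ones the leaf chunks.
    return nodes
```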
Leo @_leander30
Today I’m reading RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval.
Paper: arxiv.org/abs/2401.18059
One idea that really clicked for me: flat chunk retrieval is often not enough for long-document QA.
Leo @_leander30
All benchmark reports are saved in the repo if anyone wants to take a closer look @RivraDev: github.com/LEANDERANTONY/… Still working on it; I’ll keep sharing the architecture, failures, and fixes as I go. Would especially welcome thoughts from people running RAG systems in production @Arjunjain #RAG #AI
Leo @_leander30
The idea for the selector layer was this: the reranker selects good chunks, but ordering based purely on cross-encoder scores doesn’t account for what the generator actually needs to answer the specific question. The selector layer adds an LLM pass that looks at query intent and promotes the most directly answerable chunk to position 1.
Result:
- Context precision: 0.9036 → 0.9608
- Faithfulness: 0.9310 → 0.9657
The generator now sees the most query-relevant evidence first, which keeps answers tighter and better grounded.
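A minimal sketch of what that selector pass could look like (not the exact HelpmateAI code; `llm(prompt)` is a hypothetical judge call):

```python
# Minimal sketch of an LLM selector layer on top of a cross-encoder reranker
# (illustrative, not the exact HelpmateAI implementation). The reranker has
# already ordered chunks by relevance score; an LLM judge then picks the one
# chunk that most directly answers the question and promotes it to position 1.
# `llm(prompt)` is a hypothetical stand-in.

def select_and_promote(query: str, reranked_chunks: list[str], llm) -> list[str]:
    numbered = "\n".join(f"[{i}] {c}" for i, c in enumerate(reranked_chunks))
    reply = llm(
        "Which numbered chunk most directly answers the question? "
        "Reply with the number only.\n\n"
        f"Question: {query}\n\nChunks:\n{numbered}"
    )
    try:
        best = int(reply.strip().strip("[]"))
    except ValueError:
        return reranked_chunks            # judge failed: keep reranker order
    if not 0 <= best < len(reranked_chunks):
        return reranked_chunks
    # Promote the most directly answerable chunk; keep the rest in reranker order.
    return [reranked_chunks[best]] + [
        c for i, c in enumerate(reranked_chunks) if i != best
    ]
```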
Leo @_leander30
Yesterday I said HelpmateAI beats OpenAI File Search and Vectara. I was asked about the eval setup, so here’s exactly how I measure it.
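The short version: run the same question set through each system, collect (question, answer, retrieved contexts, reference answer) rows, and score them with RAGAS. A minimal run looks roughly like this (ragas 0.1-style API, so metric and column names may differ in newer versions; the single row below is a made-up placeholder, not real eval data):

```python
# Minimal sketch of a RAGAS evaluation run (ragas 0.1-style API; not the exact
# HelpmateAI harness). Each row holds the question, the system's answer, the
# retrieved contexts, and a reference answer. evaluate() calls an LLM judge
# under the hood, so an API key must be configured in the environment.
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import faithfulness, answer_relevancy, context_precision

rows = {
    # Placeholder example row, invented purely for illustration.
    "question": ["What does the policy cover for hospitalisation?"],
    "answer": ["Hospitalisation expenses are covered up to the sum insured."],
    "contexts": [["Section 3: hospitalisation expenses are covered up to the sum insured."]],
    "ground_truth": ["Hospitalisation expenses up to the sum insured."],
}

result = evaluate(
    Dataset.from_dict(rows),
    metrics=[faithfulness, answer_relevancy, context_precision],
)
print(result)  # per-metric scores, e.g. faithfulness / context_precision
```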
Leo @_leander30
@unbankedgroup it was pretty intense, still some ways to go
Mal @unbankedgroup
@_leander30 3 months shipping and you already learned what most people take a year to figure out: the RAG system is the foundation. everything else is a feature on top
Leo @_leander30
@mem0ai I faced the same issues when I built my RAG system. Switching to hybrid retrieval + cross-encoder reranking gave me much cleaner citations and answers backed by full evidence panels. A dedicated context layer feels like the right direction. Will explore how mem0 fits in.
Leo @_leander30
Hybrid retrieval + query-aware routing + cross-encoder reranking →
- Supported answer rate: 0.8026 → 0.8816
- Citation page-hit: 0.6974 → 0.8684
Outperforms on faithfulness, answer relevancy & context precision (tested on health-policy, thesis & research papers).
Full citation trails + raw evidence panels.
Built with Next.js + FastAPI + ChromaDB • Deployed on a VPS with Docker.
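A stripped-down sketch of the retrieve-then-rerank part (illustrative only, not the production code; assumes an existing ChromaDB collection, and uses rank_bm25 for the sparse side plus a sentence-transformers cross-encoder for reranking):

```python
# Stripped-down sketch of hybrid retrieval + cross-encoder reranking
# (illustrative, not the HelpmateAI implementation). Dense candidates come
# from an existing ChromaDB collection, sparse candidates from BM25 over the
# same chunk corpus, and a cross-encoder reorders the merged pool.
from rank_bm25 import BM25Okapi
from sentence_transformers import CrossEncoder

def hybrid_retrieve(query: str, collection, corpus: list[str],
                    k: int = 20, top_n: int = 5) -> list[str]:
    # Dense retrieval: ChromaDB embeds the query and returns the nearest chunks.
    dense = collection.query(query_texts=[query], n_results=k)["documents"][0]

    # Sparse retrieval: BM25 catches exact-term matches dense vectors can miss.
    # (Built per call here for brevity; in practice build it once at startup.)
    bm25 = BM25Okapi([doc.split() for doc in corpus])
    sparse = bm25.get_top_n(query.split(), corpus, n=k)

    # Merge and dedupe the candidate pool, preserving order.
    candidates = list(dict.fromkeys(dense + sparse))

    # Cross-encoder rerank: score each (query, chunk) pair jointly.
    reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")
    scores = reranker.predict([(query, c) for c in candidates])
    ranked = [c for _, c in sorted(zip(scores, candidates),
                                   key=lambda t: t[0], reverse=True)]
    return ranked[:top_n]
```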