Grisha

463 posts

Grisha banner
Grisha

Grisha

@grishahq

ML Engineer | RAG • Evals • Agents

Katılım Aralık 2011
593 Takip Edilen181 Takipçiler
Sabitlenmiş Tweet
Grisha
Grisha@grishahq·
🚀 400 stars on GitHub! Thank you all for supporting recursive-llm. Processing unbounded context with any LLM is the way forward! Appreciate every star and your feedback. 🔗 github.com/ysz/recursive-…
Grisha tweet media
Grisha@grishahq

How to process 1M+ token documents with any LLM without RAG? Open-source Python implementation of @a1zhang and @lateinteraction's RLM. Built an open-source Python implementation that works with 100+ LLM providers via LiteLLM. Ready to use with OpenAI, Anthropic, Ollama and more. 🔗 GitHub: github.com/ysz/recursive-… 📄 Paper: alexzhang13.github.io/blog/2025/rlm/

English
0
0
5
172
Grisha
Grisha@grishahq·
Where dense itself loses: 35 of 150 questions need evidence from 2-3 pages. Joint recall@10 is 11%. Top-k can't grab discrete pages at once. Query decomposition next. Caveat: FinanceBench has unusually severe lexical mismatch. On legal docs BM25 still adds 5-15% to hybrid.
English
0
0
0
19
Grisha
Grisha@grishahq·
Sharpest gap on metrics-generated questions (50 programmatic "FY2018 X for company Y" prompts): BM25 0% recall@10, dense 57%, hybrid 55%. Questions say "capital expenditure". Filings say "Purchases of property, plant and equipment". Lexical retrieval has nothing to match.
English
1
0
0
11
Grisha
Grisha@grishahq·
Ran a retrieval benchmark on FinanceBench. 150 finance QA pairs over 368 SEC filings, full corpus. Expected hybrid (BM25 + dense via RRF) to win. Pure dense beat it across every slice. Here is why.
Grisha tweet media
English
1
0
0
22
Grisha
Grisha@grishahq·
GraphRAG is not “better RAG”. It is a different tool. Hybrid RAG helps you find the right chunks. GraphRAG helps you find the right relationships. Start with Hybrid RAG. Add GraphRAG when the real problem is multi step context, dependencies, and connected entities. Most teams don’t need GraphRAG on day one.
Grisha tweet media
English
1
0
3
25
Kevin
Kevin@KevinPicchi·
Managed to find a list of 2500 VCs (They specialize in AI & SaaS) Never been a better time to get funded Want the list? Comment "VC" and LIKE this post. Will DM in 24 hours.
GIF
English
843
35
927
63K
Grisha
Grisha@grishahq·
For now it works for repos with 100+ stars - around 9K repos in the dataset.
English
0
0
0
9
Grisha
Grisha@grishahq·
GitHub doesn't show you similar repos when you're browsing. So I built a Chrome extension that does - adds a "Similar Repositories" panel right in the sidebar. Finds related repos by who stars them, how similar their descriptions are, and what topics they share. 🔗 github.com/ysz/gh-recomme…
English
1
0
1
43
Grisha
Grisha@grishahq·
@saipxai I checked it with some small provider. I wanted that exact one, but your name is great too.
English
1
0
1
24
Saipx
Saipx@saipxai·
@grishahq Which registrar did you use? Got antsintelligence*com in our portfolio.
English
1
0
0
27
Grisha
Grisha@grishahq·
Today I checked a .dev domain and it was available. Came back 3 hours later to register it - already taken. WHOIS shows it was registered today. Do registrars leak/search-snoop availability checks? Is that even legal?
Grisha tweet media
English
1
0
3
106
Amin
Amin@eCom_Amin·
BING ads are a $1M+ GOLDMINE to target 60yo millionaires with ZERO competition and after generating $1,000,000+ with BING i’m giving away EVERYTHING i know about it like + comment “BING” and i’ll shoot you 41-pages of ecom sauce about it (must be following + RT for priority access)
English
301
110
386
22.4K
Grisha
Grisha@grishahq·
I really enjoyed this paper, and when the first version was released about three months ago, I built my own implementation (400+ stars). I’ve updated it recently as well. You might find it interesting to play around with it. github.com/ysz/recursive-…
Derya Unutmaz, MD@DeryaTR_

I finally had a chance to read this paper. I am now convinced that Recursive Language Models (RLMs) are going to be the next big thing in AI advances! Attention is shifting toward very large context windows. Very impressive paper! Congrats to Alex who is a new PhD student at MIT.

English
0
0
4
91
Grisha
Grisha@grishahq·
5/ ~6K lines of Python. No open ports. Six security layers by default. A bit more friction, zero supply chain risk. github.com/ysz/nanoclaw
English
1
0
1
64
Grisha
Grisha@grishahq·
1Password just exposed how OpenClaw's skill registry became a malware vector. Central marketplace + auto-installs = poisoned supply chain. I built nanoClaw with the opposite architecture. Here's why these attacks don't work on it 🧵
Grisha tweet media
English
2
1
4
631
Grisha
Grisha@grishahq·
📦 Custom skill files are checked for ownership and permissions before loading. Background tasks are capped to prevent resource exhaustion. If you hand an agent real tools, secure the tools first.
English
0
0
0
29
Grisha
Grisha@grishahq·
🔑 Dashboard API now requires auth. Every audit log entry is HMAC-signed so nobody can quietly rewrite history.
English
1
0
0
34
Grisha
Grisha@grishahq·
Spent tonight hardening nanoClaw's security 🔒 When you give an AI agent access to files, shell, and the web you better make sure it can't turn them against you. Here's what I locked down 👇
Grisha@grishahq

nanoClaw is live. ❌OpenClaw: 430k lines, hours of setup, security holes ⚡️nanoClaw: 3k lines, 1 min setup, sandboxed by default Same capabilities: - Telegram 24/7 - Claude / GPT-5 / DeepSeek - Web search, shell, memory - Cron jobs Open source. MIT license. github.com/ysz/nanoClaw

English
1
0
0
97