69 posts

us

@mldev_

ml dev "The phenomenon of consciousness cannot be accommodated within a computational framework." – Roger Penrose

in the middle of desert · Joined August 2024

45 Following · 2 Followers

us@mldev_·7h

@IndieDevHailey just released github.com/us/crw: 6 MB instead of 1 GB+ memory like Firecrawl! faster and more efficient, especially for local projects, local agents, and MCP

开发者Hailey@IndieDevHailey·9 Mar

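Firecrawl, a GitHub project that has been very popular among developers lately, is a smart crawler built specifically for AI, and it already has 70K+ stars. In one sentence: it can turn any website directly into data that AI can use. Just give it a URL and it will automatically: - crawl every page of the site - clean the page content - parse structural information - output Markdown / JSON. In other words: website → structured data → fed straight to an LLM. The data pipeline in many AI projects today is in fact: website → Firecrawl → vector database → RAG → AI application. If you are building: - AI agents - RAG knowledge bases - automated data collection, this tool is basically infrastructure for AI development.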

389

1.5K

136.5K

us@mldev_·7h

@TheGeorgePu you can use the open-source 6 MB crawler github.com/us/crw for crawling!

George Pu@TheGeorgePu·4d

Perplexity charges $20-200/month for AI search. You can build the same thing on a single Mac Mini sitting on your desk. $2,500. Once. Open-source LLM. Open-source crawler. Open-source RAG. Then it's yours. Forever. The AI industry is building a rental economy. You don't have to participate.

102

10K

us@mldev_·20h

crw v0.2.2 is out.
– automatic JS/headless fallback for blocked pages
– PDF extraction
– smoother binary downloads
no config, no flags. it just works.
#free to try (500 credits): fastcrw.com
#opensource: github.com/us/crw
#openai #mcp #fastcrawler #ai

us@mldev_·2d

implementation of turboquant, pretty impressive github.com/TheTom/turboqu…

us@mldev_·2d

Anthropic: AI will kill dev jobs. Now that Claude Code is publicly available, we’re starting to see the real downside of “vibe coding” 😛 This is exactly the kind of risk people have been warning about.

Chaofan Shou@Fried_rice

Claude code source code has been leaked via a map file in their npm registry! Code: …a8527898604c1bbb12468b1581d95e.r2.dev/src.zip

us reposted

Jianyang Gao@gaoj0017·6d

The TurboQuant paper (ICLR 2026) contains serious issues in how it describes RaBitQ, including incorrect technical claims and misleading theory/experiment comparisons. We flagged these issues to the authors before submission. They acknowledged them, but chose not to fix them. The paper was later accepted and widely promoted by Google, reaching tens of millions of views. We’re speaking up now because once a misleading narrative spreads, it becomes much harder to correct. We’ve written a public comment on openreview (openreview.net/forum?id=tO3AS…). We would greatly appreciate your attention and help in sharing it.

Google Research@GoogleResearch

Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI

976

6.5K

992K

us@mldev_·26 Mar

@gregisenberg check fastcrw.com: 10x faster and 100x lighter than Firecrawl, a 6.8 MB binary written in Rust, and open source: GitHub.com/us/crw

GREG ISENBERG@gregisenberg·24 Mar

how to use firecrawl to give your AI eyes and actually build startups that outperform 99% of apps:
1. your AI is smart but blind. it can't go to a website, read a page, or grab data on its own. firecrawl fixes that. you put in a URL. you get back clean markdown, structured JSON, screenshots. feed it to any model.
2. three lines of code. that's it. no proxies. no anti-bot detection. no custom scrapers that break when a site changes. one API call. clean data back in seconds. works on 98%+ of sites.
3. firecrawl has six core capabilities: scrape a single page. crawl an entire site. map all URLs on a domain. search google and return full content. an agent endpoint where you describe what you want and it goes and finds it. and a browser sandbox where AI controls a real browser like filling forms, clicking buttons, handles logins.
4. the agent endpoint is wild. you can say "find all of YC's winter 24 dev tool companies and their founders and emails" and get back structured data. or "compare pricing tiers across stripe, square, and paypal" and get a side-by-side table.
5. the browser sandbox lets your AI stay logged in across sessions, navigate pagination, watch live as it browses. this is computer use without building the infrastructure yourself.
6. think of it in layers. every builder needs: an agent harness (claude code, cursor, codex), a search layer (perplexity, exa), a web data layer (firecrawl), an ops brain (obsidian, notion), and an outbound stack. the web data layer is the one most people are sleeping on.
7. this is the AWS moment for web data. in 2006 building a web app meant buying servers and managing racks. AWS said one API call, use our servers. some of the biggest companies of the last decade were built on that. firecrawl is doing the same thing for web data in 2026.
8. the framework i'd use for coming up with startup ideas building with clean data: take a massive horizontal platform. rebuild it for one niche using firecrawl. the vertical version always wins because people want specific, not generic. price for outcome.
9. a year ago firecrawl posted a job listing that said "please only apply if you're an AI agent." content creator agents. customer support agents. junior dev agents. it looked weird. it was a signal for where this is all going.
the people who understand how to get clean web data, wrap it around an LLM, and package it as a product are the ones with a 12-month head start. i use @firecrawl with @ideabrowser. once you see what's possible with structured web data, you can't unsee it. episode is live on @startupideaspod (full breakdown there) i tried to explain this as clear as possible for even the non technical. send it to a builder friend.

709

129.8K

us@mldev_·26 Mar

@GithubProjects 5-hour limits :p

GitHub Projects Community@GithubProjects·26 Mar

What's stopping YOU from CODING like THIS?

320

27K

us reposted

klöss@kloss_xyz·25 Mar

the engineers who named the algorithm

Google Research@GoogleResearch

151

2.5K

283.6K

us@mldev_·25 Mar

Ref: @garrytan Full Blog: garryslist.org/posts/boil-the…

us@mldev_·25 Mar

..Why can’t a startup deliver a service that is 100x better than the incumbent? Why can’t we have fusion energy? Why can’t we talk to every single user and have a perfect understanding of every bug in our product?...

us@mldev_·25 Mar

Garry captures the #AI moment well: don’t fear cheaper sameness; use AI to build what once seemed impossible, 100x better. Here are some quotes from his `Boil the Ocean` blog:

us@mldev_·22 Mar

@NainsiDwiv50980 there was a Shannon project similar to yours, and I released a version that works directly with Claude Code. check it out, it can be applied to that project too, since most devs are avoiding huge AI credit costs: github.com/us/shannon-on-…

Nainsi Dwivedi@NainsiDwiv50980·22 Mar

Penetration testers are expensive. Good ones charge $200–$500/hr. And 90% of what they do in the first 48 hours is completely automatable. That's the bet behind PentAGI, and it just hit #1 trending on GitHub.
Here's what makes it different from every other "AI security tool": Most AI security tools are wrappers. They take a model, give it a nmap command, and call it an agent. PentAGI is actually architected like a real red team:
→ A primary agent that plans the engagement
→ Specialist sub-agents for research, coding, and infra tasks
→ A memory system (episodic, semantic, long-term) so it LEARNS across sessions
→ A knowledge graph (Neo4j + Graphiti) tracking relationships between targets, tools, and vulnerabilities
→ 20+ professional pentesting tools baked in: nmap, metasploit, sqlmap, and more
→ A sandboxed Docker environment: untrusted code never touches your host
The architecture insight is underrated: Real penetration tests fail not because hackers lack tools; they fail because of lost context. An analyst runs a scan, finds something interesting, pivots to a different thread, and 4 hours later can't remember what they were following. PentAGI stores everything. Every command, every output, every successful technique, indexed in PostgreSQL with pgvector. The knowledge graph makes semantic connections across sessions. It gets smarter the more you use it.
The model flexibility is also quietly impressive. Supports OpenAI, Anthropic, Gemini, AWS Bedrock, DeepSeek, Ollama (local), Qwen, Kimi, GLM, and for the privacy-obsessed, you can run it 100% offline with a local vLLM stack. Their benchmark: 13,000 tokens/second prompt processing on 4× RTX 5090s. 12+ concurrent testing flows. Zero cloud dependency.
This is what "AI-native" security tooling actually looks like. Repo in comments 👇 (if you work in AppSec, red teams, or bug bounty, this belongs in your stack)

102

us@mldev_·18 Mar

@ghumare64 I feel gemini 🤞

1.2K

Rohit Ghumare@ghumare64·18 Mar

Learn DevOps by playing games 🎮
1. Kubernetes K8sgames.com
2. DevOps devops.games
3. Linux overthewire.org
4. Git ohmygit.org
5. Python tynker.com
6. 25+ programming languages codingame.com

815

375.7K

us@mldev_·18 Mar

said one of the engineers, again.. even though there is no single clear definition of art, engineers always say that. i think it comes from an inferiority complex

Mustafa@oprydai

any sufficiently advanced engineering is indistinguishable from art

us@mldev_·15 Mar

@nalinrajput23 red dot mouse and images from space in nasa rockets jfjfj

Nalin@nalinrajput23·15 Mar

What is the reason behind ThinkPad’s popularity among engineers?

627

366

9.9K

1.8M

us@mldev_·15 Mar

sometimes you don't need advanced runners. I have 2 projects on a $4 VPS and built a Go tool of only 500 lines to auto-deploy on git push. wrote about the whole experience on my blog: us.github.io/blog/?post=dep… code: github.com/us/deploq #golang #selfhosted #devops #developer

us@mldev_·14 Mar

@MendyOK @browser_use you need to run it locally, like Browser Use does! otherwise you need a proxy! you can check v0.0.11

Mendy@MendyOK·13 Mar

@mldev_ @browser_use Yeah, right...

Browser Use@browser_use·13 Mar

/crawl doesn't work on Cloudflare-protected sites. Browser Use can crawl and scrape ANY site.

Tuki@TukiFromKL

🚨 Stop scrolling. This is the biggest betrayal in tech this year. The company that built its entire reputation on BLOCKING scrapers just shipped the most powerful scraping tool ever made. > Cloudflare just dropped a /crawl endpoint. One API call and you get an entire website back. Clean HTML, Markdown, or JSON. That's it. That's the whole thing. Let me break it down. For years, Cloudflare sold anti-bot protection. Companies paid them to STOP crawlers. Now those same companies are watching Cloudflare hand everyone a free crawler that bypasses… other people's anti-bot protection. They didn't switch sides. They're playing both sides. And getting paid twice.

645

141.6K
