Mike Pollard

146 posts

@mikepollard_dev

building in SF @inference_net

San Francisco, United States · Joined July 2011
409 Following · 89 Followers
Mike Pollard retweeted
Sam Hogan 🇺🇸 @samhogan
We're releasing Schematron V2, a family of Specialized Language Models for converting messy HTML to structured JSON: frontier performance at 1/10th the cost. Schematron V2 was designed in partnership with some of the largest web-scraping companies in the world to meet the demands of their heaviest workloads. Schematron-V2-Turbo and Schematron-V2-Small are available today on @inference_net. Get started: docs.inference.net/workhorse-mode…
Sam Hogan 🇺🇸 @samhogan

I found out today that two of the largest web scraping companies in the world are using a custom Llama 3 model we released last year to process millions of webpages per day. Schematron-3b: HTML -> JSON parsing. Frontier quality at dirt-cheap prices. huggingface.co/inference-net/…

6
8
72
13.3K
Abhijit @abhijitwt
Amazon uses Rust. Microsoft uses Rust. Google uses Rust. Cloudflare uses Rust. Discord uses Rust. Dropbox uses Rust. Figma uses Rust. Solana uses Rust. Polkadot uses Rust. NEAR uses Rust. Aptos uses Rust. Sui uses Rust. What’s stopping you from learning Rust?
80
18
360
155.4K
Nico Cosmic 🚀 @NicoCosmic
@Anubhavhing This doesn’t evade Cloudflare’s bot management or WAF rules. It’s a transparent, ethical crawler. A simple robots.txt can block this.
2
0
5
1.3K
Anubhav @Anubhavhing
Crawling an entire website used to take: A Python script. Playwright or Selenium. Proxy rotation. Rate limiting logic. Error handling. 3 hours of debugging why page 47 returned a 403. Now it's one API call. Every web scraping startup that raised millions to solve this problem just became a single endpoint. Every freelancer charging $500 to "extract website data" just lost their entire business model to a /crawl command. HTML. Markdown. JSON. Pick your format. No scripts. No browser. No headache. The entire web scraping industry just got reduced to one line of code. Someone is going to use this to clone every competitor's website by Friday. 💀
Cloudflare Developers @CloudflareDev

Introducing the new /crawl endpoint - one API call and an entire site crawled. No scripts. No browser management. Just the content in HTML, Markdown, or JSON.

227
496
8.9K
2M
Mike Pollard @mikepollard_dev
@Anubhavhing Well, this was already a thing with Firecrawl and nothing huge happened, so I don’t imagine this would be much different. Cloudflare also self-identifies as a bot, so it can get blocked more easily. Schematron-3b is the best model for extracting good page data for the cost.
0
0
0
233
Mike Pollard retweeted
Inference @inference_net
Day Zero fine-tuning & hosting support for Nemotron 3 Super by @nvidia is now live. Fine-tune on real production traces & deploy on high-performance infrastructure optimized for Nemotron 3 Super. Your data, your weights, your performance edge. Learn more: inference.net/blog/nemotron-…
7
6
25
7.8K
Mike Pollard retweeted
Sam Hogan 🇺🇸 @samhogan
What if a codebase was actually stored in Postgres and agents directly modified files by reading/writing to the DB? Code velocity has increased 3-5x. This will undoubtedly continue. PR review has already become a bottleneck for high output teams. Codebase checked-out on filesystem seems like a terrible primitive when you have 10-100-1000 agents writing code. Code is now high velocity data and should be modeled as such. Bare minimum, we need write-level atomicity and better coordination across agents, better synchronization primitives for subscribing to codebase state changes, and real-time file-level code lint/fmt/review. The current ~20 year old paradigm of git checkout/branch/push/pr/review/rebase ended Jan 2026. We need an entirely new foundational system for writing code if we’re really going to keep pace with scale laws.
467
104
2.1K
942.2K
Mike Pollard @mikepollard_dev
@solofounders It’s literally in the name: solo founder. Most decided to do this solo to avoid the headache of hiring and partners.
1
0
1
69
Solo Founders @solofounders
Solo founders have double the equity to recruit with, but almost none of them are using it. We suspect this will shift as solo founders understand this gives them a hiring advantage.
10
4
76
31.9K
Mike Pollard retweeted
Amar Singh @AmarSVS
Introducing our new Schematron benchmark. We took some time to compare all of the latest open source models to see which one takes the crown. The benchmark essentially measures the ability of LLMs to take raw HTML along with a JSON schema, and then fill out that schema. We measure things like recall/precision, hallucinations, and ability to handle ambiguity. The benchmarks are graded with an ensemble of frontier models on a 5-point rubric. We can see that GLM 5 is currently the best open source model for schema extraction. Surprisingly, GPT-OSS 120B does very well at these types of extraction tasks as well. Another interesting result: we noticed a degradation in quality when using Qwen3.5 Plus on this task versus the original Qwen3.5 397B MOE. The inputs can be up to 120K tokens, so this is akin to a long context benchmark, with an additional reasoning layer. We will be open sourcing this benchmark if it gains sufficient traction. Also, more benchmarks coming from our side!
5
4
15
7.6K
Miles Deutscher @milesdeutscher
bro. You literally CAN’T be lazy right now. This is your competition: HUNDREDS of AI agents working autonomously at once (thousands in revenue btw). Lock tf in.
490
266
2.9K
486.7K
Michael @michael_chomsky
So sorry to anyone I’m leaving on unread. I’ll get to everyone eventually. My schedule looks like this for the time being. Focusing on providing a world-class experience to existing customers. This OpenClaw thing is insane and demand is almost more than I can handle (but I’ll handle it. hiring a security/IT expert) (for legal reasons had to cover specific names and links)
16
0
39
6.7K
Mike Pollard retweeted
Naval @naval
Vibe coding is the new product management. Training and tuning models is the new coding.
1K
2.1K
20.9K
1.2M
Mike Pollard retweeted
Sam Hogan 🇺🇸 @samhogan
We're welcoming @mikepollard_dev to @inference_net as our Founding DevRel Engineer! Mike and I won a pitch competition for my first company nearly 7 years ago. Life is long. When you find someone you love to work with, keep them close. You never know when your paths may cross.
8
3
45
4.3K
Sam Hogan 🇺🇸 @samhogan
Today I’m incredibly excited to announce that @AmarSVS has joined me and @atbeme as a co-founder of @inference_net. Anyone who has worked with Amar knows he is an N=1 type of guy. His energy, raw horsepower, and dedication have allowed us to unlock exciting new opportunities and inspired the whole team. I look forward to many more years of partnership, ping pong, and late nights in the office.
23
29
113
18.2K
Mike Pollard @mikepollard_dev
@netflix Great, can’t wait for every movie to feel like it was co-written by a brand safety team. Shows will become even more corporate slop and artistry in cinema will die for fear of less profit.
0
0
0
17
Netflix @netflix
Today, Netflix announced our acquisition of Warner Bros. Together, we’ll define the next century of storytelling, creating an extraordinary entertainment offering for audiences everywhere. about.netflix.com/en/news/netfli…
26.4K
39.8K
278.9K
103.6M