Dylan Huang

247 posts

Dylan Huang

@dphuang2

Thinky. Passionate nerd. Developer experience.

Katılım Mayıs 2019

70 Takip Edilen45 Takipçiler

Dylan Huang@dphuang2·1h

@ethanelasky DMs open!

English

Ethan Elasky@ethanelasky·1h

@dphuang2 can you open dms, I have some questions on future tinker capabilities and don't wanna invest in a lot of infra if you guys are gonna put more infra out soon

English

Dylan Huang@dphuang2·20 Mar

this is actually crazy

Tinker@tinkerapi

Mantic used Tinker to RL gpt-oss-120b on judgmental forecasting; the result outperformed frontier models on event predictions. Combined with @_Mantic_AI's forecasting architecture, task-specific training takes us to the cusp of automated superforecasting.

English

Dylan Huang@dphuang2·16 Mar

@soumithchintala I can't tell if this is AI or not, I even googled it 😅

English

2.5K

Soumith Chintala@soumithchintala·16 Mar

someone's getting started early!

English

169

3.6K

105.4K

Dylan Huang retweetledi

Tinker@tinkerapi·13 Mar

This week’s projects expand the horizons of training in Tinker, from massive libraries of RL environments to new methods of training agents and models that budget their own thinking.

English

101

Dylan Huang@dphuang2·12 Mar

@karinanguyen congrats on releasing PostTrainBench! Have you guys thought about using Tinker in the harness? I imagine it would remove some of the setup/infra debugging work the agents need to do.

English

726

Dylan Huang retweetledi

Karina Nguyen@karinanguyen·11 Mar

Excited to release PostTrainBench v1.0! This benchmark evaluates the ability of frontier AI agents to post-train language models in a simplified setting. We believe this is a first step toward tracking progress in recursive self-improvement 🧵:

English

668

142.1K

Dylan Huang retweetledi

Thinking Machines@thinkymachines·10 Mar

We are partnering with @nvidia to power our frontier model training and platforms delivering customizable AI. thinkingmachines.ai/news/nvidia-pa…

English

100

167

2.4K

566.4K

Dylan Huang@dphuang2·7 Mar

@a_levitator what do the internal numbers say

English

977

Alex L@a_levitator·7 Mar

Ramp inspect might be the greatest internal tool any tech company has ever made

English

17.1K

Dylan Huang@dphuang2·7 Mar

@HanchungLee I think startups that don’t figure this out might become undifferentiated, and ultimately die

English

Han@HanchungLee·7 Mar

the model is the product.

Tanay Jaipuria@tanayj

Good piece on the "war time" at Cursor. Some interesting quotes: - The company’s new mandate was labeled “P0 #1”—priority zero: “Build the best coding model.” - Cursor estimated last year that a $200-per-month Claude Code subscription could use up to $2,000 in compute, suggesting significant subsidization by Anthropic. Today, that subsidization appears to be even more aggressive, with that $200 plan able to consume about $5,000 in compute, according to a different person who has seen analyses on the company’s compute spend patterns. forbes.com/sites/annatong…

English

1.9K

Dylan Huang@dphuang2·7 Mar

@alvinsng @FactoryAI engineers should be spending more time allowing agents to iteratively verify their work 🙂‍↕️

English

1.7K

Alvin Sng@alvinsng·7 Mar

At @FactoryAI, every PR triggers 40+ CI checks, all finishing in under 6 minutes. Our automated guardrails are so fast and comprehensive that you can "merge recklessly". This is agent-native development

GIF

English

629

130K

Dylan Huang retweetledi

Tinker@tinkerapi·6 Mar

Contextual AI used Tinker to post-train the planning behavior for a search agent. They land on a two-stage training recipe: On-Policy Distillation and GRPO with a CLP reward. Read more 👇

Abdallah Bashir@abdallah197_

Search agents, whether they're powering deep research, or multi-step QA over a private corpus, spend most of their time and compute in the research loop: query, search, reason, repeat. We wanted to make that loop faster and more accurate. So we optimized two things jointly: the retrieval stack itself, and the planner that decides when and how to search. A trained planner on our fastest retrieval config matches an untrained planner on the most expensive one, at half the latency. Every arrow in this plot points up and to the left. [1/n]

English

181

57K

Dylan Huang@dphuang2·26 Şub

@xn1cklas @QuiverAI @joanrod_ai Can’t wait to try it out in more projects 🙌

English

nicklas@xn1cklas·25 Şub

@dphuang2 @QuiverAI @joanrod_ai wow this is really cool, especially for such a "simple" prompt!

English

Dylan Huang@dphuang2·25 Şub

@QuiverAI is amazing... These logos took a couple minutes to create. I've played around with frontier models to generate SVGs, nothing compares. Great job @joanrod_ai and team! Looking forward to Arrow-2.0.

English

112

Dylan Huang retweetledi

Joan Rodriguez@joanrod_ai·25 Şub

Introducing @QuiverAI, a new AI lab and product company focused on frontier vector design. We’ve raised an $8.3M seed round led by @a16z, with support from amazing angels and investors. Our first model, Arrow-1.0, generates SVGs from images and text. It’s available now in public beta at app.quiver.ai

English

305

292

4.8K

1.3M

Dylan Huang@dphuang2·25 Şub

@noahzweben new attack vector acquired

English

Noah Zweben@noahzweben·24 Şub

Announcing a new Claude Code feature: Remote Control. It's rolling out now to Max users in research preview. Try it with /remote-control Start local sessions from the terminal, then continue them from your phone. Take a walk, see the sun, walk your dog without losing your flow.

English

1.5K

1.3K

17K

4.5M

Dylan Huang@dphuang2·24 Şub

@stephenhaney why not just have agents directly write code and hotload the results in the browser? Is the Paper DSL less confusing for agents? Is it a better workflow for designers? Just curious about your insights.

English

Stephen Haney@stephenhaney·24 Şub

Hello! Today we're releasing Paper Desktop Paper is now a canvas for Cursor, Claude Code, Codex. Any agent can read and write html to Paper. • push or pull from your codebase • pull real data from anywhere • less work, more design What will you ship? Sound on 🎶

English

348

406

5.8K

1.6M

Dylan Huang@dphuang2·7 Şub

@jaltma AI infra or APIs that agents use

English

Jack Altman@jaltma·6 Şub

The current consensus view is saas is dead...presuming that's right, the next interesting next question is What companies are "safe from ai"? - handling money, regulation - agents on top of company data - most hardware? - maybe systems of record? - security? - marketplaces?

English

307

712

178.8K

Dylan Huang@dphuang2·6 Şub

@VihaarNandigala Congrats!!

English

Vihaar Nandigala@VihaarNandigala·6 Şub

We just raised a $5.3M seed round for Orange Slice, co-led by 1984 Capital and Moxxie Ventures, with participation from angels like Paul Graham. We’re building AI agents, inside a spreadsheet, that help sales teams find companies that already want to buy. The reality is most sales teams don’t struggle with effort - they struggle with timing. Reps spend huge amounts of time working static lists and broad targeting, chasing leads that were never going to convert. That creates noise, low reply rates, and wasted cycles. Top companies like Ramp solve this with dedicated growth engineers building internal data workflows. We’re making that same capability accessible to everyone else. At its core, the challenge is simple: finding customers who already have the problem you solve. Orange Slice turns the spreadsheet into a system for discovering buying signals - agents research company sites, news, social signals, and niche sources like court records or building permits, then structure that information directly into columns teams can act on. Not “might be a fit.” But “likely in-market.” So instead of guessing who to target, teams build and refine living lists of high-intent accounts inside a sheet. Still early. Still learning. But we’re excited to keep building. Kishan and I met sophomore year on a Bollywood dance team at Michigan — and I couldn’t ask for a better co-founder. Grateful to our team, customers, and investors for believing in this vision.

English

677

68.9K

Dylan Huang@dphuang2·6 Şub

@shensi @merge_api @t_patterson @prit4k LMAO

761

Shensi Ding@shensi·6 Şub

It takes years to build a reputation and seconds to destroy it. And last night, we destroyed @merge_api's. Christian McCaffrey and Olivia Culpo sat next to our exec team during our board dinner. Our CRO @t_patterson is a huge football fan, and asked @prit4k to sneak a picture for him. Pritak did.. but he left his flash on. Even worse, this is the picture he took.

English

30.8K

Dylan Huang retweetledi

Dmytro Dzhulgakov@dzhulgakov·27 Oca

🌕 Kimi K2.5 = open SOTA reasoning + vision + 256K context + agentic coding 🏎 200+ t/s on @FireworksAI_HQ (soon even faster) ✅ Nails @simonw's "pelican on a bike" test in both directions Try it now on Fireworks and hats off to @Kimi_Moonshot

English

7.5K

Keşfet

@ethanelasky @soumithchintala @karinanguyen @nvidia @a_levitator @HanchungLee @alvinsng @FactoryAI