Jason Toevs
1.9K posts

Jason Toevs
@JasonToevs
CTO at UP 🤝 prev: built AI for Adobe, NBC & PGA Tour → | 🏆 2 Exits, 44 Fails 🪦 | Ethical AI + Farm Roots 🌾 | Helping leaders align tech with values.







A new AI lab built from the ground up around automated research. Led by the legend Jerry Tworek, former VP of Research at OpenAI, where he led the development of reasoning models. Absolutely incredible founding team. Heavy hitters out of OAI, Anthropic, DeepMind. Core idea: scaling models, data, and static deployment won't get us to the promised land. We need something different: new learning algos, new architectures, and systems that automate the process of building itself. As they say, they're "pursuing new learning algorithms that supersede large-scale pretraining and reinforcement learning, and architectures that scale better than transformers." An AI to build AI.

Today, we’re open-sourcing the draft specification for DESIGN.md, so it can be used across any tool or platform. We’re also adding new capabilities. DESIGN.md lets you easily export and import your design rules from project to project. Instead of guessing intent, agents know exactly what a color is for and can even validate their choices against WCAG accessibility rules. Watch David East break down this shared visual language in action👇. New capabilities and links in 🧵

SpaceXAI and @cursor_ai are now working closely together to create the world’s best coding and knowledge work AI. The combination of Cursor’s leading product and distribution to expert software engineers with SpaceX’s million H100 equivalent Colossus training supercomputer will allow us to build the world’s most useful models. Cursor has also given SpaceX the right to acquire Cursor later this year for $60 billion or pay $10 billion for our work together.








Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python(86.7), Math Vision w/ python (93.2) What's new: 🔹Long-horizon coding - 4,000+ tool calls, over 12 hours of continuous execution, with generalization across languages (Rust, Go, Python) and tasks (frontend, devops, perf optimization). 🔹Motion-rich frontend - Videos in hero sections, WebGL shaders, GSAP + Framer Motion, Three.js 3D. 🔹Agent Swarms, elevated - 300 parallel sub-agents × 4,000 steps per run (up from K2.5's 100 / 1,500). One prompt, 100+ files. 🔹Proactive Agents - K2.6 model powers OpenClaw, Hermes Agent, etc for 24/7 autonomous ops. 🔹Claw Groups (research preview) - bring your own agents, command your friends', bots & humans in the loop. - K2.6 is now live on kimi.com in chat mode and agent mode. For production-grade coding, pair K2.6 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blog/kimi-k2-6 🔗 Weights & code: huggingface.co/moonshotai/Kim…

Here's my update to the broader community about the ongoing incident investigation. I want to give you the rundown of the situation directly. A Vercel employee got compromised via the breach of an AI platform customer called Context.ai that he was using. The details are being fully investigated. Through a series of maneuvers that escalated from our colleague’s compromised Vercel Google Workspace account, the attacker got further access to Vercel environments. Vercel stores all customer environment variables fully encrypted at rest. We have numerous defense-in-depth mechanisms to protect core systems and customer data. We do have a capability however to designate environment variables as “non-sensitive”. Unfortunately, the attacker got further access through their enumeration. We believe the attacking group to be highly sophisticated and, I strongly suspect, significantly accelerated by AI. They moved with surprising velocity and in-depth understanding of Vercel. At the moment, we believe the number of customers with security impact to be quite limited. We’ve reached out with utmost priority to the ones we have concerns about. All of our focus right now is on investigation, communication to customers, enhancement of security measures, and sanitization of our environments. We’ve deployed extensive protection measures and monitoring. We’ve analyzed our supply chain, ensuring Next.js, Turbopack, and our many open source projects remain safe for our community. The recommendation for all Vercel customers is to follow the Security Bulletin closely (vercel.com/kb/bulletin/ve…). My advice to everyone is to follow the best practices of security response: secret rotation, monitoring access to your Vercel environments and linked services, and ensuring the proper use of the sensitive env variables feature. In response to this, and to aid in the improvement of all of our customers’ security postures, we’ve already rolled out new capabilities in the dashboard, including an overview page of environment variables, and a better user interface for sensitive env var creation and management. As always, I’m totally open to your feedback. We’re working with elite cybersecurity firms, industry peers, and law enforcement. We’ve reached out to Context to assist in understanding the full scale of the incident, in an effort to protect other organizations and the broader internet. I also want to thank the Google Mandiant team for their active engagement and assistance. It’s my mission to turn this attack into the most formidable security response imaginable. It’s always been a top priority for me. Vercel employs some of the most dedicated security researchers and security-minded engineers in the world. I commit to keeping you updated and rolling out extensive improvements and defenses so you, our customers and community, can have the peace of mind that Vercel always has your back.

There’s $1T up for grabs for agent-first startups and this window is WIDE open. Probably 10,000+ niches. How it plays out: 1. Every SaaS company follows salesforce and goes headless within 18 months 2. a new category of "agent-native" startups emerges that treat salesforce, HubSpot, workday etc as dumb backends. the startup IS the agent. the SaaS is just the database. 3. the entire consulting/services industry around enterprise SaaS gets compressed into software. the agent replaces the implementation team. 4. outcome-based pricing becomes default. nobody pays per seat when the "seat" is an agent making 10,000 API calls a minute. you pay when revenue hits your account. 5. the winning founders are ex-operators who understand a vertical workflow cold. the code is the easy part. knowing that a property manager spends 14 hours a week on lease renewals? that's the insight worth $100M. 6. distribution becomes the moat. when anyone can wire agents to APIs, the company with the audience and the brand wins. media + agents is the new SaaS. There’s a rush to incubate live/short form shows. 7. Silicon Valley goes all influencer. Roy lee gets this. Pat Walls gets this. Sam Parr gets this. 8. the first $1B agent-native company in each vertical will look nothing like the SaaS it replaced. smaller team, higher margins, no implementation cost, no churn from bad UX because there is no UX. the fastest path to wealth right now: find an industry that still runs on dashboards, phone calls, and spreadsheets. build the agent-native version. charge per outcome. own the workflow end-to-end. someone reading this right now is going to build a $100M company off this exact shift. tell me about it on the @startupideaspod when you do. Im rooting for you. Less reading, less bookmarking, more building. the last wave rewarded people who built pretty interfaces on top of ugly data. I think this wave rewards people who build smart agents on top of exposed APIs. Or who just build the APIs themselves Here we go
