Diego
29 posts

Diego
@diegocaples
Founder @ Markov Robotics






About 10% of America's life expectancy shortfall is due to motor vehicle accidents—fixed by Waymo and Tesla. About 18% is overdoses—that's fading now. And the lion's share of the rest is obesity-related. America will soon be buying its way out of its poor life expectancy issue.





New Short Course: Building AI Browser Agents! Learn how to build AI agents that interact and take actions on websites in this course, created in partnership with @agi_inc and taught by @divgarg and @namangarg0, Co-founders of AGI Inc. AI browser agents can log into websites, fill out forms, click through web pages, or even place orders online for you. They use both visual information, like screenshots, and structural data, like the HTML or Document Object Model (DOM) of a web page, to reason and take action. With the complexity of webpages and multiple possible actions at each step, it can be challenging for an AI browser agent to complete an assigned task. Because these agents run long action sequences, a single error—like clicking the wrong button or misreading a field—can lead to unexpected outcomes or errors that compound over time. In this course, you'll understand how autonomous web agents work, their current limitations, and how AgentQ enables them to improve through self-correction. In detail, you'll: - Learn what web agents are, how they automate tasks online, their architecture, key components, limitations, and an overview of their decision-making strategies. - Build a web agent that can scrape DeepLearning.AI's website and return course recommendations in a structured output format. - Build an autonomous web agent that can execute multiple tasks, such as finding and summarizing webpages, filling out a form, and signing up for a newsletter. - Explore AgentQ, a framework that enables agents to self-correct by combining Monte Carlo Tree Search (MCTS), a self-critique mechanism for continuous improvement, and Direct Preference Optimization (DPO). - Deep dive into MCTS, learn how it finds an effective path, illustrated by an example of Gridworld animation, and use AgentQ to complete web tasks. - Understand AI agents' current state and future directions—including key factors shaping their evolution, such as hardware, algorithm innovation, and data availability. By the end of this course, you will have hands-on experience building browser agents and a deeper understanding of how to make them more robust and reliable. Please sign up here: deeplearning.ai/short-courses/…


AI agents that can browse the web, fill out forms, and even place online orders are no longer just research demos—they’re being built today. But real-world websites are complex. Layouts change. Popups appear. And one wrong click can cascade into booking the wrong flight or buying the wrong product. In our new course, Building AI Browser Agents, made in collaboration with @agi_inc, you’ll learn how to build web agents and how to make them more reliable using AgentQ, a framework that helps agents self-correct. Guided by instructors @divgarg and @namangarg0, you’ll build agents step-by-step: from scraping and summarizing, to signing up for newsletters, to navigating the open web and choosing optimal actions. 👉 Learn for free: hubs.la/Q03hDWK10

1/ AutoMCP 🥇 1st Place ToolMaster RL - Training open-source LLMs to excel with MCPs through reinforcement learning. This project creates an environment where models learn tool usage through trial and error rather than prompt engineering. "Reinforcement Learning is All You Need" for transforming mediocre open-source models into tool-using experts that rival closed-source alternatives. Diego Caples, @diegocaples Thomas Joshi, @thomastjoshi Meghna Natraj, @NatrajMeghna Xiangyi Li, @xdotli
















