Diego Francisco Valenzuela Iturra

7.5K posts

@diegovogeid

Coffee, Music and Deep Learning

Santiago, Chile · Joined March 2018

1.5K Following · 165 Followers

Diego Francisco Valenzuela Iturra retweeted

Garry Tan@garrytan·1d

Today is the deadline to apply for YC Summer 2026. If you're still hesitating: the companies that change the world don't wait until they're 100% ready. Applying is sometimes literally the first step.

Diego Francisco Valenzuela Iturra retweeted

Browser Use@browser_use·12h

Supercharge @openclaw with browser-harness. With browser-harness, OpenClaw can: create its own browser tools; use SOTA stealth cloud browsers in parallel. Now OpenClaw can do any task on the web. Try it now ↓🔗

Diego Francisco Valenzuela Iturra retweeted

Runway@runwayml·21h

Real-time video agents are here. Today, we’re sharing how we built Runway Characters, allowing you to turn one image into a fully expressive, conversational video agent streaming at 24 frames per second in HD. With just 1.75 seconds of end-to-end latency. Learn more below.

Diego Francisco Valenzuela Iturra retweeted

Google Research@GoogleResearch·11 Mar

Today we announce results from a first-of-its-kind study with @BIDMC_Medicine on AMIE, our conversational AI for clinical reasoning. In a real-world clinical study, AMIE was found to be safe, feasible, and well-received by patients. Learn more: goo.gle/4sXCogz

Diego Francisco Valenzuela Iturra retweeted

Google Research@GoogleResearch·12 Aug

Today, Google Research & @GoogleDeepMind introduce g-AMIE, an extension of our diagnostic AI system based on #Gemini 2.0 Flash. It uses a guardrail that prohibits medical advice sharing & instead provides a summary for a physician to review. Learn more: goo.gle/45shHyT

Diego Francisco Valenzuela Iturra@diegovogeid·10h

@HeyGen dev

Diego Francisco Valenzuela Iturra retweeted

HeyGen@HeyGen·22h

Video gets better when people share, build together, and learn together. hyperframes.dev is live. Browse community projects, download any zip, hand it to your agent, or publish yours:
$ npx hyperframes publish
Publish, then RT + comment "dev" for credits (must follow).

Diego Francisco Valenzuela Iturra retweeted

Jared Friedman@snowmaker·18h

Little known fact: many of YC's best companies applied on a whim, a few hours before the deadline.

Y Combinator@ycombinator

Today's the deadline to apply for YC Summer 2026. ycombinator.com/apply

Diego Francisco Valenzuela Iturra retweeted

Microsoft AI Frontiers@ms_aifrontiers·13h

Most web agents drive a browser one click at a time. We tried something different and it worked better than we expected. Webwright, a new project from our team, gives the model a terminal instead of a click loop. The agent writes Playwright code, spawns browser sessions on demand, and ends with a reusable script rather than a transient session. The results: SOTA on long horizon web benchmark Odysseys (60.8%, a 16-point jump over the previous best) and 86.7% on Online-Mind2Web with GPT-5.4 — the highest of any open-source AutoEval recipe we know of. All from a minimal harness that's roughly 1K lines of code with no multi-agent orchestration. The broader bet: as models get better at code, the right harness gets smaller, not larger. Great work by @Adamlu28 @Xu_Lingrui_ @huang_chao4969 @ahmed @AhmedHAwadallah You can check it out: microsoft.github.io/Webwright/

Diego Francisco Valenzuela Iturra retweeted

elvis@omarsar0·23h

Autodata (from Meta) is an agentic data scientist that builds high-quality training and evaluation data autonomously. Great work on the autoharness track. (bookmark it)

DAIR.AI@dair_ai

Banger paper from Meta FAIR. They introduce Autodata, an agentic data scientist that builds high-quality training and evaluation data autonomously. The headline result: on a CS research QA task, an Agentic Self-Instruct loop produces a 34-point gap between weak and strong solvers (43.7% vs 77.8%), while standard CoT Self-Instruct on the same setup produces a 1.9-point gap (71.4% vs 73.3%). The agent generates questions that actually discriminate between models. The method: An orchestrator LLM directs a challenger agent to generate examples grounded in domain documents. A weak and a strong solver attempt them, a judge scores the outputs, and the orchestrator analyzes the failures and prompts the challenger to regenerate from new angles until quality thresholds are met. The system also meta-optimizes itself. An outer loop tunes the agent's instructions based on which harness changes lift validation pass rate. Over 126 accepted iterations, validation pass rate climbed from 12.8% to 42.4%. They processed 10,000+ CS papers and produced 2,117 quality-filtered QA pairs. Existing self-instruct pipelines do not control data quality. Autodata reframes data generation as an agent loop, spend more inference compute and the data gets harder, which gives downstream RL a real lift. Blog: facebookresearch.github.io/RAM/blogs/auto… Learn to build effective AI agents in our academy: academy.dair.ai

Diego Francisco Valenzuela Iturra retweeted

Lauren Reeder@laurenmhreeder·21h

Our friend @bcherny created Claude Code and told me he hasn't written a line of code himself in 2026. His team is living in the future at @AnthropicAI. We talked about why coding is effectively solved, how loops are changing the way we work, and why the printing press is the right analogy for what's coming to software. Hint: it’s going to be a massive value creation opportunity.
00:00 Introduction
00:55 Claude Code Crowd Check
02:39 Origin Story of Claude Code
03:35 From Typeahead to Agents
05:07 Is Coding Solved
06:50 Boris Personal Workflow
08:51 Future Teams and Generalists
10:26 SaaS Apocalypse Predictions
12:57 Audience Q&A Deep Dive
23:35 Closing and What’s Next

Diego Francisco Valenzuela Iturra retweeted

ollama@ollama·14h

🤯 Ollama now supports Claude Desktop via Claude’s built-in third party inference.
ollama launch claude-desktop
This allows all models from Ollama's Cloud to be used across Claude Cowork and Claude Code from the Claude Desktop app.

Diego Francisco Valenzuela Iturra retweeted

Santi Torres@SantiTorAI·1d

🚨 Karpathy just dropped 40 minutes of gold on AI agents in 2026. What to learn, what to build, and what to throw out before it sinks you. 90% of today's tools won't survive 90 days. The filter is already done. Free.

Diego Francisco Valenzuela Iturra retweeted

Logan Kilpatrick@OfficialLoganK·15h

We just shipped Webhooks in the Gemini API :) This is a big step towards making the DevX for long running tasks (batch, agents, GenMedia, etc) way better.

Google AI Studio@GoogleAIStudio

x.com/i/article/2051…

Diego Francisco Valenzuela Iturra retweeted

ClaudeDevs@ClaudeDevs·18h

Managing API keys is one of the top security concerns we hear from customers. Today we’re introducing keyless auth for Claude Platform: authenticate via browser with the CLI, or let workloads use their existing cloud identity (AWS, GCP, Azure, or any OIDC token provider).

Diego Francisco Valenzuela Iturra retweeted

Google Research@GoogleResearch·5d

Since introducing Empirical Research Assistance last fall, Google Research scientists have been using it to address real-world applications in epidemiology, cosmology, atmospheric monitoring, and neuroscience, providing a hint of AI’s transformational capabilities to accelerate scientific discoveries. Learn more →goo.gle/3OVeZ0K

Diego Francisco Valenzuela Iturra retweeted

Google DeepMind@GoogleDeepMind·4d

AI co-clinician is our new research initiative to help explore how multimodal agents could better support healthcare workers and patients. 🩺 Here’s a snapshot of our progress 🧵

Diego Francisco Valenzuela Iturra retweeted

Pushmeet Kohli@pushmeet·4d

I am happy to introduce AI co-clinician, @GoogleDeepMind's research initiative to explore how AI could better amplify doctors' expertise and help deliver higher quality care to patients. We’re excited about our early results, and are taking a phased approach to our research explorations with academic and research collaborators. Read more in our blog: deepmind.google/blog/ai-co-cli…

Diego Francisco Valenzuela Iturra retweeted