Diego Francisco Valenzuela Iturra

7.5K posts

@diegovogeid

Coffee, Music and Deep Learning

Santiago, Chile · Joined March 2018
1.5K Following · 165 Followers
Diego Francisco Valenzuela Iturra retweeted
Garry Tan @garrytan
Today is the deadline to apply for YC Summer 2026. If you're still hesitating: the companies that change the world don't wait until they're 100% ready. Applying is sometimes literally the first step.
English
172
120
1.9K
130.1K
Diego Francisco Valenzuela Iturra retweeted
Browser Use @browser_use
Supercharge @openclaw with browser-harness

With browser-harness, OpenClaw can:
> create its own browser tools
> use SOTA stealth cloud browsers in parallel

Now OpenClaw can do any task on the web

Try it now ↓🔗
English
14
20
254
19.7K
Diego Francisco Valenzuela Iturra retweeted
Runway @runwayml
Real-time video agents are here. Today, we’re sharing how we built Runway Characters, allowing you to turn one image into a fully expressive, conversational video agent streaming at 24 frames per second in HD. With just 1.75 seconds of end-to-end latency. Learn more below.
English
57
97
801
74.8K
Diego Francisco Valenzuela Iturra retweeted
Google Research @GoogleResearch
Today we announce results from a first-of-its-kind study with @BIDMC_Medicine on AMIE, our conversational AI for clinical reasoning. In a real-world clinical study, AMIE was found to be safe, feasible, and well-received by patients. Learn more: goo.gle/4sXCogz
English
9
96
395
63K
Diego Francisco Valenzuela Iturra retweeted
Google Research @GoogleResearch
Today, Google Research & @GoogleDeepMind introduce g-AMIE, an extension of our diagnostic AI system based on #Gemini 2.0 Flash. It uses a guardrail that prohibits medical advice sharing & instead provides a summary for a physician to review. Learn more: goo.gle/45shHyT
English
21
95
556
75.1K
Diego Francisco Valenzuela Iturra retweeted
HeyGen @HeyGen
Video gets better when people share, build together, and learn together

hyperframes.dev is live. Browse community projects, download any zip, hand it to your agent or publish yours

$ npx hyperframes publish

Publish then RT + comment "dev" for credits (must follow)
English
255
200
629
476.8K
Diego Francisco Valenzuela Iturra retweeted
Microsoft AI Frontiers @ms_aifrontiers
Most web agents drive a browser one click at a time. We tried something different and it worked better than we expected.

Webwright, a new project from our team, gives the model a terminal instead of a click loop. The agent writes Playwright code, spawns browser sessions on demand, and ends with a reusable script rather than a transient session.

The results: SOTA on long horizon web benchmark Odysseys (60.8%, a 16-point jump over the previous best) and 86.7% on Online-Mind2Web with GPT-5.4, the highest of any open-source AutoEval recipe we know of. All from a minimal harness that's roughly 1K lines of code with no multi-agent orchestration.

The broader bet: as models get better at code, the right harness gets smaller, not larger.

Great work by @Adamlu28 @Xu_Lingrui_ @huang_chao4969 @ahmed @AhmedHAwadallah

You can check it out: microsoft.github.io/Webwright/
English
9
15
59
6K
Diego Francisco Valenzuela Iturra retweeted
elvis @omarsar0
Autodata (from Meta) is an agentic data scientist that builds high-quality training and evaluation data autonomously. Great work on the autoharness track. (bookmark it)
DAIR.AI @dair_ai

Banger paper from Meta FAIR. They introduce Autodata, an agentic data scientist that builds high-quality training and evaluation data autonomously.

The headline result: on a CS research QA task, an Agentic Self-Instruct loop produces a 34-point gap between weak and strong solvers (43.7% vs 77.8%), while standard CoT Self-Instruct on the same setup produces a 1.9-point gap (71.4% vs 73.3%). The agent generates questions that actually discriminate between models.

The method: An orchestrator LLM directs a challenger agent to generate examples grounded in domain documents. A weak and a strong solver attempt them, a judge scores the outputs, and the orchestrator analyzes the failures and prompts the challenger to regenerate from new angles until quality thresholds are met.

The system also meta-optimizes itself. An outer loop tunes the agent's instructions based on which harness changes lift validation pass rate. Over 126 accepted iterations, validation pass rate climbed from 12.8% to 42.4%. They processed 10,000+ CS papers and produced 2,117 quality-filtered QA pairs.

Existing self-instruct pipelines do not control data quality. Autodata reframes data generation as an agent loop: spend more inference compute and the data gets harder, which gives downstream RL a real lift.

Blog: facebookresearch.github.io/RAM/blogs/auto…

Learn to build effective AI agents in our academy: academy.dair.ai

English
4
11
116
21.9K
Diego Francisco Valenzuela Iturra retweeted
Lauren Reeder @laurenmhreeder
Our friend @bcherny created Claude Code and told me he hasn't written a line of code himself in 2026. His team is living in the future at @AnthropicAI.

We talked about why coding is effectively solved, how loops are changing the way we work, and why the printing press is the right analogy for what's coming to software.

Hint: it’s going to be a massive value creation opportunity.

00:00 Introduction
00:55 Claude Code Crowd Check
02:39 Origin Story of Claude Code
03:35 From Typeahead to Agents
05:07 Is Coding Solved
06:50 Boris Personal Workflow
08:51 Future Teams and Generalists
10:26 SaaS Apocalypse Predictions
12:57 Audience Q&A Deep Dive
23:35 Closing and What’s Next
English
6
23
161
22.9K
Diego Francisco Valenzuela Iturra retweeted
ollama @ollama
🤯 Ollama now supports Claude Desktop via Claude’s built-in third party inference.

ollama launch claude-desktop

This allows all models from Ollama's Cloud to be used across Claude Cowork and Claude Code from the Claude Desktop app.
English
116
370
3.3K
264.4K
Diego Francisco Valenzuela Iturra retweeted
Santi Torres @SantiTorAI
🚨 Karpathy just dropped 40 minutes of gold on AI agents in 2026. What to learn, what to build, and what to throw out before it sinks you. 90% of today's tools won't survive 90 days. The filtering is already done. Free.
Spanish
12
87
626
40.3K
Diego Francisco Valenzuela Iturra retweeted
ClaudeDevs @ClaudeDevs
Managing API keys is one of the top security concerns we hear from customers. Today we’re introducing keyless auth for Claude Platform: authenticate via browser with the CLI, or let workloads use their existing cloud identity (AWS, GCP, Azure, or any OIDC token provider).
English
140
447
4.4K
493.8K
Diego Francisco Valenzuela Iturra retweeted
Google Research @GoogleResearch
Since introducing Empirical Research Assistance last fall, Google Research scientists have been using it to address real-world applications in epidemiology, cosmology, atmospheric monitoring, and neuroscience, providing a hint of AI’s transformational capabilities to accelerate scientific discoveries. Learn more →goo.gle/3OVeZ0K
English
10
73
487
20.7K
Diego Francisco Valenzuela Iturra retweeted
Google DeepMind @GoogleDeepMind
AI co-clinician is our new research initiative to help explore how multimodal agents could better support healthcare workers and patients. 🩺 Here’s a snapshot of our progress 🧵
English
78
222
1.2K
337.5K
Diego Francisco Valenzuela Iturra retweeted
Pushmeet Kohli @pushmeet
I am happy to introduce AI co-clinician, @GoogleDeepMind's research initiative to explore how AI could better amplify doctors' expertise and help deliver higher quality care to patients. We’re excited about our early results, and are taking a phased approach to our research explorations with academic and research collaborators. Read more in our blog: deepmind.google/blog/ai-co-cli…
English
21
98
602
37.7K
Diego Francisco Valenzuela Iturra retweeted
Gabriel Chua @gabrielchua
You can also use Codex to create presentations in Google Slides without opening your browser
English
5
5
84
8.9K