Machine Learning Expedition

752 posts

Machine Learning Expedition

@MLexpAI

Deep Learning, Computer Vision, Machine Learning

San Francisco, CA Se unió Eylül 2021

415 Siguiendo321 Seguidores

Machine Learning Expedition@MLexpAI·17 May

Agents communicating through latent states instead of natural language arxiv.org/abs/2604.25917

English

Machine Learning Expedition@MLexpAI·9 Oca

Paper: arxiv.org/pdf/2601.03192

English

Machine Learning Expedition@MLexpAI·9 Oca

Explicitly separates stable LLM reasoning from a plastic, evolving memory, effectively addressing the stability-plasticity dilemma to allow for continuous runtime improvement and avoid catastrophic forgetting.

English

Machine Learning Expedition@MLexpAI·9 Oca

MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY 1. Introduces a two-phase retrieval system that first filters memory candidates by semantic relevance and then selects the final ones based on learned utility (Q-values), moving beyond purely semantic matching.

English

Machine Learning Expedition@MLexpAI·26 Ara

#science-mathematics" target="_blank" rel="nofollow noopener">blog.google/technology/ai/…

ZXX

Machine Learning Expedition@MLexpAI·16 Ara

3. Modular and Multi-Path-Aware Offline Benchmarking for Mobile GUI Agents arxiv.org/pdf/2512.12634

English

Machine Learning Expedition@MLexpAI·16 Ara

2. WEBOPERATOR: ACTION-AWARE TREE SEARCH FOR AUTONOMOUS AGENTS IN WEB ENVIRONMENT arxiv.org/pdf/2512.12692

English

Machine Learning Expedition@MLexpAI·16 Ara

Recent LLM Agent Papers 1. MAC: A Multi-Agent Framework for Interactive User Clarification in Multi-turn Conversations

English

Machine Learning Expedition@MLexpAI·14 Ara

Paper URL: arxiv.org/pdf/2511.21689

English

Machine Learning Expedition@MLexpAI·14 Ara

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

English

Machine Learning Expedition@MLexpAI·12 Ara

Paper Link: arxiv.org/pdf/2512.08296

English

Machine Learning Expedition@MLexpAI·12 Ara

• Capability Saturation Once a single agent achieves ~45% task performance baseline, additional agents often degrade results because coordination overhead dominates.

English

Machine Learning Expedition@MLexpAI·12 Ara

Towards a Science of Scaling Agent Systems- This paper aims to move agentic AI systems (LLM-powered systems that reason, plan, and act) from heuristic practice toward a principled, quantitative science—especially for scaling those systems effectively

English

Machine Learning Expedition@MLexpAI·3 Ara

This paper got best paper award from Neurips 2025

English

Machine Learning Expedition@MLexpAI·3 Ara

Authors introduce infinity-chat dataset with 26k real world open-ended questions. The also provide human annotations including absolute rating and pairwise preferences. arxiv.org/abs/2510.22954

English

Descubrir

@elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine @katyperry