DailyPapers

5.2K posts

@HuggingPapers

Tweeting interesting papers submitted at https://t.co/rXX8x0HzXV. Submit your own at https://t.co/QhbJKXBd4Q, and link models/datasets/demos to it!

Anywhere · Joined March 2025
4 Following · 17.1K Followers
DailyPapers @HuggingPapers ·
Learning while Deploying: Fleet-Scale RL for Generalist Robot Policies

A new framework that turns robot deployment into a continuous training loop, enabling 16 dual-arm robots to improve from real-world experience and achieve 95% success on long-horizon tasks like brewing tea and making cocktails.
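The deployment-as-training idea above can be sketched as a loop: robots run the current policy, their episodes are pooled, and a central learner periodically refreshes the shared policy. This is a minimal illustration of that loop, not the paper's method; all names (`Policy`, `fleet_loop`) and the toy episode logic are assumptions.

```python
import random

class Policy:
    """Stand-in for a robot policy; only tracks its update version."""
    def __init__(self, version=0):
        self.version = version

    def act(self, observation):
        # Placeholder action selection (the observation is ignored here).
        return random.choice(["grasp", "pour", "stir"])

def run_episode(robot_id, policy):
    """One deployment episode; returns a logged experience record."""
    trajectory = [policy.act(obs) for obs in range(5)]  # fake observations
    return {"robot": robot_id, "actions": trajectory, "success": True}

def fleet_loop(num_robots=16, rounds=3):
    """Alternate fleet-wide deployment with centralized policy updates."""
    policy = Policy()
    replay = []
    for _ in range(rounds):
        # 1) Deploy: every robot collects real-world experience.
        replay.extend(run_episode(r, policy) for r in range(num_robots))
        # 2) Learn: update the shared policy from the pooled episodes.
        policy = Policy(version=policy.version + 1)
    return policy, replay

policy, replay = fleet_loop()
```

The key design point is that deployment never pauses for training: collection and learning alternate, so every robot always runs the latest policy.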
DailyPapers @HuggingPapers ·
UniVidX: A Unified Multimodal Framework for Versatile Video Generation

Enables omni-directional generation across RGB, intrinsic maps, and alpha channels using diffusion priors with stochastic condition masking, trained on fewer than 1,000 videos. (SIGGRAPH 2026)
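Stochastic condition masking, as summarized above, can be sketched in a few lines: during training, each conditioning channel is independently replaced by a null embedding with some probability, so one model learns to generate from any subset of conditions. This is a generic sketch of the technique, not UniVidX's implementation; the drop probability and channel names are assumptions.

```python
import random

NULL = None  # stands in for a learned "null" embedding

def mask_conditions(conditions, drop_prob=0.5, rng=random):
    """Independently replace each condition with NULL with prob drop_prob."""
    return {
        name: (NULL if rng.random() < drop_prob else value)
        for name, value in conditions.items()
    }

# Three conditioning channels matching the tweet's modalities.
cond = {"rgb": "rgb_latent", "intrinsics": "intr_latent", "alpha": "alpha_latent"}
masked = mask_conditions(cond, drop_prob=0.5, rng=random.Random(0))
# All keys survive; some values become the null embedding.
```

At inference, the same null embedding is fed for whichever conditions the user omits, which is what makes the "any subset" generation possible.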
DailyPapers @HuggingPapers ·
Discuss: huggingface.co/papers/2604.15…

Hallucinations arise from semantic interference during fine-tuning. Self-distillation mitigates this by regularizing output distributions.
DailyPapers @HuggingPapers ·
Fine-tuning increases hallucinations

New research shows SFT causes factual errors by interfering with pre-trained knowledge. The authors propose self-distillation to learn new facts without forgetting, plus selective parameter freezing to reduce hallucinations while preserving performance.
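The self-distillation idea above is commonly implemented as a KL penalty: while fine-tuning on new facts, the student's output distribution is kept close to the frozen pre-fine-tuning model's, limiting interference with pre-trained knowledge. This is a minimal numeric sketch of that loss, not the paper's exact objective; the `beta` weight and toy distributions are assumptions.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) for two discrete probability distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def distill_loss(task_loss, student_probs, teacher_probs, beta=0.1):
    """Task loss plus a KL regularizer toward the frozen base model."""
    return task_loss + beta * kl_divergence(student_probs, teacher_probs)

teacher = [0.7, 0.2, 0.1]  # frozen pre-fine-tuning token distribution
student = [0.6, 0.3, 0.1]  # distribution after an SFT step has drifted
loss = distill_loss(task_loss=1.5, student_probs=student,
                    teacher_probs=teacher, beta=0.1)
```

With `beta=0` this reduces to plain SFT; raising `beta` trades plasticity on new facts for retention of pre-trained knowledge.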
DailyPapers @HuggingPapers ·
Edit-R1: Reasoning verifier-based RL for image editing

Moves beyond simple scorers to chain-of-thought verifiers that break instructions into verifiable principles. Trains editing models via GRPO with fine-grained rewards, outperforming Seed-1.5-VL and scaling up to 7B parameters.
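The "GRPO with fine-grained rewards" combination can be sketched as follows: the verifier scores each sampled edit against several verifiable principles, those scores are aggregated into a reward, and advantages are computed relative to the sampled group (GRPO's defining step). The principle breakdown and aggregation here are illustrative assumptions, not Edit-R1's actual reward design.

```python
def fine_grained_reward(principle_scores):
    """Aggregate per-principle verifier scores into one scalar reward."""
    return sum(principle_scores) / len(principle_scores)

def grpo_advantages(rewards, eps=1e-8):
    """Group-relative advantages: (r - mean) / std over the sampled group."""
    n = len(rewards)
    mean = sum(rewards) / n
    std = (sum((r - mean) ** 2 for r in rewards) / n) ** 0.5
    return [(r - mean) / (std + eps) for r in rewards]

# Four candidate edits for one instruction, each scored on three
# hypothetical principles (e.g. identity kept, instruction followed, no artifacts).
group_scores = [[1.0, 1.0, 0.0], [1.0, 0.0, 0.0], [1.0, 1.0, 1.0], [0.0, 0.0, 0.0]]
rewards = [fine_grained_reward(s) for s in group_scores]
advantages = grpo_advantages(rewards)
```

Because advantages are normalized within the group, no learned value model is needed; the fine-grained scores just make the reward signal less noisy than a single pass/fail verdict.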
DailyPapers @HuggingPapers ·
NVIDIA just released AETC on Hugging Face

44k multi-task video annotations with chain-of-thought reasoning for traffic anomaly detection.
DailyPapers @HuggingPapers ·
Recursive Multi-Agent Systems, Agentic World Modeling, and AI Organizations: Top Papers of the Week

- Recursive Multi-Agent Systems: A new framework scaling agent collaboration through recursive latent-space computation (242 upvotes)
- Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond: A comprehensive taxonomy for AI environment modeling (219 upvotes)
- Heterogeneous Scientific Foundation Model Collaboration (Eywa): Bridging language models with scientific domain foundation models (192 upvotes)
- From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company: The OneManCompany framework (116 upvotes)
- World-R1: Reinforcing 3D Constraints for Text-to-Video Generation (115 upvotes)
- GLM-5V-Turbo by Zhipu AI: Toward native foundation models for multimodal agents (90 upvotes)
DailyPapers @HuggingPapers ·
13 frontier models evaluated: Claude Opus 4.6 leads at 66.7%, GPT-5.4 at 63.8%, Gemini 3.1 Pro at 53.3%. The gap is clear: workspace repair is near-ceiling, but HR, finance, and multi-system orchestration remain unsolved.

Paper: huggingface.co/papers/2604.28…
Leaderboard: claw-eval-live.github.io
DailyPapers @HuggingPapers ·
Claw-Eval-Live

A live benchmark for workflow agents that refreshes quarterly from real marketplace signals. 105 tasks across CRM, HR, finance, and workspace repair show even the best models struggle: Claude Opus 4.6 hits just a 66.7% pass rate, with HR and management workflows failing most.
DailyPapers @HuggingPapers ·
Intern-Atlas traces 60 years of AI method evolution

Built from 1 million papers into a graph with 9 million causal edges, mapping how techniques emerge, relate, and advance across machine learning history.
DailyPapers @HuggingPapers ·
RoundPipe

Fully fine-tune 32B models or LoRA-fine-tune 235B models on a single 24GB GPU with 64K+ context length. Achieves 1.5-2.2× speedups over SOTA baselines by dynamically dispatching stages in a round-robin manner for near-zero pipeline bubbles.
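Round-robin stage dispatch, as described above, can be illustrated with a toy scheduler: pipeline stages are assigned to workers cyclically, so work stays balanced and every worker always has a next stage queued, which is what keeps pipeline bubbles small. This is an illustrative sketch only; RoundPipe's actual dynamic scheduler is more involved, and the stage/worker counts here are assumptions.

```python
def round_robin_schedule(num_stages, num_devices):
    """Map each pipeline stage to a device in cyclic (round-robin) order."""
    return {stage: stage % num_devices for stage in range(num_stages)}

def run_microbatches(num_microbatches, num_stages, num_devices):
    """Count per-device work items under the round-robin assignment."""
    placement = round_robin_schedule(num_stages, num_devices)
    work = {d: 0 for d in range(num_devices)}
    for _ in range(num_microbatches):
        for stage, device in placement.items():
            work[device] += 1  # each device executes its assigned stages
    return work

schedule = round_robin_schedule(num_stages=8, num_devices=4)
work = run_microbatches(num_microbatches=4, num_stages=8, num_devices=4)
```

Because the cyclic assignment spreads stages evenly, no device sits idle waiting for a disproportionately loaded peer, which is the intuition behind the near-zero-bubble claim.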
DailyPapers @HuggingPapers ·
Allen AI just released OlmPool architectural variants on Hugging Face

7-8B parameter models exploring how minor architectural choices impact long-context extension.