Sumit

10K posts

Sumit banner
Sumit

Sumit

@_reachsumit

Senior ML Engineer @Meta | prev: @TikTok_us, @Amazon, @Samsung | UChicago Alum https://t.co/hcCJ2n979W 🇮🇳→🇰🇷→🇦🇺→🇨🇦→🇺🇲

Seattle, WA Katılım Nisan 2010
504 Takip Edilen4.1K Takipçiler
Sabitlenmiş Tweet
Sumit
Sumit@_reachsumit·
In the final post of the Adaptive RAG series, we explore how to treat selective retrieval as a core, learned skill, moving from passive observation to active, intelligent decision-making. blog.reachsumit.com/posts/2025/10/…
English
1
1
17
7.2K
Sumit
Sumit@_reachsumit·
Reinforced Preference Optimization for Reasoning-Augmented Recommendations Kuaishou integrates LLM chain-of-thought reasoning into recommendations via a dedicated retrieval head, using reinforcement learning to align reasoning quality. 📝 arxiv.org/abs/2605.21967
English
1
2
3
472
Sumit
Sumit@_reachsumit·
PEARL: Unbiased Percentile Estimation via Contrastive Learning for Industrial-Scale Livestream Recommendation TikTok presents a debiasing framework for recommendations that replaces raw engagement regression with contrastive percentile estimation. 📝 arxiv.org/abs/2605.21752
English
0
1
4
402
Sumit
Sumit@_reachsumit·
FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation TikTok presents a livestreaming recommender that replaces item IDs with multimodal semantic codes generated by a cross-domain encoder. 📝 arxiv.org/abs/2605.21832
English
0
0
6
484
Sumit
Sumit@_reachsumit·
Bridging the Cold-Start Gap: LLM-Powered Synthetic Data Generation for Natural Language Search at Airbnb Airbnb combines contrastive listing pairs with seed queries to generate synthetic training data and relevance labels for natural language search. 📝 arxiv.org/abs/2605.21812
English
1
0
7
443
Sumit
Sumit@_reachsumit·
LLM Retrieval for Stable and Predictable Ad Recommendations Meta presents an LLM-based semantic candidate generation framework for ads retrieval that extracts hierarchical ad attributes and uses graph traversal to improve delivery performance. 📝 arxiv.org/abs/2605.21969
English
1
0
3
265
Sumit
Sumit@_reachsumit·
Integrating Chain-of-Thought into Generative Retrieval: A Preliminary Study Interleaves chain-of-thought reasoning with document identifier generation in a single end-to-end pass for complex multi-hop retrieval. 📝 arxiv.org/abs/2605.22358
English
0
0
10
379
Sumit
Sumit@_reachsumit·
Search-E1: Self-Distillation Drives Self-Evolution in Search-Augmented Reasoning Proposes a self-evolution pipeline for search-augmented reasoning agents that alternates standard GRPO training with offline self-distillation. 📝 arxiv.org/abs/2605.22511
English
0
0
19
778
Sumit
Sumit@_reachsumit·
One prompt is not enough: Instruction Sensitivity Undermines Embedding Model Evaluation Shows that single-prompt evaluation of instruction-tuned embedding models is unreliable. 📝 arxiv.org/abs/2605.22544 👨🏽‍💻 github.com/centre-for-hum…
English
0
6
14
743
Sumit
Sumit@_reachsumit·
DIVE: Embedding Compression via Self-Limiting Gradient Updates @realDonZhao introduces a compression adapter that combines a hinge-based triplet loss (zero gradient once margin is satisfied) with a head-wise NT-Xent contrastive loss. 📝 arxiv.org/abs/2605.20689
English
0
1
15
972
Sumit
Sumit@_reachsumit·
Layer-wise Token Compression for Efficient Document Reranking @ShengyaoZhuang et al. at Amazon propose applying token compression at intermediate transformer layers for cross-encoder rerankers, preserving early query-document interactions. 📝 arxiv.org/abs/2605.20683
English
0
0
9
535
Sumit
Sumit@_reachsumit·
Trust or Abstain? A Self-Aware RAG Approach Introduces a self-aware belief estimator that uses frozen LLM hidden states to independently assess parametric and contextual knowledge reliability. 📝 arxiv.org/abs/2605.18792 👨🏽‍💻 github.com/xizhu1022/SABER
English
0
0
7
440
Sumit
Sumit@_reachsumit·
ClusterRAG: Cluster-Based Collaborative Filtering for Personalized Retrieval-Augmented Generation Clusters users via HDBSCAN and retrieves documents from both the target user's profile and similar users' profiles. 📝 arxiv.org/abs/2605.18769 👨🏽‍💻 github.com/academicprojec…
English
0
3
26
1.2K
Sumit
Sumit@_reachsumit·
LWGR: Lagrangian-Constrained Personalized World Knowledge for Generative Recommendation Alibaba transfers personalized world knowledge from LLMs into generative recommendation, using soft instructions and controllable knowledge fusion. 📝 arxiv.org/abs/2605.18771
English
0
0
8
608
Sumit
Sumit@_reachsumit·
Mask-to-Correct+: Leveraging Retriever Diversity for Masking-guided Faithful Fact Correction Introduces a training-free RAG framework that uses diversity-aware masking to identify erroneous spans in claims. 📝 arxiv.org/abs/2605.18776 👨🏽‍💻 github.com/payelsantra/Ma…
English
0
0
8
421
Sumit
Sumit@_reachsumit·
The 99% Success Paradox: When Near-Perfect Retrieval Equals Random Selection Meta introduces a chance-corrected selectivity metric that reveals when high retrieval success rates actually perform no better than random selection. 📝 arxiv.org/abs/2605.18857
English
0
1
2
848
Sumit
Sumit@_reachsumit·
Understanding Wacky Weights: A Dissection of SPLADE's Learned Term Importance Formally defines "wacky weights" in SPLADE, finding that larger vocabularies increase their prevalence, stricter sparsity regularizers reduce it. 📝 arxiv.org/abs/2605.19628 👨🏽‍💻 github.com/polgrisha/unde…
English
0
2
24
978
Sumit
Sumit@_reachsumit·
Divergence Meets Consensus: A Multi-Source Negative Sampling Framework for Sequential Recommendation Introduces a "Teacher-Peer-Self" negative sampling framework inspired by Vygotsky's Zone of Proximal Development. 📝 arxiv.org/abs/2605.19651 👨🏽‍💻 github.com/Lyz103/SIGIR26…
English
0
2
10
442