Rishabh Singh

65 posts

Rishabh Singh

Rishabh Singh

@rishabhs

Research Lead @Databricks. Previously @Meta GenAI, Google Brain @GoogleAI, @MSFTResearch, @MIT_CSAIL @IITKgp

Mountain View, CA เข้าร่วม Haziran 2009
97 กำลังติดตาม994 ผู้ติดตาม
ทวีตที่ปักหมุด
Rishabh Singh
Rishabh Singh@rishabhs·
Very excited about formula prediction being released in Google Sheets! A great collaboration between Google Sheets and Brain team.
GIF
English
9
77
514
0
Rishabh Singh รีทวีตแล้ว
Rishabh Singh รีทวีตแล้ว
Jonathan Frankle
Jonathan Frankle@jefrankle·
Meet KARL, an RL'd model for document-centric tasks at frontier quality and open source cost/speed. Great for @databricks customers and scientists (77-page tech report!) As usual, this isn't just one model - it's an RL assembly line to churn out models for us and our customers 🧵
Jonathan Frankle tweet mediaJonathan Frankle tweet media
English
9
46
241
67.7K
Rishabh Singh รีทวีตแล้ว
Databricks
Databricks@databricks·
Reliable enterprise agents require system-level reasoning when retrieving across heterogeneous knowledge sources. Traditional RAG often fails to consistently follow instructions, schemas, and constraints end to end. That’s why we’re presenting Instructed Retriever, a new retrieval architecture that propagates complete system specifications through every stage of the search pipeline. The approach delivers: - 35–50% gains in retrieval recall on instruction-following benchmarks - 70% improvements in end-to-end answer quality over simplistic RAG, and ~15% over reranking-based approaches - Strong instruction adherence with small, efficient models suitable for real-world deployment Together, these results show how system-wide instruction awareness translates directly into more accurate and efficient enterprise agents. databricks.com/blog/instructe…
Databricks tweet media
English
4
13
71
11.5K
Rishabh Singh รีทวีตแล้ว
Jonathan Frankle
Jonathan Frankle@jefrankle·
I'm hiring interns for next summer at @databricks! Specifically on (1) empirical RL at scale on non-verifiable tasks and (2) enabling real people specify the behaviors they want out of AI (e.g., through evals) on highly complex tasks. 🧵
English
17
47
527
92.2K
Rishabh Singh รีทวีตแล้ว
Databricks
Databricks@databricks·
Big news: Databricks and @OpenAI are partnering to deliver powerful AI to the enterprise. OpenAI frontier models will now be available natively in Databricks. This means you can build, evaluate and scale production-grade AI apps and agents on your governed enterprise data, leveraging the latest OpenAI models like GPT-5. We’re excited to expand our relationship with OpenAI; Databricks was one of the first to host gpt-oss open models, they use Databricks products and now we’re offering OpenAI models natively on Databricks: databricks.com/blog/run-opena…
Databricks tweet media
English
6
39
193
23.4K
Rishabh Singh รีทวีตแล้ว
Matei Zaharia
Matei Zaharia@matei_zaharia·
Prompt optimization is becoming a powerful technique for improving AI that can even beat SFT! Here are some of our research results with GEPA at Databricks, in difficult Agent Bricks info extraction tasks. We can match the best models at 90x lower cost, or improve them by ~6%.
Matei Zaharia tweet media
English
30
126
882
107.4K
Rishabh Singh รีทวีตแล้ว
Ivan Zhou
Ivan Zhou@ivanzhouyq·
Automated prompt optimization (GEPA) can push open-source models beyond frontier performance on enterprise tasks — at a fraction of the cost! 🔑 Key results from our research @DbrxMosaicAI: 1⃣ gpt-oss-120b + GEPA beats Claude Opus 4.1 on Information Extraction (+2.2 points) — while being 90× cheaper to serve. 2⃣ The same technique also lifts frontier models (Claude Sonnet 4, Opus 4.1), pushing them to new SOTA benchmarks. 3⃣Versus Supervised Fine-Tuning (SFT): GEPA delivers equal or better performance at 20% lower serving cost. Even better → GEPA + SFT together gives the highest gains. 4⃣Lifetime cost analysis shows GEPA + gpt-oss is orders of magnitude cheaper overall. At scale, the one-time optimization overhead fades away — making optimized agents highly practical for real-world deployment. #dspy #gepa #promptoptimization #airesearch
English
11
68
533
85.5K
Rishabh Singh รีทวีตแล้ว
Databricks
Databricks@databricks·
The future of data science is autonomous, collaborative, and faster than ever. That's why we're excited to introduce the Data Science Agent for Databricks Assistant, an autonomous partner that plans, executes, and self-corrects entire workflows in your Notebooks and SQL Editor. Get: -End-to-end lifecycle support — from EDA to feature engineering, model training, and evaluation -Autonomous multi-step execution with full transparency and control -Deep Unity Catalog integration for governed, production-ready results -Native to Databricks Notebooks and SQL Editor for a seamless experience databricks.com/blog/introduci…
Databricks tweet media
English
0
8
35
6.4K
Rishabh Singh รีทวีตแล้ว
Ali Ghodsi
Ali Ghodsi@alighodsi·
Databricks just signed a Series K term sheet at >$100B valuation to scale two flagship products: 🔥 Lakebase — serverless Postgres with true compute/storage separation 🧠 Agent Bricks — agentic framework with built-in reasoning guardrails for enterprise data wsj.com/tech/ai/databr…
English
60
106
1.1K
219.5K
Rishabh Singh รีทวีตแล้ว
Michael Bendersky
Michael Bendersky@bemikelive·
Since joining @databricks, our research team has been hard at work on Agent Bricks, a new product that helps enterprises develop state-of-the-art domain-specific agents. We are now releasing a research blog about Agent Learning from Human Feedback (ALHF) databricks.com/blog/agent-lea…
English
2
20
101
9.8K
Rishabh Singh รีทวีตแล้ว
Jonathan Frankle
Jonathan Frankle@jefrankle·
RLVR isn't just for math and coding! At @databricks, it's impacting products and users across domains. One example: SQL Q&A. We hit the top of the BIRD single-model single-generation leaderboard with our standard TAO+RLVR recipe - the one rolling out in our Agent Bricks product.
Jonathan Frankle tweet media
English
3
15
108
23.1K
Rishabh Singh รีทวีตแล้ว
Michael Bendersky
Michael Bendersky@bemikelive·
This is a good opportunity to announce that I recently joined the research team at @databricks where I will be working alongside @jefrankle @rishabhs @matei_zaharia Erich Elsen, and many others on the hardest problems at the intersection of information retrieval and AI.
Jonathan Frankle@jefrankle

I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵

English
2
7
39
6.8K
Rishabh Singh รีทวีตแล้ว
Jonathan Frankle
Jonathan Frankle@jefrankle·
I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵
English
9
24
224
40.8K
Rishabh Singh รีทวีตแล้ว
Matei Zaharia
Matei Zaharia@matei_zaharia·
We're finding that what's needed in RL for enterprise tasks is pretty different than in foundation model training on math, code, etc. Catch @jefrankle and our team at ICML to talk about these problems!
Jonathan Frankle@jefrankle

Properties of our problems: * Semi-verifiability. Can LLM judges productively augment RLVR? How clean must they be? * Intermediate rewards. Signals we can exploit to make harder tasks tractable. * Real traces. Tons of human traces for imitation learning or environment building.

English
1
5
45
7.6K
Rishabh Singh
Rishabh Singh@rishabhs·
I'm super excited to share that I recently joined the @databricks AI research team to help with AI for data science efforts. We are working on real-world AGI to help customers succeed on the Databricks platform. We are hiring, please join us in this exciting mission!
Jonathan Frankle@jefrankle

I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵

English
3
1
39
2.9K
Rishabh Singh รีทวีตแล้ว
Yujia Li
Yujia Li@liyuajia·
Excited to share the project #AlphaCode I’ve been working on for more than 2 years! Can’t believe we started before COVID is a thing and worked through this project mostly at home, with an amazing team!
Google DeepMind@GoogleDeepMind

Introducing #AlphaCode: a system that can compete at average human level in competitive coding competitions like @codeforces. An exciting leap in AI problem-solving capabilities, combining many advances in machine learning! Read more: dpmd.ai/Alpha-Code 1/

English
21
95
1.1K
0