Rishabh Singh
@rishabhs
Followers
992
Following
101
Media
5
Statuses
65
Research Lead @Databricks. Previously @Meta GenAI, Google Brain @GoogleAI, @MSFTResearch, @MIT_CSAIL @IITKgp
Mountain View, CA
Joined June 2009
Very excited about formula prediction being released in Google Sheets! A great collaboration between Google Sheets and Brain team.
9
77
513
I’m super excited about the launch of Genie Code! It extends the power of AI coding to agentic data work, answering questions 2-3x more accurately than coding agents and automatically engineering and monitoring high quality pipelines. It’s transformed my own work at Databricks!
Today we're announcing Genie Code, your autonomous AI partner for data. Genie Code is a state-of-the-art agent that lets data teams move from prompting a copilot to delegating real work: building pipelines, machine learning models, debugging failures, and shipping dashboards.
11
15
153
Super excited about the release of Genie Code, Databricks' state of the art Data agent. The AI Research team has been collaborating closely to push the boundaries of the agent’s performance, with lot more coming soon!
Today we're announcing Genie Code, your autonomous AI partner for data. Genie Code is a state-of-the-art agent that lets data teams move from prompting a copilot to delegating real work: building pipelines, machine learning models, debugging failures, and shipping dashboards.
2
6
35
Meet KARL, an RL'd model for document-centric tasks at frontier quality and open source cost/speed. Great for @databricks customers and scientists (77-page tech report!) As usual, this isn't just one model - it's an RL assembly line to churn out models for us and our customers 🧵
9
46
241
Reliable enterprise agents require system-level reasoning when retrieving across heterogeneous knowledge sources. Traditional RAG often fails to consistently follow instructions, schemas, and constraints end to end. That’s why we’re presenting Instructed Retriever, a new
4
13
71
I'm hiring interns for next summer at @databricks! Specifically on (1) empirical RL at scale on non-verifiable tasks and (2) enabling real people specify the behaviors they want out of AI (e.g., through evals) on highly complex tasks. 🧵
17
47
527
Big news: Databricks and @OpenAI are partnering to deliver powerful AI to the enterprise. OpenAI frontier models will now be available natively in Databricks. This means you can build, evaluate and scale production-grade AI apps and agents on your governed enterprise data,
6
38
192
Prompt optimization is becoming a powerful technique for improving AI that can even beat SFT! Here are some of our research results with GEPA at Databricks, in difficult Agent Bricks info extraction tasks. We can match the best models at 90x lower cost, or improve them by ~6%.
30
125
880
Automated prompt optimization (GEPA) can push open-source models beyond frontier performance on enterprise tasks — at a fraction of the cost! 🔑 Key results from our research @DbrxMosaicAI: 1⃣ gpt-oss-120b + GEPA beats Claude Opus 4.1 on Information Extraction (+2.2 points) —
11
67
533
The future of data science is autonomous, collaborative, and faster than ever. That's why we're excited to introduce the Data Science Agent for Databricks Assistant, an autonomous partner that plans, executes, and self-corrects entire workflows in your Notebooks and SQL Editor.
0
8
35
Databricks just signed a Series K term sheet at >$100B valuation to scale two flagship products: 🔥 Lakebase — serverless Postgres with true compute/storage separation 🧠 Agent Bricks — agentic framework with built-in reasoning guardrails for enterprise data
wsj.com
The data analytics company, fueled by the AI boom, was valued at $62 billion less than a year ago.
60
106
1K
Try out GEPA! Excited to see how it does on people's problems.
Very excited to share that GEPA is now live on @DSPyOSS as dspy.GEPA! This is an early code release. We’re looking forward to community feedback, especially about any practical challenges in switching optimizers.
1
10
65
Since joining @databricks, our research team has been hard at work on Agent Bricks, a new product that helps enterprises develop state-of-the-art domain-specific agents. We are now releasing a research blog about Agent Learning from Human Feedback (ALHF)
databricks.com
Discover how Agent Learning from Human Feedback (ALHF) powers Databricks Agent Bricks, enabling AI agents like Knowledge Assistant to adapt to expert expectations with minimal natural language...
2
19
101
More details in the blog: https://t.co/ZqmcWh6l2K This work was led by @DipendraMisra with contributions from many others. If you're interested in taking this for a spin yourself, sign up here:
docs.google.com
This form allows you to register for the current TAO and RLVR private preview at Databricks.
1
3
16
RLVR isn't just for math and coding! At @databricks, it's impacting products and users across domains. One example: SQL Q&A. We hit the top of the BIRD single-model single-generation leaderboard with our standard TAO+RLVR recipe - the one rolling out in our Agent Bricks product.
3
15
108
This is a good opportunity to announce that I recently joined the research team at @databricks where I will be working alongside @jefrankle @rishabhs @matei_zaharia Erich Elsen, and many others on the hardest problems at the intersection of information retrieval and AI.
I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵
2
6
39
I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵
9
24
224
We're finding that what's needed in RL for enterprise tasks is pretty different than in foundation model training on math, code, etc. Catch @jefrankle and our team at ICML to talk about these problems!
Properties of our problems: * Semi-verifiability. Can LLM judges productively augment RLVR? How clean must they be? * Intermediate rewards. Signals we can exploit to make harder tasks tractable. * Real traces. Tons of human traces for imitation learning or environment building.
1
5
45
I'm super excited to share that I recently joined the @databricks AI research team to help with AI for data science efforts. We are working on real-world AGI to help customers succeed on the Databricks platform. We are hiring, please join us in this exciting mission!
I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵
3
1
39
Excited to share the project #AlphaCode I’ve been working on for more than 2 years! Can’t believe we started before COVID is a thing and worked through this project mostly at home, with an amazing team!
Introducing #AlphaCode: a system that can compete at average human level in competitive coding competitions like @codeforces. An exciting leap in AI problem-solving capabilities, combining many advances in machine learning! Read more: https://t.co/yaXfMWtEfe 1/
21
95
1K
Hey, ML/PL enthusiasts! Looking for some "light" reading for the holiday break? FnT just published our survey on "Neurosymbolic Programming", written jointly with @swarat, Kevin Ellis, @rishabhs, Armando Solar-Lezama, and @yisongyue. https://t.co/bP4di3sJpP
5
45
273