Rishabh Singh @rishabhs X Profile

Rishabh Singh

@rishabhs

Followers

992

Following

101

Media

5

Statuses

65

Research Lead @Databricks. Previously @Meta GenAI, Google Brain @GoogleAI, @MSFTResearch, @MIT_CSAIL @IITKgp

https://t.co/555o7qnX2T

Mountain View, CA

Joined June 2009

Don't wanna be here? Send us removal request.

Rishabh Singh

@rishabhs

5 years

Very excited about formula prediction being released in Google Sheets! A great collaboration between Google Sheets and Brain team.

9

77

513

Matei Zaharia

@matei_zaharia

19 days

I’m super excited about the launch of Genie Code! It extends the power of AI coding to agentic data work, answering questions 2-3x more accurately than coding agents and automatically engineering and monitoring high quality pipelines. It’s transformed my own work at Databricks!

Databricks

@databricks

19 days

Today we're announcing Genie Code, your autonomous AI partner for data. Genie Code is a state-of-the-art agent that lets data teams move from prompting a copilot to delegating real work: building pipelines, machine learning models, debugging failures, and shipping dashboards.

11

15

153

Rishabh Singh

@rishabhs

18 days

Super excited about the release of Genie Code, Databricks' state of the art Data agent. The AI Research team has been collaborating closely to push the boundaries of the agent’s performance, with lot more coming soon!

Databricks

@databricks

19 days

Today we're announcing Genie Code, your autonomous AI partner for data. Genie Code is a state-of-the-art agent that lets data teams move from prompting a copilot to delegating real work: building pipelines, machine learning models, debugging failures, and shipping dashboards.

2

6

35

Jonathan Frankle

@jefrankle

25 days

Meet KARL, an RL'd model for document-centric tasks at frontier quality and open source cost/speed. Great for @databricks customers and scientists (77-page tech report!) As usual, this isn't just one model - it's an RL assembly line to churn out models for us and our customers 🧵

9

46

241

Databricks

@databricks

3 months

Reliable enterprise agents require system-level reasoning when retrieving across heterogeneous knowledge sources. Traditional RAG often fails to consistently follow instructions, schemas, and constraints end to end. That’s why we’re presenting Instructed Retriever, a new

4

13

71

Jonathan Frankle

@jefrankle

3 months

I'm hiring interns for next summer at @databricks! Specifically on (1) empirical RL at scale on non-verifiable tasks and (2) enabling real people specify the behaviors they want out of AI (e.g., through evals) on highly complex tasks. 🧵

17

47

527

Databricks

@databricks

6 months

Big news: Databricks and @OpenAI are partnering to deliver powerful AI to the enterprise. OpenAI frontier models will now be available natively in Databricks. This means you can build, evaluate and scale production-grade AI apps and agents on your governed enterprise data,

6

38

192

Matei Zaharia

@matei_zaharia

6 months

Prompt optimization is becoming a powerful technique for improving AI that can even beat SFT! Here are some of our research results with GEPA at Databricks, in difficult Agent Bricks info extraction tasks. We can match the best models at 90x lower cost, or improve them by ~6%.

30

125

880

Ivan Zhou

@ivanzhouyq

6 months

Automated prompt optimization (GEPA) can push open-source models beyond frontier performance on enterprise tasks — at a fraction of the cost! 🔑 Key results from our research @DbrxMosaicAI: 1⃣ gpt-oss-120b + GEPA beats Claude Opus 4.1 on Information Extraction (+2.2 points) —

11

67

533

Databricks

@databricks

7 months

The future of data science is autonomous, collaborative, and faster than ever. That's why we're excited to introduce the Data Science Agent for Databricks Assistant, an autonomous partner that plans, executes, and self-corrects entire workflows in your Notebooks and SQL Editor.

0

8

35

Ali Ghodsi

@alighodsi

7 months

Databricks just signed a Series K term sheet at >$100B valuation to scale two flagship products: 🔥 Lakebase — serverless Postgres with true compute/storage separation 🧠 Agent Bricks — agentic framework with built-in reasoning guardrails for enterprise data

wsj.com

The data analytics company, fueled by the AI boom, was valued at $62 billion less than a year ago.

60

106

1K

Matei Zaharia

@matei_zaharia

8 months

Try out GEPA! Excited to see how it does on people's problems.

Lakshya A Agrawal

@LakshyAAAgrawal

8 months

Very excited to share that GEPA is now live on @DSPyOSS as dspy.GEPA! This is an early code release. We’re looking forward to community feedback, especially about any practical challenges in switching optimizers.

1

10

65

Michael Bendersky

@bemikelive

8 months

Since joining @databricks, our research team has been hard at work on Agent Bricks, a new product that helps enterprises develop state-of-the-art domain-specific agents. We are now releasing a research blog about Agent Learning from Human Feedback (ALHF)

databricks.com

Discover how Agent Learning from Human Feedback (ALHF) powers Databricks Agent Bricks, enabling AI agents like Knowledge Assistant to adapt to expert expectations with minimal natural language...

2

19

101

Jonathan Frankle

@jefrankle

8 months

More details in the blog: https://t.co/ZqmcWh6l2K This work was led by @DipendraMisra with contributions from many others. If you're interested in taking this for a spin yourself, sign up here:

docs.google.com

This form allows you to register for the current TAO and RLVR private preview at Databricks.

1

3

16

Jonathan Frankle

@jefrankle

8 months

RLVR isn't just for math and coding! At @databricks, it's impacting products and users across domains. One example: SQL Q&A. We hit the top of the BIRD single-model single-generation leaderboard with our standard TAO+RLVR recipe - the one rolling out in our Agent Bricks product.

3

15

108

Michael Bendersky

@bemikelive

9 months

This is a good opportunity to announce that I recently joined the research team at @databricks where I will be working alongside @jefrankle @rishabhs @matei_zaharia Erich Elsen, and many others on the hardest problems at the intersection of information retrieval and AI.

Jonathan Frankle

@jefrankle

9 months

I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵

2

6

39

Jonathan Frankle

@jefrankle

9 months

I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵

9

24

224

Matei Zaharia

@matei_zaharia

9 months

We're finding that what's needed in RL for enterprise tasks is pretty different than in foundation model training on math, code, etc. Catch @jefrankle and our team at ICML to talk about these problems!

Jonathan Frankle

@jefrankle

9 months

Properties of our problems: * Semi-verifiability. Can LLM judges productively augment RLVR? How clean must they be? * Intermediate rewards. Signals we can exploit to make harder tasks tractable. * Real traces. Tons of human traces for imitation learning or environment building.

1

5

45

Rishabh Singh

@rishabhs

9 months

I'm super excited to share that I recently joined the @databricks AI research team to help with AI for data science efforts. We are working on real-world AGI to help customers succeed on the Databricks platform. We are hiring, please join us in this exciting mission!

Jonathan Frankle

@jefrankle

9 months

I'm at ICML 🇨🇦 and I'm hiring at @databricks. Visit our booth if you're interested. My scientific focus: It's 1972 in AI, there's an AI crisis, Dijkstra isn't here to save us, and maybe RL can. Why Databricks? The long road to AGI is being paved here and we have the real evals 🧵

3

1

39

Yujia Li

@liyuajia

4 years

Excited to share the project #AlphaCode I’ve been working on for more than 2 years! Can’t believe we started before COVID is a thing and worked through this project mostly at home, with an amazing team!

Google DeepMind

@GoogleDeepMind

4 years

Introducing #AlphaCode: a system that can compete at average human level in competitive coding competitions like @codeforces. An exciting leap in AI problem-solving capabilities, combining many advances in machine learning! Read more: https://t.co/yaXfMWtEfe 1/

21

95

1K

🇺🇦 Alex Polozov

@Skiminok

4 years

Hey, ML/PL enthusiasts! Looking for some "light" reading for the holiday break? FnT just published our survey on "Neurosymbolic Programming", written jointly with @swarat, Kevin Ellis, @rishabhs, Armando Solar-Lezama, and @yisongyue. https://t.co/bP4di3sJpP

5

45

273