Debadeepta Dey @debadeepta X Profile

Debadeepta Dey

@debadeepta

Followers

2K

Following

19K

Media

15

Statuses

1K

Distinguished Researcher, DataRobot | ex MSR, CMU

Kenmore, WA

Joined July 2011

Don't wanna be here? Send us removal request.

Debadeepta Dey

@debadeepta

3 months

1️⃣We are excited to open-source syftr: a powerful tool for automatically finding Pareto-optimal generative AI flows! syftr searches a large search space of agentic and non-agentic flows to surface optimal tradeoffs between accuracy, cost and latency. 🧵

1

7

40

Debadeepta Dey

@debadeepta

1 month

RT @WenSun1: Does RL actually learn positively under random rewards when optimizing Qwen on MATH? Is Qwen really that magical such that eve….

0

14

0

Debadeepta Dey

@debadeepta

3 months

RT @sytelus: A different and interesting work from my ex-colleague Dey: How do you generate Pareto frontier for the agentic workflow? . Man….

0

1

0

Debadeepta Dey

@debadeepta

3 months

RT @roma_glushko: ✨Meet syftr, a new OSS framework to find the best RAG workflows (both agentic and not) balancing cost/latency/accuracy us….

0

3

0

Debadeepta Dey

@debadeepta

3 months

6️⃣Want to get involved?.📖 Technical blog post and full paper (to appear at @automl_conf ). 💻 Try syftr 🙌 Contribute via PRs.

github.com

syftr is an agent optimizer that helps you find the best agentic workflows for your budget. - datarobot/syftr

0

1

Debadeepta Dey

@debadeepta

3 months

5️⃣syftr is made possible thanks to:.Ray for distributed search orchestration. @anyscalecompute .LlamaIndex for building advanced workflows. @llama_index .HuggingFace Datasets for fast dataset interfaces. @huggingface . Starting with question-answering and actively expanding tasks.

1

0

1

Debadeepta Dey

@debadeepta

3 months

4️⃣Why syftr?.Models are part of complex workflows in the real world, and syftr evaluates them within those contexts. syftr complements benchmarks which evaluate intrinsic capabilities of models. We're developing in the open to build trust and share the best combinations.

1

0

Debadeepta Dey

@debadeepta

3 months

3️⃣Here's what syftr does:.🔍 Takes your grounding dataset and question-answer pairs to find the best workflows. 🎯 Uses multi-objective Bayesian Optimization to identify Pareto-optimal solutions. ⚙️ You choose the ideal workflow for your application based on the Pareto-frontier.

1

0

Debadeepta Dey

@debadeepta

3 months

2️⃣Are you struggling with questions like:. 🤔 "Which synthesizing LLM and embedding model to use?".🤖 "Should I adopt the latest agentic flow?".📊 "How do I balance accuracy, cost and latency in my AI workflows?". That's why we created syftr.

1

0

Debadeepta Dey

@debadeepta

10 months

RT @sytelus: Another important development for achieving o1 like test-time compute scaling is Entropix by @_xjdr. Both of these ideas coinc….

0

2

0

Debadeepta Dey

@debadeepta

11 months

We are growing our fundamental AI research team at DataRobot and are looking for strong researchers with proven publication track record in deep learning in general and generative AI in particular. Please apply at: #GenAI #DeepLearning.

0

5

Debadeepta Dey

@debadeepta

1 year

RT @crwhite_ml: 🚨Llama 3.1 405B eval just dropped🚨.🥇 in instruction following.🥈 in reasoning.On par with GPT-4o in math and coding.It’s a g….

0

19

0

Debadeepta Dey

@debadeepta

1 year

RT @crwhite_ml: OpenAI strikes back 💫 GPT-4o-mini is a remarkable model for its price! Check out its performance on .

0

9

0

Debadeepta Dey

@debadeepta

1 year

This is why we need private benchmarks or ones like which change fast to prevent gaming.

Nathan Lambert

@natolambert

1 year

lol gemma instruct is specifically tuned to argmax chatbotarena??? finally someone did it

0

6

Debadeepta Dey

@debadeepta

1 year

RT @chinganc_rl: Super excited to announce our cool project, Trace, on optimizing general AI systems, using LLMs.😎. Trace is a new AutoDiff….

0

26

0

Debadeepta Dey

@debadeepta

1 year

RT @crwhite_ml: Wow! 😮 claude-3.5 is an extremely impressive overall model! It achieves the top score in **every category**, and substantia….

0

126

0

Debadeepta Dey

@debadeepta

1 year

RT @micahgoldblum: 🚨 Announcing LiveBench, a challenging new general-purpose live LLM benchmark! 🚨.Thanks @crwhite_ml and @SpamuelDooley fo….

0

77

0

Debadeepta Dey

@debadeepta

1 year

RT @WenSun1: REBEL is one of the simplest algorithms and implementation out there that can achieve this performance, e.g., no online GPT4 q….

0

4

0

Debadeepta Dey

@debadeepta

1 year

RT @joao_gante: New sampling strategy dropped in 🤗 transformers -- Min P sampling 🔥. Are you tired of having `top_k` arbitrarily discarding….

0

5

0

Debadeepta Dey

@debadeepta

1 year

RT @g_k_swamy: My advisor Drew recently gave a lecture on the past, present (i.e. my work!), and future of imitation learning and how it ap….

0

11

0