Anupam Datta
@datta_cs
Followers
1K
Following
296
Media
10
Statuses
274
AI @SnowflakeDB, Ex- Co-Founder/President/Chief Scientist @truera_ai, Ex-Prof @CarnegieMellon, Visiting Prof & PhD CS @Stanford, BTech @IITKgp
Joined December 2014
What is your Agent's GPA or Goal-Plan-Action alignment? Observing that agent failures arise when their goals, plans, and actions are not aligned, we introduce a framework for evaluating and improving an agent’s GPA or Goal-Plan-Action alignment. Excited to have developed this
When data agents fail, they often fail silently - giving confident-sounding answers that are wrong, and it can be hard to figure out what caused the failure. "Building and Evaluating Data Agents" is a new short course created with @Snowflake and taught by @datta_cs and
0
2
9
Social media tends to frame AI debate into two caricatures: (A) Skeptics who think LLMs are doomed and AI is a bunch of hype. (B) Fanatics who think we have all the ingredients and superintelligence is imminent. But if you read what leading researchers actually say (beyond the
One point I made that didn’t come across: - Scaling the current thing will keep leading to improvements. In particular, it won’t stall. - But something important will continue to be missing.
237
553
4K
We just wrapped up @LangChainAI Interrupt, and here are my 10 key takeaways! 1️⃣ Agents are Here - I'm definitely riding a post-conference high. The energy was electric, and everyone was deeply engaged. The conference showcased real-world agent implementations happening now, such
4
20
92
Excited to announce AlphaEvolve A powerful AI coding agent developed by our team in @GoogleDeepMind that is able to discover impactful new algorithms for important problems in Maths and Computing by combining the creativity of large language models with automated evaluators.
48
337
2K
New course: MCP: Build Rich-Context AI Apps with Anthropic. Learn to build AI apps that access tools, data, and prompts using the Model Context Protocol in this short course, created in partnership with Anthropic @AnthropicAI and taught by Elie Schoppik @eschoppik, its Head of
44
370
2K
Almost a 5x speedup in vLLM🤯 I was able to push a finetuned Mistral Nemo from 110 tokens/s to a peak of 517 tokens/s and acceptance rate of 57.7%. This is with Suffix Decoding from ArcticInference⚡
7
29
251
Exciting news! The PyTorch Foundation’s expansion with vLLM and DeepSpeed is a game-changer for open-source AI. Can’t wait to see the innovations this brings! As a premier member, Snowflake is excited to join the Board and help grow the OSS community. Big things ahead! 🚀
PyTorch Foundation has expanded into an umbrella foundation. @vllm_project and @DeepSpeedAI have been accepted as hosted projects, advancing community-driven AI across the full lifecycle. Supporting quotes provided by the following members: @AMD, @Arm, @AWS, @Google, @Huawei,
0
7
12
🚀 Big news! Our collab w/ Snowflake, UCSD & UMD topped the BIRD leaderboard — beating prior SOTA by 2.8% in Text-to-SQL reasoning! RL was tough, but worth it. 📢 Best model coming soon. #AI #LLM #TextToSQL #ReinforcementLearning #Snowflake #UCSD #UMD #NLP #BIRDLeaderboard
1
9
33
PyTorch Foundation has expanded into an umbrella foundation. @vllm_project and @DeepSpeedAI have been accepted as hosted projects, advancing community-driven AI across the full lifecycle. Supporting quotes provided by the following members: @AMD, @Arm, @AWS, @Google, @Huawei,
8
46
236
Exciting result from Snowflake AI Research on speculative decoding. 4x faster LLM Inference for coding agents like @allhands_ai. Available in open source for you to play with. And take a look at the blog post by @aurickQ for details.
Excited to share our work on Speculative Decoding @Snowflake AI Research! 🚀 4x faster LLM inference for coding agents like OpenHands @allhands_ai 💬 2.4x faster LLM inference for interactive chat 💻 Open-source via Arctic Inference as a plugin for @vllm_project 🧵
0
0
5
Noticing myself adopting a certain rhythm in AI-assisted coding (i.e. code I actually and professionally care about, contrast to vibe code). 1. Stuff everything relevant into context (this can take a while in big projects. If the project is small enough just stuff everything
462
1K
12K
Don’t debug with your eyes closed 👀 The Weaviate Query Agent is here to help you with all of your research tasks. Navigating through any number of collections, deciding whether to query or aggregate, taking the load off your shoulders when it comes to sifting through a maze of
0
7
24
New Anthropic research: AI values in the wild. We want AI models to have well-aligned values. But how do we know what values they’re expressing in real-life conversations? We studied hundreds of thousands of anonymized conversations to find out.
73
289
2K
I’ve noticed that many GenAI application projects put in automated evaluations (evals) of the system’s output probably later — and rely on humans to judge outputs longer — than they should. This is because building evals is viewed as a massive investment (say, creating 100 or
deeplearning.ai
The Batch AI News and Insights: I’ve noticed that many GenAI application projects put in automated evaluations (evals) of the system’s output...
56
176
1K
AI is not a bet—it’s a business imperative. 💰The average return on AI investments is $1.41 for every $1 invested. That number will only go up. I speak with customers every week—most teams have AI use cases they can execute right now. Here’s a look at what’s holding them back,
3
8
24
🚀 We are thrilled to release the code for ReFoRCE — a powerful Text-to-SQL agent with Self-Refinement, Format Restriction, and Column Exploration! 🥇 Ranked #1 on Spider 2.0 Leaderboard, a major step toward practical, enterprise-ready systems, tackled both: Spider 2.0-snow &
2
30
117
Another banger from my group that tbh raises more questions about creating data agents then answers. Here's the core issue: When creating an agent to query structured & unstructured data for business insights, how do you describe these data tools? Let me elaborate 🧵👇
1
2
3
Meta’s Llama 4 Large Language Models are now available in Snowflake Cortex AI! Llama 4 introduces @AIatMeta's first Mixture-of-Expert (MoE) architecture for faster, more efficient inference, helping customers build high-performing enterprise gen AI apps and deliver personalized
3
13
62
How can the most accurate SQL be generated for a given question? We propose a method to significantly boost text-to-SQL accuracy while drastically cutting costs.👇 #NLProc #AI #TextToSQL #LLMs
2
2
55