Shashank Agarwal @itsshashank X Profile

Shashank Agarwal

@itsshashank

Followers

4K

Following

5K

Media

416

Statuses

10K

Building https://t.co/8yZmgov4SW of the Internet 🚀 Prev: MagicAPI, AWS, Levity, Activeloop, Pipfeed, Expedia, Hopdata. Weekly thoughts at: https://t.co/CTuxg0X4Hh

https://t.co/76X2FQg5qR

Bengaluru, Karnataka, India

Joined September 2008

Don't wanna be here? Send us removal request.

Shashank Agarwal

@itsshashank

2 months

The solution? Trace-based evaluation. Capture everything the agent does. Analyze the complete journey. This is what separates production-ready agents from prototypes. Learn more: https://t.co/duGOJvNFjS What's your experience been with agent evaluation?

noveum.ai

Real-time monitoring, tracing, and analytics for AI agents in production. 73+ evaluation scorers, multi-agent support, cost tracking.

0

1

Shashank Agarwal

@itsshashank

2 months

This is why so many AI agents fail in production. Teams are monitoring like they're monitoring ML models. They're blind to what's actually happening inside the agent. It's like driving a car while only looking at the speedometer.

1

0

1

Shashank Agarwal

@itsshashank

2 months

With ML models, you measure outputs. With agents, you need to measure the entire TRAJECTORY. Every decision point. Every reasoning step. Every tool call. Because an agent can fail at any point in its journey, not just at the end.

1

0

1

Shashank Agarwal

@itsshashank

2 months

When I was at AWS, we learned this the hard way. We built monitoring systems for ML models that worked great. Then we tried to apply the same logic to agents. It failed spectacularly. Why? Because we were measuring the wrong things.

1

0

1

Shashank Agarwal

@itsshashank

2 months

An ML model is straightforward: given input X, predict Y. You can measure accuracy, precision, recall. Done. But an AI agent is different. It's a system that: •Reasons about a problem •Makes decisions •Takes actions •Learns from feedback It's fundamentally more complex.

1

0

1

Shashank Agarwal

@itsshashank

2 months

I've been thinking about how we evaluate AI agents. Most teams treat them like ML models: input → output → score. But that's not how agents work. They're decision-making systems, not prediction systems. This distinction matters more than you think. Let me share what I've

1

0

2

Kalyani

@thisismekalyani

3 months

Why 95% of AI Agents Fail in Production (And How to Fix It) | @itsshashank from Noveum AI

1

3

Pulkit

@_pulkitxm

4 months

Thanks @coderabbitai @aravindputrevu for sending over these cool stuffs!! things like these show how much you value your community and contributors🙌

Pulkit

@_pulkitxm

4 months

My experience with @coderabbitai -> - new pr - adds up >50 comments - pushed the fixes - adds next 15 comments - again push the fixes - add another set of comments (this is repeated at least 3-4 times😭)

3

1

22

Pulkit

@_pulkitxm

4 months

We are hiring for two role -> Python AI/ML Engineer & FullStack Next.js Engineer both are full time roles, completely remote (more details in comments)

60

21

534

API Market

@apimarket_

4 months

💡 “Your first paying customer matters more than 100 free users.” Our CEO, Shashank Agarwal, shares the #1 startup mistake to avoid 🚀 Too many founders chase growth before proving people will actually pay. The real validation? That very first customer who trusts your

0

1

3

API Market

@apimarket_

5 months

Your prompts just grew arms and legs. We turned every API on https://t.co/Ox5qM9iQdv into an MCP tool—so Claude and Cursor can use them directly. Not just for devs. If you’re a PM, founder, analyst, or designer, you can now run real workflows from a chat window. Same 4 steps for

0

2

4

API Market

@apimarket_

4 months

🎭 Next-Gen Figurine Design with Ultra-Fast AI (Nano-Banana) No more waiting for slow renders — bring your figurine concepts to life instantly with Google’s Nano-Banana powered API. 🚀 Why it matters? ✅ Ultra-fast image processing — no delays ✅ High-precision figurine edits &

1

2

Shashank Agarwal

@itsshashank

5 months

To all -> Indian Software Engineers!! Please stop cheating in your Coding Interview!!!

0

API Market

@apimarket_

5 months

⚡ Detect Anything Instantly with Real-Time Object Detection API (YOLOv8s Worldv2) No more missed details in images or videos! From smart surveillance to retail analytics, traffic monitoring, or warehouse automation — this API delivers blazing-fast, high-accuracy object

1

4

Shashank Agarwal

@itsshashank

5 months

AI or Trading, it pays to be good at Math!!

0

1

API Market

@apimarket_

5 months

Bring your images to life with FaceSwap Image V3 API 🎭 Instantly create high-resolution, ultra-realistic face swaps — no Photoshop, no manual work. Just plug into our simple REST API and start transforming images in seconds. Ideal for content creators, marketers, game

1

3

6

Shashank Agarwal

@itsshashank

6 months

🤝 NovaEval is open source, and I need YOUR help! High-priority areas: • 🧪 Unit tests (currently 23% coverage) • 📚 Real-world examples • 📝 Documentation & guides • 🔍 RAG evaluation metrics • 🤖 Agent evaluation frameworks First-time contributors welcome!

1

2

6

Shashank Agarwal

@itsshashank

6 months

Just like how every business had to get a computer, every business will get AI agents.

0

2

Shashank Agarwal

@itsshashank

6 months

https://t.co/CfWiBDo19Y

0

2

Shashank Agarwal

@itsshashank

6 months

🤝 NovaEval is open source, and I need YOUR help! High-priority areas: • 🧪 Unit tests (currently 23% coverage) • 📚 Real-world examples • 📝 Documentation & guides • 🔍 RAG evaluation metrics • 🤖 Agent evaluation frameworks First-time contributors welcome!

1

2

6