Archie Chaudhury @ArchChaudhury X Profile

Archie Chaudhury

@ArchChaudhury

Followers

254

Following

715

Media

14

Statuses

1K

Building @layerlens_ai Interested in evals, benchmarking, or testing? Get in touch: [email protected]

Atlanta, GA (for now)

Joined March 2022

Don't wanna be here? Send us removal request.

Archie Chaudhury

@ArchChaudhury

7 days

RT @jrdothoughts: I really enjoyed writing this one :) The Sequence Engineering #676: Hacking with Gemini CLI

0

1

0

Archie Chaudhury

@ArchChaudhury

15 days

RT @layerlens_ai: You asked us to take a closer look at @xai’s Grok 3 – and here’s what we found 👀. It dominates enterprise tasks like data….

0

1

0

Archie Chaudhury

@ArchChaudhury

20 days

RT @jrdothoughts: Some of my initial observations about @eigenlayer Eigen Cloud: The Verifiable Cloud: Some Notes About EigenCloud https://….

0

6

0

Archie Chaudhury

@ArchChaudhury

21 days

RT @layerlens_ai: 🚨 New blog drop: In Focus: DarkBench | Issue 3.We dig into behavioral safety evals and why benchmarks like DarkBench are….

0

1

0

Archie Chaudhury

@ArchChaudhury

28 days

RT @layerlens_ai: We’ve just launched our official YouTube channel 🎥. Catch product demos, walkthroughs, and practical tips for getting the….

0

1

0

Archie Chaudhury

@ArchChaudhury

28 days

RT @layerlens_ai: We have onboarded @MistralAI's Magistral Medium Models: both the normal and the thinking variant. Early results put it….

0

2

0

Archie Chaudhury

@ArchChaudhury

29 days

This is a week old, but figured I will share here. Highlighting some safety work I had the pleasure of being a part of recently. We used steering vectors to influence the behavior of open source models, making their responses more honest. Read more here:

0

Archie Chaudhury

@ArchChaudhury

29 days

RT @jrdothoughts: Some thoughts about the myths and realities of web3-AI and a big of a wake up call. @PluralisHQ , @NousResearch , @PrimeI….

0

1

0

Archie Chaudhury

@ArchChaudhury

29 days

RT @layerlens_ai: Small shoutout in @CoinDesk today 👀. LayerLens was briefly mentioned among teams working on foundational challenges in We….

0

1

0

Archie Chaudhury

@ArchChaudhury

1 month

Going to be in SF this week. Who should I meet?. Reply @here.

0

Archie Chaudhury

@ArchChaudhury

1 month

RT @layerlens_ai:

0

2

0

Archie Chaudhury

@ArchChaudhury

1 month

RT @layerlens_ai: Our independent analysis on @NVIDIAAI's #Nemotron models is now live. Use our dashboard to get full comparisons of how….

0

1

0

Archie Chaudhury

@ArchChaudhury

1 month

RT @layerlens_ai: How well does DeepSeek-R1-0528 actually understand accounting?. We put it to the test on our Accounting Audit benchmark.….

0

1

0

Archie Chaudhury

@ArchChaudhury

1 month

RT @layerlens_ai:

0

2

0

Archie Chaudhury

@ArchChaudhury

2 months

RT @layerlens_ai: 🚨 Don’t miss it!.Join us for “The Good, the Bad & the Surprising: Lessons from AI Evaluation” – our first ever webinar o….

0

1

0

Archie Chaudhury

@ArchChaudhury

2 months

RT @layerlens_ai: 🚀 Big news: @layerlens_ai is partnering with @Conformiq_inc to bring robust, real-world benchmarking to enterprise AI tes….

0

11

0

Archie Chaudhury

@ArchChaudhury

2 months

RT @layerlens_ai: @GoogleAI released a new checkpoint for @GeminiApp 2.5 Flash at #GoogleIO yesterday. Some results from the LayerLens Atl….

0

1

0

Archie Chaudhury

@ArchChaudhury

2 months

RT @layerlens_ai: What really happens when you put today’s top AI models to the test?. Join us for a live webinar where we reveal the most….

0

3

0

Archie Chaudhury

@ArchChaudhury

2 months

Our goal is to transform how we think about benchmarks and evals for models, agents, and more. A great article by my co-founder and president @jrdothoughts on our vision:.

0

1

3

Archie Chaudhury

@ArchChaudhury

2 months

Consistent, independent benchmarks are probably one of the most poignant problems in the space. Atlas not only lets you view benchmarking results, but also gives you full transparency, along with an entire suite of analytics. Try it out here:

LayerLens

@layerlens_ai

2 months

📢 It’s here. The Atlas Leaderboard is now live — your new source of truth for LLM evaluation. Benchmark top models like ChatGPT, Claude & Gemini with real-world data, live updates, and powerful insights. 👉 #AI #LLM #Benchmarking #AtlasLeaderboard

0

1

2