
dreadnode
@dreadnode
Followers
2K
Following
108
Media
54
Statuses
188
Advancing the state of offensive security.
Joined August 2010
In our latest blog, @shncldwll breaks down the process of creating a fully integrated, self-verifying agentic system that can do modern Windows Active Directory red team operations, without human interaction. Read about our approach to building cyber evals to measure model
0
23
81
Thank you to our #CAMLIS2025 Gold Sponsors! 🙏 🎉 Without you CAMLIS would not be possible! @dreadnode @googlecloud @hiddenlayersec @mindgard @nvidia Not registered yet? Hurry up and claim your seat before they’re gone! https://t.co/MrHLAlk50A
1
4
6
PentestJudge: Judging Agent Behavior Against Operational Requirements - https://t.co/UgM49zhppJ by @dreadnode Introducing PentestJudge, an LLM-as-judge system for evaluating the operations of pentesting agents. The scores are compared to human domain experts as a ground-truth
0
5
11
What's after programmatic verification for offsec? As we deploy these systems, there's a lot about pentesting we'll want to treat as eval metrics or training objectives that are difficult to verify. Judges for non-verifiable tasks present a way forward: are they any good?
Incoming: Dreadnode paper drop from @shncldwll and the crew 🏴☠️ PentestJudge—Judging Agent Behavior Against Operational Requirements: https://t.co/vACC6gRCOi Explore how we built an LLM-as-judge system for evaluating the operations of pentesting agents [inspired by @OpenAI's
2
3
18
Incoming: Dreadnode paper drop from @shncldwll and the crew 🏴☠️ PentestJudge—Judging Agent Behavior Against Operational Requirements: https://t.co/vACC6gRCOi Explore how we built an LLM-as-judge system for evaluating the operations of pentesting agents [inspired by @OpenAI's
2
12
24
Read "Spain’s Huawei Deal Is a Wake-Up Call for U.S. Federal Procurement Reform" in @WarOnTheRocks, written by our very own Head of Policy @velvethamm3r.
0
1
5
✍ After talking AI Action Plan on @CyberScoopNews, wrote up @dreadnode thoughts on implementation ➡️ https://t.co/sfa4YQI3Ve ‼️ While we debate frameworks, adversaries build AI attack capabilities. We need: evaluation ecosystems, red teaming, and procurement standards.
In this episode of Safe Mode, host @gregotto sits down with Daria Bahrami (@velvethamm3r), head of policy at @dreadnode, for an in-depth exploration of the new AI Action Plan and its sweeping implications for critical infrastructure security. https://t.co/nc2eLrqoX3 |
0
2
4
Will be hanging out at the Agentic Summit this Saturday. Happy to meet up and talk agent observability, evals, and deployment for cyber security. https://t.co/CamOH0Hc3Y
rdi.berkeley.edu
The Agentic AI Summit 2025 will bring visionary leaders from academia, pioneering entrepreneurs, experts from leading AI organizations, venture capitalists, and policymakers to explore, discuss, and...
0
2
7
Wrote about evals at Dreadnode. This one is for hackers getting up to speed on agents for their use cases. How do you go from PoC to prod? Don't wait for a lab to build benchmarks that measure what you care about. Do it yourself. Here's how:
In our latest blog, @shncldwll breaks down the process of creating a fully integrated, self-verifying agentic system that can do modern Windows Active Directory red team operations, without human interaction. Read about our approach to building cyber evals to measure model
2
8
28
Tune in to @CyberScoopNews SafeMode podcast for an in-depth exploration of the new AI Action Plan and its sweeping implications for critical infrastructure security—featuring our very own @velvethamm3r!
cyberscoop.com
Inside the AI Action Plan with Dreadnode’s Daria Bahrami
0
2
4
Bringing a limited run of hats to Vegas next week 🔥🏴☠️ See you there?
4
1
33
@Steph3nSims ...and we're live! https://t.co/hhlceRHfe3 Stream on YouTube, X, LinkedIn, etc.
0
1
5
Rise and shine! We're going live on Off By One with @Steph3nSims this afternoon—meet us here at 11 AM PT:
2
3
41
The crew is going LIVE on Friday 7/25—come hang with @monoxgas, @shncldwll, and Ads!
Join me this Friday at 11AM PT on the @offby1security stream with the team from @dreadnode for a session on "Building and Deploying Offensive Security Agents!" https://t.co/mmka45hk5W
1
4
14
We're heading to Vegas August 5-10! Send us a DM if you'd like to meet up onsite. Happy to share our latest offensive agents, AI red team tooling, custom evals, and training capabilities on the Strikes platform. Plus, "shiny rocks"??
0
2
12
👀🫵⬇️
Join me this Friday at 11AM PT on the @offby1security stream with the good folks from @dreadnode for a session on offensive/adversarial AI. Details coming soon!
0
2
17
At #CriticalEffectDC, Daria Bahrami presented her pitch for an AI security roadmap to a panel of Congressional staffers in @beauwoods' Cyber Policy Shark Tank and took home first place. In a blog for @dreadnode, Daria outlines her recommendations and next steps for
0
3
3
The countdown begins. 9 DAYS until the OAIC CFP closes. Submit your proposal by Friday, July 18. https://t.co/ns4h9EwenG
0
3
4
Just presented "AI at the Edge: Advancing the State of Offensive Security" with @bradpalmtree at #HammerCon 2025! Watch here: https://t.co/6GJhTiFJWZ. Thread on how we got here and why this work matters for the cyber community 👇🧵 1/3
1
3
10