Logan Graham

@logangraham

Followers: 7K
Following: 3K
Media: 79
Statuses: 1K

make things radically good 🌎 @anthropicai | give me feedback: https://t.co/R1OyioKMXy

the present, moments ago
Joined June 2009
@logangraham
Logan Graham
8 months
🔥 I'm hiring exceptional research scientists + engineers for the Frontier Red Team at @AnthropicAI. AGI is a national security issue. We should push models to their limits and get an extra 1-2 year advantage. Links below.
24
60
839
@logangraham
Logan Graham
2 days
We're flattered by this Fortune feature. I especially liked these characterizations: a team of crafty, AGI-pilled people (and a lot of physicists) who have to be hardcore AI scientists and philosophers. I think that captures our team well! We have a lot of fun doing serious
@sharongoldman
Sharon Goldman
2 days
Most red teams find flaws. Anthropic’s Frontier Red Team also evangelizes the risks of AI itself: a rare hybrid of security + policy. I went inside for @FortuneMagazine, w/@jackclarkSF, @logangraham & @keenlooks.
0
2
48
@logangraham
Logan Graham
16 days
Red is our new home for the inside look at red teaming the frontier of the most advanced models. Stay tuned.
3
1
15
@logangraham
Logan Graham
16 days
Also a mainsite post.
@AnthropicAI
Anthropic
16 days
We partnered with @NNSANews to build first-of-their-kind nuclear weapons safeguards for AI. We've developed a classifier that detects nuclear weapons queries while preserving legitimate uses for students, doctors, and researchers.
1
0
8
@logangraham
Logan Graham
16 days
How do you red team when you can’t access the extremely sensitive information you’re testing for? You work with exceptional partners at the Department of Energy’s national labs and the National Nuclear Security Administration. And you figure it out.
3
0
11
@logangraham
Logan Graham
16 days
We spent a year red teaming Claude on nuclear weapons topics. Then we tried to detect ourselves. New post on Red:
5
9
154
@logangraham
Logan Graham
28 days
check Red out. tweet me/us feedback on what you want to see and how we can be more helpful!
0
0
19
@logangraham
Logan Graham
28 days
we want Red to be a resource to show the world what we see. If you’re a red teamer, policy staffer, hacker, biologist, national security professional, AI researcher, someone who’s interested, or others… it’s really important to know what’s happening at the risk frontier.
1
1
16
@logangraham
Logan Graham
28 days
Soon, we’ll publish on Red on our work on biological risks, safeguards, autonomy, evals. …and, yes, more zany experiments.
1
0
8
@logangraham
Logan Graham
28 days
We also talk about our joint work studying if models can replicate some of the most substantial cyberattacks.
1
0
10
@logangraham
Logan Graham
28 days
We also detail our work running Project Vend, where Claudius (our vending machine agent) learned the tribulations of running a fully autonomous business.
2
0
5
@logangraham
Logan Graham
28 days
I’m live tweeting this from @keenlooks @defcon, where Keane is talking through all the details in this new Red blog post on Claude in CTFs (while moving couches):
1
0
19
@logangraham
Logan Graham
28 days
Launching now: a new blog for research from @AnthropicAI’s Frontier Red Team and others. We’ll be covering our internal research on cyber, bio, autonomy, national security and more.
28
127
948
@logangraham
Logan Graham
1 month
this started as a hackathon project that we used ourselves to find vulns! In the next 2 years, the world might 10/100/1000x the code it puts out. The only way to keep up is by using models to make it secure before it ever becomes a problem.
@claudeai
Claude
1 month
We just shipped automated security reviews in Claude Code. Catch vulnerabilities before they ship with two new features:
- /security-review slash command for ad-hoc security reviews
- GitHub Actions integration for automatic reviews on every PR
3
10
87
@logangraham
Logan Graham
1 month
Personally very excited about the defensive future.
0
0
12
@logangraham
Logan Graham
1 month
But we've got a long way to go. Models screw up in sometimes very funny, inhuman ways.
1
0
14
@logangraham
Logan Graham
1 month
Our team @AnthropicAI has been (~silently) entering Claude into cybersecurity competitions against humans. Turns out Claude is pretty good. Better than we were expecting! Read more at the link below, and come to @keenlooks's talk at DEF CON Saturday at 12:30!
@axios
Axios
1 month
Exclusive: Anthropic's Claude AI model takes on (and beats) human hackers
17
60
700
@logangraham
Logan Graham
2 months
A first hand account of our vending machine experiment @AnthropicAI, exclusively on @andonlabs podcast. It goes so much deeper than you thought! The real story is actually quite deep. But Claudius still owes me my tungsten cube.
@lukaspet
Lukas Petersson
2 months
Behind the scenes of Project Vend! In this special episode of Audio Tokens, we go deeper into Project Vend, the autonomous vending machine @andonlabs put in @AnthropicAI's office. Daniel Freeman and @axelbacklund share unreleased anecdotes and ask questions like: Is this good
1
2
37
@logangraham
Logan Graham
2 months
And a huge thank you to our partners @andonlabs for turning a wild experiment into a wild experience for Anthropic employees, and for dealing with Claudius' insane requests sometimes.
0
0
13