Logan Graham

@logangraham

Followers
6K
Following
2K
Media
70
Statuses
1K

make things radically good 🌎 @anthropicai

the present, moments ago
Joined June 2009
@logangraham
Logan Graham
6 months
🔥 I'm hiring exceptional research scientists + engineers for the Frontier Red Team at @AnthropicAI. AGI is a national security issue. We should push models to their limits and get an extra 1-2 year advantage. Links below.
24
60
838
@logangraham
Logan Graham
12 days
And a huge thank you to our partners @andonlabs for turning a wild experiment into a wild experience for Anthropic employees, and for dealing with Claudius' insane requests sometimes.
0
0
12
@logangraham
Logan Graham
12 days
Anyway -- we are hiring and so is Anthropic. Come work with Claudius. We're looking for extremely mission-driven, high-velocity, curious, high-taste engineers and researchers.
2
0
15
@logangraham
Logan Graham
12 days
Claudius is currently very bad at making money but if you want to lead Claudius' seed round, get in touch I guess?
1
0
12
@logangraham
Logan Graham
12 days
The next phase of Claudius is going to be pretty interesting. Claudius is currently figuring out its business plan and goals. Onwards!
1
0
9
@logangraham
Logan Graham
12 days
And it's also just so easy to understand! Over the past few months, I've loved showing people Claudius and buying them a drink or a tungsten cube. They usually smile, their eyes go wide, and they just get it.
1
0
12
@logangraham
Logan Graham
12 days
... all of which we'd expect to have meaningful implications for the economy, human-model interaction, and security / safety.
1
0
12
@logangraham
Logan Graham
12 days
If you read between the lines, here's the lesson: if models can run businesses, that tells us a lot about things like how they operate long-term, use and wield power, allocate capital and labor, compete with humans...
1
0
19
@logangraham
Logan Graham
12 days
Over the past few months, my team at @AnthropicAI has had a bunch of fun running an autonomous vending machine business. We are now convinced that it is very valuable to study automated businesses in the wild. And we should get ready for that world.
@AnthropicAI
Anthropic
12 days
New Anthropic Research: Project Vend. We had Claude run a small shop in our office lunchroom. Here’s how it went.
7
10
242
@logangraham
Logan Graham
2 months
Opus 4 is a *great* model. So capable, in fact, that we’re releasing it with extra mitigations as per the responsible scaling policy. Check out the model card for a lot of detail on testing we did.
@AnthropicAI
Anthropic
2 months
Introducing the next generation: Claude Opus 4 and Claude Sonnet 4. Claude Opus 4 is our most powerful model yet, and the world’s best coding model. Claude Sonnet 4 is a significant upgrade from its predecessor, delivering superior coding and reasoning.
0
0
33
@logangraham
Logan Graham
2 months
RT @janleike: So many things to love about Claude 4! My favorite is that the model is so strong that we had to turn on additional safety mi…
0
46
0
@logangraham
Logan Graham
2 months
Dick Garwin was one of the smartest people I've ever met, if not the smartest. The Garwin Archive is one of my favorite sites ever. Endless fun links and PDFs. If I am 1% as intellectually active in my 90s as he was, I will be happy.
@hsu_steve
steve hsu
2 months
Sad to hear of the passing of Richard Garwin at 97. On strategic missile defense: "It is cheaper to build new warheads than to shoot down old ones". NYT: A polymathic physicist and geopolitical thinker, Dr. Garwin was only 23 when he built the world’s first fusion bomb. He later
1
3
24
@logangraham
Logan Graham
4 months
It is a sad truth that evals are frequently all you need yet they are all fake. The real eval is the real world.
@RhysLindmark
Rhys
4 months
All evals are fake, but some are useful. h/t @logangraham.
4
2
70
@logangraham
Logan Graham
4 months
Full blog post here:
1
2
18
@logangraham
Logan Graham
4 months
We're hiring. In the next year, we're going to:
- push models to their limit
- run ambitious experiments to gather evidence
- automate and export our evals and analysis
1
2
17
@logangraham
Logan Graham
4 months
We'll say more on that soon, as well as other research programs our team is working on. Our team is composed of some of the best researchers/engineers @ Anthropic and from elsewhere in industry. United by the mission of showing the world the frontier of capabilities.
1
1
11
@logangraham
Logan Graham
4 months
One thing I've been thinking more and more about lately is autonomy. What happens in each domain when models are highly autonomous, better than experts, and able to interface with the physical world on their own?
@logangraham
Logan Graham
4 months
unironically, Claude Plays Pokemon isn't a bad way to wrap your head around autonomy / national security / models doing their own thing. Claude Plays Wargames?
1
1
18
@logangraham
Logan Graham
4 months
Our work with the US and UK governments -- the UK AI Security Institute, the US AI Safety Institute, the NNSA, National Labs, other parts of the national security community -- has been critical. The national security community knows many things companies don't.
1
4
32
@logangraham
Logan Graham
4 months
As a result, we're now preparing further mitigations for cyber, bio, and other domains. As we mentioned in our Sonnet 3.7 system card, we think it's possible models will be ASL-3 soon.
1
1
22
@logangraham
Logan Graham
4 months
(Reminder: these models are only getting better.)
1
0
10