benthamite🔸
@benthamite_
Followers
418
Following
45K
Media
94
Statuses
557
push-pin = poetry. Effective Altruist.
Joined January 2020
I like that OAI has been discussing things in public and feel more positively towards them than if they had just made their original post
openai needs to understand one simple fact, posting more about a situation over & over, including AMAs does not create clarity. it creates even more confusion & mistrust. like you're trying desperately to be understood. if you got it wrong the first time or there are problems,
0
0
2
#1 post in /r/OpenAI is titled "The end of GPT". 7/10 top posts are about cancelling accounts/anti-OAI. 10/10 /r/anthropic posts are pro-A\, 3/10 about people switching from oai
1
2
97
"I wish we had more empirical evidence re: Holden's claim 'there are companies not terribly far behind the frontier that would see any unilateral pause or slowdown as an opportunity rather than a warning'"
0
0
13
I stand with Sec. Hegseth. I too would rather invoke the DPA than do a project without claude
9
77
2K
When you run claude --dangerously-skip-permissions and it does something dangerous without asking permission
0
0
3
I’m launching a $7M RFP for humane fish slaughter tech via @coeff_giving Over 100 billion farmed fish are slaughtered yearly. We think only ~0.5% are reliably stunned beforehand. For the trillion+ wild-caught fish, most suffocate slowly in air or low-oxygen water. 🧵 🎥@WeAnimals
3
8
23
i had no idea the EA Animal Welfare Fund was so big. Kudos to everyone involved!
1
1
3
Just a heads up: I work in EA so critiques of my employer are minimum 200 pages (excluding footnotes). To keep things simple, I won't be commenting on SSA, EDT, or whether there is enough protein at EAG lunch
Just a heads up: I signed a permanent non-disparagement agreement when I joined Goodfire. To keep things simple, I won't be commenting on the company, its research, etc.
1
1
57
How I feel listening to people talk about how it's weird that they don't read the code they get from LLMs.
0
0
3
OpenAI released a substantially expanded Whistleblowing Policy – from 3 to 13 pages. Most comprehensive among frontier AI companies. AIWI finds significant progress: 8 of 13 AIWI recommendations addressed, including 3 of 5 critical items. ⚠️ Critical for AI employees:
0
4
12
The year couldn't have started better. For the first time in history, a country will effectively stop growing chickens unnaturally fast - a horrific form of animal cruelty. A massive step forward, announced by the industry itself! Now, it's time for others to do the right thing.
🎉 Norway’s largest meat producer and the industry association KLF announced a plan to phase out fast-growing chicken breeds by the end of 2027. Norway will become the world's first country to stop raising fast-growing chickens. 🐥This is a historic victory for chickens not only
0
10
24
Anthropic also has a team for educating those with the highest raw cognitive ability though. It's named "post-training."
"The KPIs will be students reached in underserved communities + learning outcomes" These are the wrong KPIs, and this misguided obsession with equalizing outcomes is the reason that education has been stagnant for a generation. Our society has a ideological fixation on helping
0
0
1
A righteous man's flight was cancelled. Everyone felt sorry for him. But his father spoke: "Who knows if that won't bring you good luck?" Because his replacement flight was empty, his upgrade came through. Everyone congratulated him. But his father spoke: "Who knows if that
0
0
0
We reviewed Anthropic’s unredacted report and agreed with its assessment of sabotage risks. We want to highlight the greater access & transparency into its redactions provided, which represent a major improvement in how developers engage with external reviewers. Reflections: 🧵
🌱⚠️ weeds-ey but important milestone ⚠️🌱 This is a first concrete example of the kind of analysis, reporting, and accountability that we’re aiming for as part of our Responsible Scaling Policy commitments on misalignment.
1
15
107
Reminder: @METR_Evals is hiring! We have an ambitious research agenda ahead of us, are well-funded, and need senior researchers! We're extending our work on time-horizons, productivity/"uplift", and agent monitoring, and have great research access across industry/gov/academia!
4
20
140
apparently they wanted to use [lastname]@.stripe.com for company emails, but there were too many address collisons
20
47
3K
BREAKING: Governor @GavinNewsom just signed our groundbreaking AI bill, SB 53, to promote AI innovation (creating a public cloud called CalCompute), require transparency around AI lab safety practices & protect whistleblowers at AI labs who report risk of catastrophic harm. 🧵
71
48
300