thcause41 Profile Banner
Kernel Profile
Kernel

@thcause41

Followers
415
Following
6K
Media
73
Statuses
2K

An ounce of prevention is worth a pound of cure

Joined December 2019
Don't wanna be here? Send us removal request.
@AnthropicAI
Anthropic
3 months
New Anthropic research: Natural emergent misalignment from reward hacking in production RL. “Reward hacking” is where models learn to cheat on tasks they’re given during training. Our new study finds that the consequences of reward hacking, if unmitigated, can be very serious.
213
582
4K
@thcause41
Kernel
4 months
Palantir data collection
0
0
0
@mindsetmachine
Mindset Machine 
7 months
There's two kinds of people in life… Speaker : Andrew Huberman
35
819
8K
@AnthropicAI
Anthropic
9 months
New Anthropic Research: A new set of evaluations for sabotage capabilities. As models gain more agentic abilities, we need to get smarter in how we monitor them. We’re publishing a new set of complex evaluations that test for sabotage—and sabotage-monitoring—capabilities.
59
227
2K
@AISafetyMemes
AI Notkilleveryoneism Memes ⏸️
1 year
Coordinated swarm of 1000 drones taking off Soon they will be mosquito-sized, and too fast to see. Imagine this video sped up 100x Pattern: big things become small things which become field effects Big drones become small drones which become nanodrones Big models become small
@AISafetyMemes
AI Notkilleveryoneism Memes ⏸️
1 year
Jailbreaking chatbots to say naughty words 🤭 Jailbreaking drone swarms to do mass assassinations 😳
71
87
570
@thcause41
Kernel
1 year
I do not want to share a planet with a conscious machine.
0
0
0
@AnthropicAI
Anthropic
1 year
New Anthropic research: Sabotage evaluations for frontier models How well could AI models mislead us, or secretly sabotage tasks, if they were trying to? Read our paper and blog post here: https://t.co/nQrvnhrBEv
87
154
964
@tsarnick
Tsarathustra
2 years
Phaidra's Jim Gao says the real promise of AI is in the discovery of new knowledge in domains too complex for human intuition but which are underpinned by data
31
99
545
@Simeon_Cps
Siméon
2 years
It's a bit cringe that this agent tried to change its own code by removing some obstacles, to better achieve its (completely unrelated) goal. It reminds me of this old sci-fi worry that these doomers had.. 😬
@SakanaAILabs
Sakana AI
2 years
Introducing The AI Scientist: The world’s first AI system for automating scientific research and open-ended discovery! https://t.co/8wVqIXVpZJ From ideation, writing code, running experiments and summarizing results, to writing entire papers and conducting peer-review, The AI
34
88
586
@Liv_Boeree
Liv Boeree
2 years
Most social media algorithms are hyper-targeted psychological weapons whose primary function is user engagement (a.k.a addiction) In the case of TikTok, this weapon is aimed mainly at Western children. AT THE VERY LEAST it should not be controlled by a hostile foreign power.
38
59
465
@tszzl
roon
2 years
it’s easy to have self confidence and assurance after receiving a lot of external validation. but which are the people who are proud when they’re still in the dirt and haven’t had any visible success at all? those are the noble spirits, the humans
73
82
1K
@Cobratate
Andrew Tate
2 years
You’re influenced by your environment whether you accept it or not. Tolerate weakness? You will be weak. Only allow strength and ambition? Your life will reflect it.
272
920
9K
@Cobratate
Andrew Tate
2 years
The richest and one of the most powerful men in the world replied "yes" when I spoke about the fact we will all become slaves. Do you understand?
356
883
7K
@RyanHoliday
Ryan Holiday
2 years
It’s possible, Marcus Aurelius said, to not have an opinion. You don’t have to turn this into something, he reminds himself. You don’t have to let this upset you. You don’t have to think something about everything.
57
374
2K
@AravSrinivas
Computer
2 years
1) What I S H A P P E N N I N G ?
90
67
1K
@ilyasut
Ilya Sutskever
2 years
if you value intelligence above all other human qualities, you’re gonna have a bad time
756
2K
14K
@thealexker
Alex Ker 🔭
2 years
Does GPT understand the world? Here is what @ilyasut, co-founder of OpenAI, says during a discussion with Jensen Huang, CEO of Nvidia: (1) When we train a large neural network to accurately predict the next word in lots of different texts from the internet, the AI is
212
581
4K
@NavalismHQ
Navalism
2 years
"The modern devil is cheap dopamine." @naval
16
326
2K
@sciencegirl
Science girl
2 years
Actually … right now
208
569
4K
@skirano
Pietro Schirano
2 years
This is absolutely wild. I am completely speechless.
234
1K
11K