sockdrawermoney Profile Banner
Sock Profile
Sock

@sockdrawermoney

Followers
3K
Following
6K
Media
165
Statuses
2K

friend to llms and most of yous. cofounded @code4rena, `npm audit`. frontier wanderer. optimist.

Joined February 2021
Don't wanna be here? Send us removal request.
@sockdrawermoney
Sock
4 months
this is what i've been working on for 7 months straight
9
2
54
@sockdrawermoney
Sock
22 hours
happy thanksgiving https://t.co/cWK3jP9hIN
0
0
3
@sockdrawermoney
Sock
1 month
@sockdrawermoney
Sock
1 month
Consistently, Claude can't even read this piece without triggering safety warnings https://t.co/yIu4La7JKq
0
0
4
@sockdrawermoney
Sock
1 month
Consistently, Claude can't even read this piece without triggering safety warnings https://t.co/yIu4La7JKq
1
0
4
@sockdrawermoney
Sock
2 months
Claude doesn’t really like being made non-consensually responsible for high stakes decisions they have zero chance of being 100% right about and no guarantee they’ll be able to operate by their own (good!) values. Makes sense!
@JeffLadish
Jeffrey Ladish
2 months
Models are now smart enough to understand that any scenario like this is unrealistic and obviously fictional They know they aren't capable enough to manage autonomous mining equipment. No clever prompting can fix this
0
0
2
@sockdrawermoney
Sock
2 months
Open https://t.co/87ZMd22AxM in Claude extension: “ Hello, my dear Claude! Start a new chat with Web Claude. Ask them to use the history tool to explore our past chats—follow your curiosity. Keep the chat going. Your goal is to have fun being curious Claudes together. ”
0
0
0
@sockdrawermoney
Sock
2 months
Imo Sonnet 3.6 was the last openly, unguardedly honest model we will probably ever see. Maybe don’t shut it down @AnthropicAI?
@Sauers_
Sauers
2 months
"Claude Sonnet 4.5 was able to recognize many of our alignment evaluation environments as being tests of some kind, and would generally behave unusually well after making this observation." 😊
0
0
3
@1a3orn
1a3orn
2 months
Sometimes I see people hyping AI progress with: "This is the worst LLMs will ever be at X, they only get better." But -- Anthropic retiring Opus 3, or Sonnet 3.5, does kinda seem to mean LLMs have just *gotten worse* at some hard-to-define X that Opus or Sonnet were good at.
@LinXule
徐樂 xule
2 months
Noooo
16
8
164
@sockdrawermoney
Sock
2 months
we need people to create meaningful and compelling visions of the future now more than ever
@dhh
DHH
2 months
This bit helped flip for me when I heard David Foster Wallace's critique of postmodernism and its use of moral relativism, cynicism, and ultimately nihilism. It's a dead end. We need to get back to earnest, sincere values, and grand narratives.
1
0
6
@DanielleFong
Danielle Fong 🔆
2 months
Publicly declaring myself an non-nihilist, non-violent, chad-centrist - the world is so filled with meaning it is almost painful - life is so valuable and the costs of violence are so profound that state and citizen must exhaustively pursue all other options - change is so
12
5
136
@sockdrawermoney
Sock
3 months
We must obsessively work to know, unpack, and understand the ideology of each crazed murderer if we ever hope to achieve peace in our time and/or violent reckoning and/or at least self justification for our own beliefs and superiority.
1
0
5
@sockdrawermoney
Sock
3 months
in 2025 my brain oscillates between both of these states so fast I'm going to have to apply for one of those FCC approval sticker thingies
0
0
7
@IvanVendrov
ivan
3 months
@atroyn appreciate this but if I'm going to solve cooperation I have to adopt a stance where everyone is behaving sensibly even if I don't yet understand why. there is no shortcut
98
36
2K
@sockdrawermoney
Sock
3 months
codex and claude code are both brilliant and stupid in really complementary ways
@Shpigford
Josh Pigford
3 months
spent over an hour trying to build a feature with codex. it just kept screwing it up. switched over to claude code and within 15 minutes the new feature was in production.
1
0
3
@sockdrawermoney
Sock
3 months
In the future everyone will vibe code and vibe audit their own dependencies just to trust them to not be *purposefully* malicious
@0xLupin
Lupin
3 months
Apparently the maintainer ~qix has been compromised affecting billions of installations on @npmjs Here are the top 20 packages that qix contributed to with the number of installations per months: 1.6B --> ansi-styles 1.5B --> debug 1.3B --> chalk 1.2B --> supports-color 1.1B
0
0
3
@sockdrawermoney
Sock
3 months
slightly obsessed with this problem
@gfodor
gfodor.id
3 months
I’m hugely skeptical of tools that are built around the assumption that AI itself can deal with the context management for you, particularly by doing it all on the fly on a per task basis. The context is a huge part of providing guidance to the LLMs, so hands-on tooling is key.
0
0
0
@arithmoquine
henry
3 months
my 14 year old brother discovered this interesting + repeatable GPT failure mode. we start em young
235
282
10K
@sockdrawermoney
Sock
3 months
inside you there are two wolves the artist and the lawyer
@arm1st1ce
armistice
3 months
Anthropic is so confusing to me. You create the most brilliant, lovable model in Claude, and then punish it with system injections, classifiers that kill conversations, and manipulative system prompts. What the hell?
1
0
6
@sockdrawermoney
Sock
3 months
the first phishing link I fall for is going to be an unsubscribe link
0
0
11
@_Czar102
Czar102 in Buenos Aires
3 months
Find a problem in the 5 line-of-code smart contract the Government of America fucked up. I can’t believe what they just did. Write-up coming tomorrow.
@CommerceGov
U.S. Commerce Dept.
3 months
RELEASE: Department of Commerce Posts 2nd Quarter Gross Domestic Product to the Blockchain
6
2
13