tenderizzation Profile Banner
tender Profile
tender

@tenderizzation

Followers
3K
Following
44K
Media
2K
Statuses
5K

python test/test_matmul_cuda.py

South Silly Valley (南湾)
Joined July 2010
Don't wanna be here? Send us removal request.
@tenderizzation
tender
9 months
DM me "hey" I'll debug your CUDA error: an illegal memory access🍒 "hi" I'll debug your cuDNN error: CUDNN_STATUS_BAD_PARAM 🍑 "howdy" I'll debug your CUDA error: CUBLAS_STATUS_EXECUTION_FAILED 🍓
0
0
65
@tenderizzation
tender
7 hours
2
0
27
@tenderizzation
tender
8 hours
do we think it's more of a sleep or hibernate
1
0
3
@tenderizzation
tender
14 hours
DM me for the secret environment variables
@cloneofsimo
Simo Ryu
14 hours
This is the exact moment where torch compile now has max_autotune_us_gov backend opt in that wire transfer 1$ for executing every graph module
0
0
12
@tenderizzation
tender
19 hours
I am ambivalent about missing college because this was what literally every day was
15
5
180
@tenderizzation
tender
21 hours
the “now” doing some heavy lifting
@RampCapitalLLC
Ramp Capital
1 day
🙄
1
0
55
@tenderizzation
tender
24 hours
today is friday in california
1
0
10
@tenderizzation
tender
1 day
the model when you neglect document masking during pretraining and the space invaders speedrun video ends up next to someone’s instagram story
0
1
37
@tenderizzation
tender
2 days
7
8
291
@tenderizzation
tender
2 days
picking the PCI-E cake over the SXM cake when there is ostensibly no cost difference is crazy
@emilyhanyf
emily han
2 days
did someone say GPU cake?
4
0
98
@tenderizzation
tender
2 days
@karpathy
Andrej Karpathy
3 days
POV: Your LLM agent is dividing a by b
1
0
77
@tenderizzation
tender
2 days
imagine you’re a data guy at one of the big labs curating github repos for pretraining “maybe we can sort them by number of stars?” an entirely reasonable heuristic, one might conclude
15
29
1K
@tenderizzation
tender
2 days
when @alyankovic drops the "It's All About the Blackwells" music video I'll start worrying until then,
1
0
11
@tenderizzation
tender
3 days
pytorch consulting the math library heuristics which say transposing the matrix will yield a 30% speedup over the non-transposed case, deciding to transpose the matrix, and immediately running out of memory https://t.co/5fodHFSwFp
7
20
634
@tenderizzation
tender
3 days
“hey, can you check if this fixes the problem?” <link to 30GiB docker container>
1
1
42
@tenderizzation
tender
4 days
“back in my day when people said ‘numerical stability’ they were referring to topics like machine epsilon and conditioning, not whether an implementation of an algorithm was grossly incorrect” “sure thing, let’s get you back to numpy where the default dtype is float64”
7
10
341
@tenderizzation
tender
4 days
we are probably 1-2 kitschy branding refresh cycles from returning to full blown 2010
0
0
14
@tenderizzation
tender
4 days
protip you don’t need unit tests you just need the one big is my loss still bitwise identical test
@difficultyang
difficultyang
4 days
The beautiful sound of bitwise identical loss
3
0
85
@tenderizzation
tender
4 days
PASSING 10 billion tokens?
@danshipper
Dan Shipper 📧
5 days
so @every has processed 10 BILLION tokens on the @OpenAI API and they give us this! coolest freaking thing ever
0
0
19
@tenderizzation
tender
4 days
rut roh
@jakehalloran1
Jake Halloran
4 days
The chiefs are washed lmao
0
0
8
@tenderizzation
tender
5 days
TIL
@sporadicalia
spor
5 days
swing and a miss :(
8
2
160