tender @tenderizzation X Profile

tender

@tenderizzation

Followers

3K

Following

44K

Media

2K

Statuses

5K

python test/test_matmul_cuda.py

South Silly Valley (南湾）

Joined July 2010

Don't wanna be here? Send us removal request.

tender

@tenderizzation

9 months

DM me "hey" I'll debug your CUDA error: an illegal memory access🍒 "hi" I'll debug your cuDNN error: CUDNN_STATUS_BAD_PARAM 🍑 "howdy" I'll debug your CUDA error: CUBLAS_STATUS_EXECUTION_FAILED 🍓

0

65

tender

@tenderizzation

7 hours

https://t.co/cbxXUQ4aCp

2

0

27

tender

@tenderizzation

8 hours

do we think it's more of a sleep or hibernate

1

0

3

tender

@tenderizzation

14 hours

DM me for the secret environment variables

Simo Ryu

@cloneofsimo

14 hours

This is the exact moment where torch compile now has max_autotune_us_gov backend opt in that wire transfer 1$ for executing every graph module

0

12

tender

@tenderizzation

19 hours

I am ambivalent about missing college because this was what literally every day was

15

5

180

tender

@tenderizzation

21 hours

the “now” doing some heavy lifting

Ramp Capital

@RampCapitalLLC

1 day

🙄

1

0

55

tender

@tenderizzation

24 hours

today is friday in california

1

0

10

tender

@tenderizzation

1 day

the model when you neglect document masking during pretraining and the space invaders speedrun video ends up next to someone’s instagram story

0

1

37

tender

@tenderizzation

2 days

https://t.co/BNJRwUtP5D

7

8

291

tender

@tenderizzation

2 days

picking the PCI-E cake over the SXM cake when there is ostensibly no cost difference is crazy

emily han

@emilyhanyf

2 days

did someone say GPU cake?

4

0

98

tender

@tenderizzation

2 days

https://t.co/wss73ILBg8

Andrej Karpathy

@karpathy

3 days

POV: Your LLM agent is dividing a by b

1

0

77

tender

@tenderizzation

2 days

imagine you’re a data guy at one of the big labs curating github repos for pretraining “maybe we can sort them by number of stars?” an entirely reasonable heuristic, one might conclude

15

29

1K

tender

@tenderizzation

2 days

when @alyankovic drops the "It's All About the Blackwells" music video I'll start worrying until then,

1

0

11

tender

@tenderizzation

3 days

pytorch consulting the math library heuristics which say transposing the matrix will yield a 30% speedup over the non-transposed case, deciding to transpose the matrix, and immediately running out of memory https://t.co/5fodHFSwFp

7

20

634

tender

@tenderizzation

3 days

“hey, can you check if this fixes the problem?” <link to 30GiB docker container>

1

42

tender

@tenderizzation

4 days

“back in my day when people said ‘numerical stability’ they were referring to topics like machine epsilon and conditioning, not whether an implementation of an algorithm was grossly incorrect” “sure thing, let’s get you back to numpy where the default dtype is float64”