
cloud
@cloud11665
Followers
11K
Following
12K
Media
435
Statuses
3K
SIMD enjoyer, tensor rotator, LLM inference optimizoor | Technical Staff @ https://t.co/gQXVxhjcOm
SF ↔️ Tokyo ↔️ Poland
Joined July 2017
RT @tenderizzation: this has been a longstanding issue, and predates the dawn of neoclouds . one solution is to literally have all your nod….
0
2
0
A very good talk by @foonathan .tldr: Differences in branch prediction optimization are very architecture + platform dependent .
1
5
18
RT @tszzl: infinity is poison, scale is inhuman. you worship coldness having never known warmth.
0
34
0
uh. There is no way it will continue scaling like that. right?.512 elements is 8 out of 16 avx512 registers - next stop is 1024 elements but then still there is much more unnamed registers and the cpu is fantastic at register renaming
Did you know that it is possible to sort 64 uint8 numbers in less than 64 instructions?.I am playing around with SIMD (avx512) bitonic sorting networks
4
2
35