
Jonathan Chang
@ChangJonathanC
Followers
1K
Following
13K
Media
496
Statuses
2K
ML/AI Engineer, building https://t.co/uEbfxzF7jm
Taiwan
Joined May 2020
while we wait for gpt-5 to drop. Here is a flex attention tutorial for building a < 1000 LoC vllm from scratch.
jonathanc.net
PyTorch FlexAttention tutorial: Building a minimal vLLM-style inference engine from scratch with paged attention
9
39
399
Pantheon.
I’m excited to share that @Starcloud_Inc1 is partnering with @Google Cloud to be the first to run a version of Gemini on a satellite. "Starcloud, a start-up that is building a space data center, will soon launch a satellite equipped with NVIDIA graphics processing unit (GPU)
0
0
3
lol they really put juice number to the system prompt
gpt-5-high is only available through the API and has the highest juice: 200. Even on the $200/month plan, thinking and thinking-pro are limited to a juice of 128. Consumers are drunk on the juice and have just caught sight of some untapped kegs.
1
0
2
hey @aidan_mclau , i think it's weird for model to say "Got it" after i've waited for 3 mins for the response. the model should just answer the question
0
0
5