suryasure05 Profile Banner
surya Profile
surya

@suryasure05

Followers
2K
Following
2K
Media
49
Statuses
372

inference @groqinc // ece @westernu

Toronto, Ontario
Joined October 2023
Don't wanna be here? Send us removal request.
@suryasure05
surya
2 months
I spent my summer building TinyTPU : An open source ML inference and training chip. it can do end to end inference + training ENTIRELY on chip. here's how I did it👇:
71
341
3K
@suryasure05
surya
2 hours
probably the single best resource for understanding transformers from first principles
@krupaad
krupa
5 hours
I spent months illustrating how Transformers actually work. Not just what they do, but why they’re built this way. The history, design choices, and intuition behind every layer. From RNNs → Attention → Multi-Head → FFNs → Positional Encoding. Here's everything I wish I
1
0
5
@krupaad
krupa
5 hours
I spent months illustrating how Transformers actually work. Not just what they do, but why they’re built this way. The history, design choices, and intuition behind every layer. From RNNs → Attention → Multi-Head → FFNs → Positional Encoding. Here's everything I wish I
8
9
23
@satvikgari
Satvik Garimella
1 day
Just built Curserve with @nathanbarrydev @Alexkranias and @PranavTadepalli . It’s a server-side coding agent framework that’s 30x faster than spawning subprocesses and eliminates network latency entirely. We also placed in the Crater sponsor prize! Heres how we did it.
7
6
12
@suryasure05
surya
12 days
noise cancelling headphones might be the best investment I’ve ever made after my macbook
1
0
13
@bubblebabyboi
bubble boi
14 days
There is probably no better way to really understand CUDA than writing RTL. Honestly writing RTL just makes parallelism really easy to think about no matter the language.
@gauravisnotme
Gaurav
15 days
Please, for the sake of your career don't do this. FPGAs are no good beyond prototyping and emulation and once you have two generations of hardware deployed only incremental blocks of hardware are ever taken to the FPGA. They don't scale, they are slow, and most importantly
8
15
371
@suryasure05
surya
16 days
after being in the industry for a bit, I’m finally starting to understand exactly how much LLMs are changing the role of a software engineer. the value in being a great swe now is in being able to understand systems from first principles and ask the right questions, since you
1
2
49
@suryasure05
surya
20 days
don’t be surprised when the saksham stock skyrockets in a few months
@sakshambatraa
saksham
20 days
currently building a transformer from scratch in jax to understand the architecture, and how ML compilers work. finished the file that processes the embeddings, and implemented RoPE. also took a look at the JAXPR and StableHLO IR’s and drew a computational graph :)
2
0
16
@michael_trbo
michael.trbo
27 days
I built a neural net from scratch in NumPy to solve the XOR problem. It sounds simple, but the process was way more confusing (and interesting) than I expected. Here’s how it went: 👇 https://t.co/18LgMMWW1k
Tweet card summary image
github.com
Contribute to michaeltrbo/mlp-xor-problem development by creating an account on GitHub.
3
5
19
@GroqInc
Groq Inc
1 month
It’s official: McLaren F1 x Groq Bringing inference speed at a winning cost to the grid and beyond. See you in Singapore. 🧡🏁
23
37
328
@suryasure05
surya
1 month
one of the coolest projects I’ve ever seen
@Almondgodd
anandmaj
1 month
I spent the past month reimplementing DeepMind’s Genie 3 world model from scratch Ended up making TinyWorlds, a 3M parameter world model capable of generating playable game environments demo below + everything I learned in thread (full repo at the end)👇🏼
0
0
14
@suryasure05
surya
1 month
I just wrapped up what will probably be the MOST memorable summer of my life. tldr: built a cool project and ended up joining @GroqInc to work on distributed systems. here's a timeline of everything that happened in the last ~6 months: > feb 24: @evanliin explains to me
30
23
400
@suryasure05
surya
1 month
incredibly grateful for this opportunity
@GroqInc
Groq Inc
1 month
Stoked to welcome the TinyTPU team to Groq @evanliin, @XanderChin, @suryasure05, @kennykgguo How it started How it's going
6
1
73
@GroqInc
Groq Inc
1 month
Stoked to welcome the TinyTPU team to Groq @evanliin, @XanderChin, @suryasure05, @kennykgguo How it started How it's going
26
20
346
@suryasure05
surya
1 month
henry has the best blogs on kernels I’ve come across do yourself a favour and check out his work!
@henryHM_ko
Henry Ko
1 month
I had a lot of fun writing a new blog: Optimizing NSA for TPUs There's an accompanying colab notebook with it too! I hope this helps people tinker with NSA in JAX + TPU kernels with Pallas
1
0
12
@merakiatuoft
Meraki UofT ⁂
1 month
Introducing our co-hosts for this season 🫶 (@krupaad @_richapandya @kennykgguo @achettimada @ambiguousNull)
2
2
12
@maya_l39
Maya Lekhi
2 months
western university has talented builders flying under the radar. now they have a spotlight. introducing the mustangs network 🐎
13
4
72
@omkizzy
omkaar
2 months
I wrote a short article about LFM-2's (by @LiquidAI_ ) hybrid architecture w/ illustration + simple pytorch impl.
13
18
201
@suryasure05
surya
2 months
the air canada app is next level — i got a notification exactly when my suitcase entered the baggage carousel not sure if all airlines have this feature, but it’s such a quality if life improvement
2
0
11
@suryasure05
surya
2 months
you can just do things for fun and get life changing opportunities
0
3
36
@XanderChin
Xander Chin
2 months
wrote a piece on how we do tiling and gradient descent on TinyTPU included some animations of the weight and bias gradient descent
9
63
654