
Ruikai
@retr0reg
Followers
3K
Following
873
Media
41
Statuses
138
16 founder @pwnoio and researcher
Avon, CT
Joined March 2024
My 10k-word writeup on exploiting a heap-overflow in Llama.cpp's RPC Server's Tensor-operation to RCE. This by far is one of the most challenging but fun exploitation I've ever researched on. https://t.co/aPLJyDF4Vq
retr0.blog
Retr0's Threat Research
4
106
416
Vocabularies are the lossy interface between thoughts and understanding. Hidden vectors carry meanings; tokens are merely the symbols we use to approximate them. https://t.co/A2JG7qqpfD
ruikai.posthaven.com
I knew nothing about Transformers before yesterday; I am a lover for cognitive science because I overthink too much. Words inherently lack precision. Often, I might have what feels like a...
1
1
12
I have at least spent a entire month of four months working on pwno just on k8s (gke), ignite... distributed system is a pain in the butt, but i guess if you do it right you can do such cool things (that scales)
2
0
24
We decided to open source Pwno-MCP: the MCP system for autonomous binary-level security research, a keystone behind Pwno's research architecture. Feel free to contribute or just try it out, we'll love to hear about how you solve pwn challenge with it! https://t.co/IljRU8RbNV
github.com
MCP for Pwn. Contribute to pwno-io/pwno-mcp development by creating an account on GitHub.
2
36
224
Our interview with Dark Reading at Black Hat https://t.co/BaqF0zvbsR
1
4
21
Just stepped off the stage at Black Hat USA as the youngest speaker in history. It's such an exciting but emotional moment sharing a project I've worked for almost a year to the world, months of work into thirty minutes. Just proud that this idea came from a piece of scratch
7
4
78
GGML is the tensor processing library used in Llama.cpp and other projects for AI inference, quantization and more. o4-mini and I spent an afternoon exploring the security of the GGML RPC server, writing a fuzzer and fixing some minor issues I uncovered. https://t.co/h6t1hEqPRh
0
9
28
These are 51 reports I submitted to Huntr last year (23-24), 28 RCEs, 11 LFIs, 34k words in total https://t.co/eeO0C9zyVw
2
26
146
watching chatgpt agent use a computer to do complex tasks has been a real "feel the agi" moment for me; something about seeing the computer think, plan, and execute hits different.
1K
848
13K
that's why sink-to-source is just naturally logically sensible and efficient, it's just a manner of recovery rather than discovery (**the point here is the latent information), this is what i think called context relevance specifically for logical tasking. putting this to
0
0
3
It's fun to think sinks have a hidden caller-chain state (counterintuitive), and sink-to-source are essentially recovering a hidden states, rather traditional dynamic pruning process.
1
0
9
We Found a Heap Overflow in llama.cpp's Tokenizer https://t.co/XovQ63afkX
pwno.io
Pwno's discovery of a heap overflow in Llama.cpp's Tokenizer
0
6
39
The reason why we put so much faiths on Transformers on low-level security is because they're natural processor of information, the only part where they're naturally better than us as human is their granularity in context awareness - something engraved in their attention heads
0
0
10
the thing is @pwnoio really needs a base indexing so we can tell him how to pwn, but existing softwares have terrible retrievals on knowledge base like https://t.co/h5f6s82Hhd (chinese, in-depth). just want to something easy to use in agents workflow and less expensive
1
0
2