TensorSlay Profile Banner
Tensor-Slayer Profile
Tensor-Slayer

@TensorSlay

Followers
556
Following
2K
Media
661
Statuses
10K

張量殺手

Joined February 2010
Don't wanna be here? Send us removal request.
@TensorSlay
Tensor-Slayer
6 months
Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement:
1
9
63
@TensorSlay
Tensor-Slayer
4 hours
Hyperscalers that spent 2024/5 hoarding NVIDIA cards are discovering that a rack of Blackwell GPUs is useless without a 150 MVA step-down transformer, a slab of Japanese ABF substrate and a monthly allocation of indium-phosphide lasers. The marginal cost of a delivered
@zephyr_z9
Zephyr
23 hours
"The era of cheap silicon is over." Interesting words from the Nothing Phone guy
0
0
0
@TensorSlay
Tensor-Slayer
6 hours
One question : Is he Chonky ?
@JustinLin610
Junyang Lin
7 hours
20%
0
0
0
@TensorSlay
Tensor-Slayer
6 hours
Time to repost the banger which reached no one because X algo buried me after calling out Theo and getting blocked
@TensorSlay
Tensor-Slayer
3 months
Seems like everyone has solved the king of RL problems i.e credit assignment function of process reward models in their tweets. I, for one, am lost on the counter to the counter-narrative intuitively.
0
0
0
@TensorSlay
Tensor-Slayer
7 hours
I like this direction. Sandbox agents only to let em run wild.
@himanshustwts
himanshu
8 hours
We just announced Mogra! Mogra gives agents what they actually need: sandboxed execution environments + persistent state across sessions + deployment. More coming! Harness > Model.
1
0
3
@TensorSlay
Tensor-Slayer
8 hours
Hard disagree with skills. Test this yourself : Generate a basic web page. Then use Claude frontend design skill to generate the same page. The difference is night and day. This is a “low hanging fruit” example btw.
@Presidentlin
Lincoln 🇿🇦
1 day
A person's first exposure to an AI agent really does matter. When I look at the Claude Code harness, to me, it looks like the inferior harness. Anthropic are really good at creating all these stuff to augment the model, that ends up not mattering. MCP don't matter, the model
0
0
2
@TensorSlay
Tensor-Slayer
14 hours
Imagine how it feels for 2010 account holders
@cto_junior
TDM (e/λ) (L8 vibe coder 💫)
18 hours
X's timeline has gone under a horrendous mode collapse the algo will basically mask you out if you don't echo the trending keyword this is why it feels so slop to most old timers we need to retvrn to the days of high entropy novel bangers
0
0
1
@TensorSlay
Tensor-Slayer
2 days
Not sure I understand this direction. CC was supposed to be a new way to interface with code. If we wanted we already had the ability to ….use IDEs to see the diffs. Wondering if this is to cater the new influx of programmers integrating CC in their workflows.
@bcherny
Boris Cherny
2 days
You asked, we listened. The team has been cooking this for a while. Can't wait to hear what you think.
1
0
3
@TensorSlay
Tensor-Slayer
2 days
This is quite refreshing to see. Agents did make mistakes BUT they can be taught the right way to do critical domain specific things. Enter Agent skills. If you still are on the fence about skills being “just .md files with instructions lol”, you are missing out.
@AnthropicAI
Anthropic
2 days
Since launching our AI for Science program, we’ve been working with scientists to understand how AI is accelerating progress. We spoke with 3 labs where Claude is reshaping research—and starting to point towards novel scientific insights and discoveries. https://t.co/WAvghBlbsC
0
0
0
@TensorSlay
Tensor-Slayer
2 days
Great insight. <<The flaws you see in your writing are invisible to everyone else.>>
@simonw
Simon Willison
2 days
I enjoyed answering questions from @c_a_dunlop about my approach to writing online. Here's my number one tip for people who want to publish more of their thoughts on the internet:
1
0
1
@TensorSlay
Tensor-Slayer
2 days
🤣🤣🤣🤣🤣🤣🤣🤣 relatable
@mikewazar
Mike Wazar
4 days
u l t r a t h i n k
0
0
2
@TensorSlay
Tensor-Slayer
26 days
> perf-hints : A language agnostic Claude Code plugin based on Jeff Dean & Sanjay Ghemawat's performance hints post. > /perf runs a 7-phase optimization workflow with back-of-envelope analysis. > /plugin marketplace add areu01or00/perf-hints >/plugin install perf > Link :
Tweet card summary image
github.com
A language-agnostic engineering wisdom plugin for Claude Code based on Performance Hints by Jeff Dean, Sanjay Ghemawat's viral article - including performance optimization, code review, tes...
@JeffDean
Jeff Dean
29 days
Performance Hints Over the years, my colleague Sanjay Ghemawat and I have done a fair bit of diving into performance tuning of various pieces of code. We wrote an internal Performance Hints document a couple of years ago as a way of identifying some general principles and we've
1
3
10
@scaling01
Lisan al Gaib
3 days
good for you, but isn't 1.5 million like half a researcher?
@alexalbert__
Alex Albert
4 days
I'm happy to share that we (@AnthropicAI) are investing $1.5 million in support of the Python Software Foundation and open source security. Python powers so much of the AI industry. Supporting the folks that make our work possible is an honor.
11
5
436
@TensorSlay
Tensor-Slayer
3 days
0
0
2
@TensorSlay
Tensor-Slayer
3 days
Raw prompts : ### Dual Filesystem Instructions ``` The presence of this tool means that Claude has access to two computer filesystems: 1. The user's computer filesystem (this computer), which Claude can access using its Filesystem tools. 2. Claude's computer filesystem (the
1
0
2
@TensorSlay
Tensor-Slayer
3 days
Reverse engineered the macOS binary to see how Cowork functions. Below are the architecture details and Tooling prompts : The Core : Claude Cowork = GUI + Claude Code CLI + Linux VM Dual Filesystem is the Core. Claude has its own isolated computer (VM) , can access your
@claudeai
Claude
5 days
Introducing Cowork: Claude Code for the rest of your work. Cowork lets you complete non-technical tasks much like how developers use Claude Code.
1
3
9
@TensorSlay
Tensor-Slayer
5 days
Engram : A Hash Table for Language or L1 cache for language. Engram’s core idea is almost trivial in hindsight: bolt an O(1) lookup table onto the transformer. The pipeline is three steps: Slice: incoming token sequence is hashed into overlapping N-grams (e.g., 3-grams).
2
0
1
@TensorSlay
Tensor-Slayer
5 days
L1 cache for language is here
@scaling01
Lisan al Gaib
5 days
DeepSeek is back! "Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models" They introduce Engram, a module that adds an O(1) lookup-style memory based on modernized hashed N-gram embeddings Mechanistic analysis suggests Engram reduces the need
0
0
2
@TensorSlay
Tensor-Slayer
10 days
I hope your legal teams are ready
@OpenAI
OpenAI
10 days
Introducing ChatGPT Health — a dedicated space for health conversations in ChatGPT. You can securely connect medical records and wellness apps so responses are grounded in your own health information. Designed to help you navigate medical care, not replace it. Join the
0
0
1
@TensorSlay
Tensor-Slayer
17 days
1. AI red teaming will be rampant. New attack vectors cultivating a billion dollar scam industry. 2. No amount of security and interpretability work will stop it. More outages. More leakages. More vulnerabilities. 3. Software quality degrades. Bar lowered. 3. General public
@alexalbert__
Alex Albert
17 days
What are your predictions for AI in 2026?
0
0
1
@TensorSlay
Tensor-Slayer
18 days
Nerd snip of the day https://t.co/slIoRI9o9S
0
0
1