Shannon Sands Profile
Shannon Sands

@max_paperclips

Followers
9K
Following
276K
Media
1K
Statuses
31K

Software developer & cognitive architect https://t.co/JAoBrqMLXN.

Joined February 2023
@theemozilla
emozilla
4 hours
I spent the last two days at the Vatican attending the AI Forum. It was basically a Council of Elrond-style gathering of technical practitioners (me), philosophers, and theologians. Good news, the Church is broadly aligned that AI is a force for elevating humanity so long as we
@Pontifex
Pope Leo XIV
12 hours
Technological innovation can be a form of participation in the divine act of creation. It carries an ethical and spiritual weight, for every design choice expresses a vision of humanity. The Church therefore calls all builders of #AI to cultivate moral discernment as a
3
4
27
@tjcages
ty
2 days
normalize decorative first letters in code snippets
@instance_11
m_11
3 days
forms & the code that produced them
92
1K
16K
@pukicho
Pukicho
4 days
My child will not be allowed to use chat gpt. He will be smarter and stronger than the other children and he will kill them easily.
656
34K
273K
@Yuchenj_UW
Yuchen Jin
1 day
If you ever wonder how Chinese frontier models like Kimi, DeepSeek, and Qwen are trained on far fewer (and nerfed) Nvidia GPUs than US models. In 1969, NASA’s Apollo mission landed people on the moon with a computer that had just 4KB of RAM. Creativity loves constraints.
124
277
4K
@EMostaque
Emad
1 day
A note on costs/compute: Base Kimi K2 model used 2.8m H800 hours with 14.8 trillion tokens, about $5.6m worth. Details of post training for reasoning not given, but it is likely max 20% more (excluding data prep!). Would be < $3m for sota if they had Blackwell chip access
@Kimi_Moonshot
Kimi.ai
1 day
🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built
21
45
514
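The cost figures in Emad's note can be checked with a few lines of arithmetic. This is only a sketch built on the numbers quoted in the post (2.8m H800 hours, about $5.6m, at most 20% more for post-training); the variable names are made up for illustration and the hourly rate is simply what those two figures imply, not a quoted price.

```python
# Figures as quoted in the post; everything else is derived from them.
gpu_hours = 2.8e6            # H800 hours for the base model
base_cost_usd = 5.6e6        # approximate cost stated in the post

# Implied rental rate per H800 hour (an inference, not a quoted price).
implied_rate_usd_per_hour = base_cost_usd / gpu_hours

# The post's upper bound for post-training: at most 20% on top of the base run.
post_training_fraction = 0.20
total_upper_bound_usd = base_cost_usd * (1 + post_training_fraction)

print(f"implied rate: ${implied_rate_usd_per_hour:.2f} per GPU hour")
print(f"upper bound incl. post-training: ${total_upper_bound_usd / 1e6:.2f}m")
```

The implied rate works out to $2 per GPU hour, and the upper bound to roughly $6.7m, excluding data prep as the post notes.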
@max_paperclips
Shannon Sands
2 days
I'd have expected "lack of novelty" or something, but nope apparently the reviewers don't know this kinda pipeline is what everyone is doing now
@johannes_hage
Johannes Hagemann
2 days
The PipelineRL paper getting rejected at NeurIPS reminds me of when the Megatron-LM paper got rejected from every conference back in 2020. scientific reviewers still don’t recognize a good systems paper when they see one https://t.co/KMPnx5HlnX
1
0
25
@max_paperclips
Shannon Sands
2 days
me playing around with my tiny xLSTM implementation in MLX lmao
@mike64_t
mike64_t
2 days
@_ueaj Don't lecture me, uaj, I see through the lies of the GPU managerial class. There is no free lunch for parallelism. I do not fear sequentiality as you do. I have brought speed, scalability and memory sustainability to my new empire.
0
1
17
@max_paperclips
Shannon Sands
2 days
a mass movement of voters? In my democracy? Say it isn't so
@tszzl
roon
2 days
i miss “post rationalism”, a principled school of thought, which has now been replaced by its idiot cousin “populism”
1
0
22
@KanuGulati
Kanu Gulati, Partner @Khosla Ventures
3 days
So, I just got back from Hangzhou, China; Attended @IROS2025. Takeaway: China’s robotics ecosystem is moving faster and more coordinated than the US
14
7
56
@max_paperclips
Shannon Sands
2 days
> If you want to do multi-agent multi-turn LLM RL, might as well do commit sudoku. Accurate
@redtachyon
Ariel
2 days
Aight let's talk about frameworks, libraries, RL, and why I probably don't like your favorite RL codebase. Yes, including that one. The unusual thing about RL is that the algorithm is the easy part. GRPO is a single-line equation on some logprobs. If you have the data, computing
3
0
41
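To make Ariel's point that "GRPO is a single-line equation on some logprobs" concrete, here is a minimal sketch of the core computation. It assumes sequence-level log-probabilities, a population standard deviation for the group normalization, and a PPO-style clip of 0.2, and it leaves out the KL penalty entirely. The function names are hypothetical and not taken from any particular RL codebase.

```python
import math
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the other samples for the same prompt."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population std; some codebases differ
    return [(r - mu) / (sigma + eps) for r in rewards]


def grpo_surrogate_loss(logp_new, logp_old, rewards, clip=0.2):
    """Clipped policy-gradient surrogate over one group of sampled completions.

    logp_new / logp_old are per-completion log-probabilities under the current
    and the sampling policy. The KL term is omitted in this sketch.
    """
    advantages = group_relative_advantages(rewards)
    terms = []
    for new, old, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(new - old)
        clipped = max(1.0 - clip, min(1.0 + clip, ratio))
        terms.append(min(ratio * adv, clipped * adv))
    return -mean(terms)


# Four completions for one prompt, two of which were rewarded.
rewards = [1.0, 0.0, 0.0, 1.0]
logp_old = [-12.0, -15.0, -14.0, -11.0]
print(group_relative_advantages(rewards))
print(grpo_surrogate_loss(logp_old, logp_old, rewards))
```

The arithmetic really is short, which is consistent with the post's claim that the algorithm is the easy part and the surrounding plumbing is where the work lies.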
@max_paperclips
Shannon Sands
3 days
New idea for web apps: Need to wrap it up, put it on NPM and have a full in-browser OS, and then serve the actual site on its localhost, purely to add more layers of abstraction to the frontend
@wavefnx
wavefnx
4 days
Fun. Someone patched the Linux kernel so it can run on WASM. Big difference between some simulations that sandbox or warp the kernel (spec. LKL) while this actually compiles it for WASM. Biggest changes on patches kernel-0005, llvm-0001, musl-0001. arch/wasm is born
2
2
24
@max_paperclips
Shannon Sands
3 days
Use Nix. I await your block
3
0
11
@fchollet
François Chollet
3 days
The path to autonomous AI is a system that learns to solve new problems by synthesizing models of them on the fly (as code), and that gets smarter over time by adding new abstractions to its own library (also as code), compounding its capabilities. Not a static map -- rather, an
51
39
413
@max_paperclips
Shannon Sands
3 days
brrrrrr
@Teknium
Teknium (e/λ)
3 days
Some of our internal MoE pretrain experiments ^_^
0
1
46
@Teknium
Teknium (e/λ)
3 days
Some of our internal MoE pretrain experiments ^_^
19
7
297
@max_paperclips
Shannon Sands
3 days
This isn't true, I have dreams (occasionally nightmares) about code constantly lmao. I don't dream about my phone though, which is kinda interesting
@tbpn
TBPN
4 days
"Do you never notice that devices and screens aren't in your dreams?" – @bchesky
2
0
18
@zack_overflow
zack
4 days
"it's Python, you do anything and it allocates" How true is this? I modified CPython to print when it allocates an integer object. Then added numbers in a for-loop 100k times. My terminal got spammed with 101006 allocations. Why? Let's explore the internals of CPython:
@leothrix
tyler
10 days
I will never forgive Rust for making me think to myself “I wonder if this is allocating” whenever I’m writing Python now
25
41
794
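The effect zack describes can be glimpsed without patching the interpreter. The sketch below is not his method; it relies on a CPython implementation detail, namely that a small range of integers is preallocated and shared while larger results are built as new objects. The helper names are made up for illustration, and the behavior may differ on other Python implementations.

```python
def is_shared_int(n: int) -> bool:
    """Return True if two independently built ints of value n are one object."""
    a = int(str(n))  # built at runtime so the compiler cannot fold constants
    b = int(str(n))
    return a is b


def count_fresh_results(iterations: int) -> int:
    """Count running-sum values that are not served from the shared small ints."""
    fresh = 0
    total = 0
    for i in range(iterations):
        total += i
        if not is_shared_int(total):
            fresh += 1
    return fresh


print(is_shared_int(5))        # small value: expected to be shared on CPython
print(is_shared_int(10**6))    # large value: expected to be a new object
print(count_fresh_results(100_000))
```

On CPython almost every running sum falls outside the shared range, so nearly every addition in the loop has to produce a new integer object, which lines up with the allocation count in the post being slightly above 100k.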
@max_paperclips
Shannon Sands
4 days
I think it actually is beautiful
@boazbaraktcs
Boaz Barak
5 days
@emollick @EpochAIResearch The article also slanders non-commutative algebra.
0
0
18
@i2cjak
i²cjak
4 days
if your lithium batteries get hot it’s best to dunk them in water to cool them down
@simcity99
simcity
4 days
fuuuuk! battery is starting to swell… quick! chat, what do i do now!?
31
3
279
@Sentdex
Harrison Kinsley
4 days
Another side benefit of this exploit is, w/ root access to the main board, you have a stronger e-stop. When running low level policies, the remote controller stops working. You can even reboot the jetson board and robot still moves. A reboot of main board reliably stops things
@Sentdex
Harrison Kinsley
21 days
@ChongZitaZhang @benjamin_bolte This wasn't fake. I was able to replicate it and posted a video doing it live for this exact reason. I still have root and sudoer on my g1 main board that I used the reported bluetooth exploit to get, which anyone could do in range.
4
6
64
@Sentdex
Harrison Kinsley
21 days
@ChongZitaZhang @benjamin_bolte Correct, you can see I am connected to the .161 PC1 in photo. vid of me testing exploits: https://t.co/vWtY7znfS2 You can safely test reboot command like I did in video. I later enabled ssh, and used that same method to set up a new sudoer. You can also just change root pass.
2
3
20