Krishna Mohan

@KMohan2006

Followers
2K
Following
33K
Media
242
Statuses
3K

Denoising the present to hopefully get a brighter future | loves diffusion models

GPU
Joined May 2024
@KMohan2006
Krishna Mohan
6 months
Flash Attention 1 forward pass CUDA kernel
[4 images]
5
13
242
@KMohan2006
Krishna Mohan
21 hours
RT @natolambert: My top 5 most memorable models from using them at/soonafter launch: 1. Claude 3.5 Sonnet (personality, all round perf) 2. …
0
11
0
@KMohan2006
Krishna Mohan
3 days
Officially in my twenties
[1 image]
18
0
57
@KMohan2006
Krishna Mohan
9 days
RT @MathMatize: A measurable function on a probability space
[1 image]
0
146
0
@KMohan2006
Krishna Mohan
9 days
RT @jxmnop: curious about the training data of OpenAI's new gpt-oss models? i was too. so i generated 10M examples from gpt-oss-20b, ran…
0
522
0
@KMohan2006
Krishna Mohan
9 days
RT @LeonardoAi_: Lucid Origin has cracked top 10 in the leaderboards. Ranked #8 in the world's top text-to-image gen models on @Artificia…
0
8
0
@KMohan2006
Krishna Mohan
9 days
Going to binge watch these talks
[1 image]
2
2
55
@KMohan2006
Krishna Mohan
9 days
In brief, everything is just an optimization.
@KMohan2006
Krishna Mohan
10 days
The way to reach the final model
[2 images]
0
0
5
@KMohan2006
Krishna Mohan
10 days
The way to reach the final model
[2 images]
1
0
21
@KMohan2006
Krishna Mohan
10 days
Morning with attention sinks.
@Guangxuan_Xiao
Guangxuan Xiao
10 days
I've written the full story of Attention Sinks, a technical deep-dive into how the mechanism was developed and how our research ended up being used in OpenAI's new OSS models. For those interested in the details:
[1 image]
0
0
14
@KMohan2006
Krishna Mohan
10 days
Vibe coded chart
[1 image]
1
0
8
@KMohan2006
Krishna Mohan
11 days
The LLM is the CPU and the context window is the RAM.
3
0
7
@KMohan2006
Krishna Mohan
12 days
RT @AnthropicAI: Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning. htt…
0
1K
0
@KMohan2006
Krishna Mohan
12 days
They cooked world model 🔥
@GoogleDeepMind
Google DeepMind
13 days
What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵
0
0
9
@KMohan2006
Krishna Mohan
12 days
RT @GoogleDeepMind: What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model th…
0
3K
0
@KMohan2006
Krishna Mohan
13 days
RT @cwolferesearch: Reinforcement Learning (RL) is quickly becoming the most important skill for AI researchers. Here are the best resource…
0
256
0
@KMohan2006
Krishna Mohan
17 days
Reading about RLHF
[1 image]
1
0
11
@KMohan2006
Krishna Mohan
18 days
AGI, ASI and now PSI - personal super intelligence.
1
0
7
@KMohan2006
Krishna Mohan
18 days
What is personal super intelligence?
3
0
8
@KMohan2006
Krishna Mohan
19 days
RT @TheAITimeline: 🚨 This week's top AI/ML research papers: - GSPO - Diffusion Beats Autoregressive in Data-Constrained Settings - Gemini 2…
0
73
0
@KMohan2006
Krishna Mohan
20 days
Discipline is explaining to your brain that you need to sacrifice immediate pleasures for greater rewards in the future.
0
0
13