Krishna Mohan @KMohan2006 X Profile

Krishna Mohan

@KMohan2006

Followers

2K

Following

33K

Media

242

Statuses

3K

Denoising present to hopefully get brighter future | loves diffusion models

GPU

Joined May 2024

Don't wanna be here? Send us removal request.

Krishna Mohan

@KMohan2006

6 months

Flash Attention 1 forward pass Cuda kernel

5

13

242

Krishna Mohan

@KMohan2006

21 hours

RT @natolambert: My top 5 most memorable models from using them at/soonafter launch:.1. Claude 3.5 Sonnet (personality, all round perf).2.….

0

11

0

Krishna Mohan

@KMohan2006

3 days

Officially in twenties

18

0

57

Krishna Mohan

@KMohan2006

9 days

RT @MathMatize: A measurable function on a probability space

0

146

0

Krishna Mohan

@KMohan2006

9 days

RT @jxmnop: curious about the training data of OpenAI's new gpt-oss models? i was too. so i generated 10M examples from gpt-oss-20b, ran….

0

522

0

Krishna Mohan

@KMohan2006

9 days

RT @LeonardoAi_: Lucid Origin has cracked top 10 in the leaderboards. Ranked #8 in the world's top text-to-image gen models on @Artificia….

0

8

0

Krishna Mohan

@KMohan2006

9 days

Going to binge watch these talks

2

55

Krishna Mohan

@KMohan2006

9 days

In brief, everything is just an optimization.

Krishna Mohan

@KMohan2006

10 days

The way to reach the final model

0

5

Krishna Mohan

@KMohan2006

10 days

The way to reach the final model

1

0

21

Krishna Mohan

@KMohan2006

10 days

Morning with attention sinks.

Guangxuan Xiao

@Guangxuan_Xiao

10 days

I've written the full story of Attention Sinks — a technical deep-dive into how the mechanism was developed and how our research ended up being used in OpenAI's new OSS models. For those interested in the details:.

0

14

Krishna Mohan

@KMohan2006

10 days

Vibe coded chart

1

0

8

Krishna Mohan

@KMohan2006

11 days

The LLM is the CPU and the context window is the RAM.

3

0

7

Krishna Mohan

@KMohan2006

12 days

RT @AnthropicAI: Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning. htt….

0

1K

0

Krishna Mohan

@KMohan2006

12 days

They cooked world model 🔥.

Google DeepMind

@GoogleDeepMind

13 days

What if you could not only watch a generated video, but explore it too? 🌐. Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

0

9

Krishna Mohan

@KMohan2006

12 days

RT @GoogleDeepMind: What if you could not only watch a generated video, but explore it too? 🌐. Genie 3 is our groundbreaking world model th….

0

3K

0

Krishna Mohan

@KMohan2006

13 days

RT @cwolferesearch: Reinforcement Learning (RL) is quickly becoming the most important skill for AI researchers. Here are the best resource….

0

256

0

Krishna Mohan

@KMohan2006

17 days

Reading about RHLF

1

0

11

Krishna Mohan

@KMohan2006

18 days

AGI, ASI and now PSI - personal super intelligence.

1

0

7

Krishna Mohan

@KMohan2006

18 days

What's the personal super intelligence?.

3

0

8

Krishna Mohan

@KMohan2006

19 days

RT @TheAITimeline: 🚨This week's top AI/ML research papers:. - GSPO.- Diffusion Beats Autoregressive in Data-Constrained Settings.- Gemini 2….

0

73

0

Krishna Mohan

@KMohan2006

20 days

Discipline is explaining to your brain that you need to sacrifice immediate pleasures for greater rewards in future.

0

13