Drishan Arora Profile
Drishan Arora

@drishanarora

Followers
3K
Following
14
Media
1
Statuses
9

AI Researcher @DeepCogito

San Francisco, CA
Joined February 2013
Don't wanna be here? Send us removal request.
@drishanarora
Drishan Arora
2 months
Data quality has the biggest impact on LLM performance - far more than most algorithmic improvements. As we build models that move towards superintelligence, the paradigm for LLM evaluation would need to evolve to increase the strength of the overseer. Very excited to see this -.
@mannatsan
Mannat Sandhu
2 months
Today, we are launching Anthromind, where we are building scalable oversight for AI systems. As LLMs and AI systems grow more intelligent, the data needed to evaluate, supervise and align these models requires higher intelligence, often surpassing human expertise. Traditional
Tweet media one
0
0
1
@drishanarora
Drishan Arora
4 months
All models we create will be open sourced. More details in the blog post:
Tweet card summary image
deepcogito.com
Building general superintelligence
3
22
275
@drishanarora
Drishan Arora
4 months
The Cogito v1 models can be downloaded on @huggingface or @ollama, and can be accessed via API through @FireworksAI_HQ or @togethercompute. Each model can answer directly (standard LLM), or self-reflect before answering (like reasoning models).
4
2
149
@drishanarora
Drishan Arora
4 months
From what we can tell, we're still in the early stages of this scaling curve - IDA is incredibly powerful and generalizes across domains. Most notably, our 70B model also outperforms Llama 4 Scout (109B MoE) distilled from a 2T model. As we improve and iterate on our.
1
2
139
@drishanarora
Drishan Arora
4 months
We use IDA to remove the intelligence ceiling. Simply put, use more computation to let the model arrive at a better solution, and then distill the expensive thinking process to the model's own parameters. As the LLM improves in intelligence, the thinking process itself becomes.
5
15
219
@drishanarora
Drishan Arora
4 months
Traditional LLMs are upper bounded in intelligence by their overseers (larger teacher models or human capabilities). Building superintelligence requires not only matching human-level abilities but also to uncover entirely new capabilities we have yet to imagine.
2
5
150
@drishanarora
Drishan Arora
4 months
Today, we are launching @DeepCogito, where we are building general superintelligence. We are also releasing open models of 3B, 8B, 14B, 32B, and 70B sizes trained using our research on iterated distillation and amplification (IDA). From evals so far, each model outperforms the
Tweet media one
87
319
3K
@drishanarora
Drishan Arora
10 months
RT @southpkcommons: 4/ Deep Cogito. You can build a more powerful language model with more data and more compute. @drishanarora showed how….
0
1
0