elvis

@omarsar0

Followers
279K
Following
34K
Media
4K
Statuses
17K

Building @dair_ai • Ex Meta AI, Elastic, PhD • New cohort: https://t.co/xw2XQ0z8up

DAIR.AI Academy
Joined September 2015
@omarsar0
elvis
21 hours
I love this figure from Anthropic's new talk on "Skills > Agents". Here are my notes: The more skills you build, the more useful Claude Code gets. And it makes perfect sense. Procedural knowledge and continuous learning for the win! Skills essentially are the way you make
47
64
778
@omarsar0
elvis
14 hours
I will be covering Skills in our upcoming Claude Code build sessions. There are so many impactful ways that non-technical or technical folks can build with Skills. The time to jump into this stuff is now: https://t.co/lmI57QlqVs
2
0
21
@dair_ai
DAIR.AI
18 hours
New research from Google: "The Illusion of Deep Learning Architecture". For those following research on continual learning, you may want to bookmark this one. Instead of stacking more layers, what if we give neural networks more levels of learning? The default approach to
16
89
420
@omarsar0
elvis
16 hours
Easy to give Mistral Vibe access to MCP and custom tools. It uses config.toml for setting up custom tools and custom agents as well.
0
0
8
@omarsar0
elvis
17 hours
It has access to tools like search, grep, bash, and a task manager for planning.
2
0
10
@omarsar0
elvis
17 hours
I really like the built-in theme changer. :)
1
0
6
@omarsar0
elvis
17 hours
1
0
4
@omarsar0
elvis
17 hours
Looks like Mistral has entered the agentic coding arena! They just released Mistral Vibe CLI, an open-source command-line coding assistant powered by Devstral.
9
13
66
@omarsar0
elvis
2 days
The AI Consumer Index (ACE) Most AI benchmarks today focus on reasoning and coding. But most people use AI to shop, cook, and plan their weekends. In those domains, LLM hallucinations continue to be a real problem. 73% of ChatGPT messages (according to a recent report) are now
26
56
291
@dair_ai
DAIR.AI
2 days
Integrating LLMs with knowledge bases. Important read for AI practitioners. LLMs generate impressive text but struggle with hallucinations, outdated knowledge, and reasoning over structured data. The default response has been scaling up (e.g., more parameters, more compute, more
26
125
725
@omarsar0
elvis
3 days
I totally get this. However, I prefer to prompt LLMs to plan first before jumping in. When that plan is in context it leads to more effective multi-turn conversations and better intent understanding. This is why using plan mode in Claude Code is extremely effective. There are
@karpathy
Andrej Karpathy
3 days
Don't think of LLMs as entities but as simulators. For example, when exploring a topic, don't ask: "What do you think about xyz"? There is no "you". Next time try: "What would be a good group of people to explore xyz? What would they say?" The LLM can channel/simulate many
14
25
268
@omarsar0
elvis
3 days
New survey on Agentic LLMs. The survey spans three interconnected categories: reasoning and retrieval for better decision-making, action-oriented models for practical assistance, and multi-agent systems for collaboration and studying emergent social behavior. Key applications
16
67
342
@dair_ai
DAIR.AI
3 days
Top AI Papers of the Week (Dec 1 - 7): - DeepSeek-V3.2 - Is Vibe Coding Safe? - Quiet Feature Learning - Autonomous Cloud Reliability - Thinking with Programming Vision - Evolving Multi-Agent Orchestration - Training LLMs for Honesty via Confessions Read on for more:
4
26
166
@omarsar0
elvis
4 days
Google just published a banger guide on effective context engineering for multi-agent systems. Pay attention to this one, AI devs! (bookmark it) Here are my key takeaways: Context windows aren't the bottleneck. Context engineering is. For more complex and long-horizon
29
171
968
@omarsar0
elvis
4 days
This is a really good report on the reality of building agents in production.
@dair_ai
DAIR.AI
4 days
First large-scale study of AI agents actually running in production. The hype says agents are transforming everything. The data tells a different story. Researchers surveyed 306 practitioners and conducted 20 in-depth case studies across 26 domains. What they found challenges
18
44
493
@dair_ai
DAIR.AI
4 days
First large-scale study of AI agents actually running in production. The hype says agents are transforming everything. The data tells a different story. Researchers surveyed 306 practitioners and conducted 20 in-depth case studies across 26 domains. What they found challenges
41
225
1K
@omarsar0
elvis
4 days
// THE CASE FOR ENVIRONMENT SCALING // Environment scaling may be as important as model scaling for agentic AI. Current AI research suggests that building a powerful agentic AI model isn't just about better reasoning. It's also about better environments. The default approach
8
13
101
@omarsar0
elvis
5 days
Next week, we start our first live cohort on building with Claude Code. Excited to share how I've been using Claude Code for coding, research, designing, searching, and everything in between. You also get to build. :) A few more seats are available if you are interested.
6
7
63