elvis @omarsar0 X Profile

elvis

@omarsar0

Followers

279K

Following

34K

Media

4K

Statuses

17K

Building @dair_ai • Ex Meta AI, Elastic, PhD • New cohort: https://t.co/xw2XQ0z8up

https://t.co/XQto5ypkSM

DAIR.AI Academy

Joined September 2015

elvis

@omarsar0

21 hours

I love this figure from Anthropic's new talk on "Skills > Agents". Here are my notes: The more skills you build, the more useful Claude Code gets. And it makes perfect sense. Procedural knowledge and continuous learning for the win! Skills essentially are the way you make

47

64

778

elvis

@omarsar0

14 hours

I will be covering Skills in our upcoming Claude Code build sessions. There are so many impactful ways that non-technical or technical folks can build with Skills. The time to jump into this stuff is now: https://t.co/lmI57QlqVs

2

0

21

DAIR.AI

@dair_ai

18 hours

New research from Google: "The Illusion of Deep Learning Architecture". For those following research on continual learning, you may want to bookmark this one. Instead of stacking more layers, what if we give neural networks more levels of learning? The default approach to

16

89

420

elvis

@omarsar0

16 hours

Easy to give Mistral Vibe access to MCP and custom tools. It uses config.toml for setting up custom tools and custom agents as well.

0

8

elvis

@omarsar0

17 hours

It has access to tools like search, grep, bash, and a task manager for planning.

2

0

10

elvis

@omarsar0

17 hours

I really like the built-in theme changer. :)

1

0

6

elvis

@omarsar0

17 hours

https://t.co/8ikbwa39SA

1

0

4

elvis

@omarsar0

17 hours

Looks like Mistral has entered the agentic coding arena! They just released Mistral Vibe CLI, an open-source command-line coding assistant powered by Devstral.

9

13

66

elvis

@omarsar0

2 days

The AI Consumer Index (ACE). Most AI benchmarks today focus on reasoning and coding. But most people use AI to shop, cook, and plan their weekends. In those domains, LLM hallucinations continue to be a real problem. 73% of ChatGPT messages (according to a recent report) are now

26

56

291

DAIR.AI

@dair_ai

2 days

Integrating LLMs with knowledge bases. Important read for AI practitioners. LLMs generate impressive text but struggle with hallucinations, outdated knowledge, and reasoning over structured data. The default response has been scaling up (e.g., more parameters, more compute, more

26

125

725

elvis

@omarsar0

3 days

I totally get this. However, I prefer to prompt LLMs to plan first before jumping in. When that plan is in context it leads to more effective multi-turn conversations and better intent understanding. This is why using plan mode in Claude Code is extremely effective. There are

Andrej Karpathy

@karpathy

3 days

Don't think of LLMs as entities but as simulators. For example, when exploring a topic, don't ask: "What do you think about xyz"? There is no "you". Next time try: "What would be a good group of people to explore xyz? What would they say?" The LLM can channel/simulate many

14

25

268

elvis

@omarsar0

3 days

New survey on Agentic LLMs. The survey spans three interconnected categories: reasoning and retrieval for better decision-making, action-oriented models for practical assistance, and multi-agent systems for collaboration and studying emergent social behavior. Key applications

16

67

342

DAIR.AI

@dair_ai

3 days

Top AI Papers of the Week (Dec 1 - 7):
- DeepSeek-V3.2
- Is Vibe Coding Safe?
- Quiet Feature Learning
- Autonomous Cloud Reliability
- Thinking with Programming Vision
- Evolving Multi-Agent Orchestration
- Training LLMs for Honesty via Confessions
Read on for more:

4

26

166

elvis

@omarsar0

4 days

Google just published a banger guide on effective context engineering for multi-agent systems. Pay attention to this one, AI devs! (bookmark it) Here are my key takeaways: Context windows aren't the bottleneck. Context engineering is. For more complex and long-horizon

29

171

968

elvis

@omarsar0

4 days

This is a really good report on the reality of building agents in production.

DAIR.AI

@dair_ai

4 days

First large-scale study of AI agents actually running in production. The hype says agents are transforming everything. The data tells a different story. Researchers surveyed 306 practitioners and conducted 20 in-depth case studies across 26 domains. What they found challenges

18

44

493

DAIR.AI

@dair_ai

4 days

First large-scale study of AI agents actually running in production. The hype says agents are transforming everything. The data tells a different story. Researchers surveyed 306 practitioners and conducted 20 in-depth case studies across 26 domains. What they found challenges

41

225

1K

elvis

@omarsar0

4 days

// THE CASE FOR ENVIRONMENT SCALING // Environment scaling may be as important as model scaling for agentic AI. Current AI research suggests that building a powerful agentic AI model isn't just about better reasoning. It's also about better environments. The default approach

8

13

101

elvis

@omarsar0

5 days

Next week, we start our first live cohort on building with Claude Code. Excited to share how I've been using Claude Code for coding, research, designing, searching, and everything in between. You also get to build. :) A few more seats are available if you are interested.

6

7

63