Barry Zhang Profile
Barry Zhang

@barry_zyj

Followers
4K
Following
740
Media
12
Statuses
85

Research Engineer @Anthropic

New York
Joined December 2022
Don't wanna be here? Send us removal request.
@barry_zyj
Barry Zhang
13 days
Let me pitch you Skills: - They allow anyone to customize agents with a simple primitive — files - They give us continuous learning until continuous learning arrives - They are powerful, composable, and sufficiently AGI-pilled - you can now say “probably a skill issue” at work
@claudeai
Claude
13 days
Claude can now use Skills. Skills are packaged instructions that teach Claude your way of working.
49
70
1K
@barry_zyj
Barry Zhang
5 months
Multi-agent search was easy to prototype (V1 only took me a day) but incredibly complex to bring to production. Feels like building RAG in 2022: we see the promise but are still figuring out the primitives. Sharing our journey so far - hope it helps your exploration!
@AnthropicAI
Anthropic
5 months
New on the Anthropic Engineering blog: how we built Claude’s research capabilities using multiple agents working in parallel. We share what worked, what didn't, and the engineering challenges along the way. https://t.co/k3Gzd4HkLg
20
27
551
@barry_zyj
Barry Zhang
8 months
My workflow for the past few months: - Write a design doc (with Claude ofc) - Ask Claude to generate tests - Verify the tests - Let it cook
@AnthropicAI
Anthropic
8 months
This approach has made Sonnet the model of choice for developers worldwide. In addition to our new model, we're launching Claude Code, our first coding tool, in a limited research preview. With Claude Code, you can delegate substantial tasks to Claude—right from your terminal.
8
11
172
@AnthropicAI
Anthropic
8 months
Introducing Claude 3.7 Sonnet: our most intelligent model to date. It's a hybrid reasoning model, producing near-instant responses or extended, step-by-step thinking. One model, two ways to think. We’re also releasing an agentic coding tool: Claude Code.
1K
3K
19K
@barry_zyj
Barry Zhang
10 months
Success in the LLM space isn't about building the most sophisticated system. It's about building the right system for your needs. Had a lot of fun writing this with @ErikSchluntz!
@alexalbert__
Alex Albert
10 months
2025 will be the year of agentic systems The pieces are falling into place: computer use, MCP, improved tool use. It's time to start thinking about building these systems. At Anthropic, we're seeing a few best practices emerge - we wrote a blog post with our findings:
11
8
121
@barry_zyj
Barry Zhang
11 months
Graph-based memory that you can customize is now open-source with MCP https://t.co/m1n5Zkmalr https://t.co/k67mGSwrdi
2
1
30
@EricSimons
Eric Simons
1 year
Case study on https://t.co/sXP5wkc5HK from @AnthropicAI: - Zero to $4m ARR in 4 wks - 100k+ of y'all using Bolt every wk - One of fastest growing AI tools globally Wild & surreal! Ty for all the support & patience as we scale y'all 🙇 Let's go!
Tweet card summary image
bolt.fyi
Learn how StackBlitz grew Bolt.new to $4m in ARR with over 100k+ weekly users in less than 4 weeks using Anthropic & Claude.
85
85
950
@AnthropicAI
Anthropic
1 year
Introducing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku. We’re also introducing a new capability in beta: computer use. Developers can now direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking, and typing text.
481
2K
10K
@barry_zyj
Barry Zhang
1 year
Just saying...🍓
1
0
6
@kscalelabs
K-Scale Labs
2 years
K-Scale Labs just launched on @ycombinator's Launch YC! K-Scale Labs 🤖 Open-source humanoid robots for everyone. Check them out:
Tweet card summary image
ycombinator.com
Meet Stompy, the world's first open-source general-purpose humanoid robot
13
61
297
@EnricoShippole
Enrico Shippole
2 years
@TeraflopAI is excited to help support the @caselawaccess and @HarvardLIL, in the release of over 6.6 million state and federal court decisions published throughout U.S. history.
3
36
95
@normativeai
Norm Ai
2 years
Hosting an event for our clients and leaders in compliance, financial services and law, on February 6 at @NYSE to discuss AI Agents & the Law. The event will begin with a live one-on-one discussion between @johnjnay and Lawrence H. Summers, Harvard University Professor and
2
6
33
@jerryjliu0
Jerry Liu
2 years
How much data do you need to fine-tune GPT-3.5? @barry_zyj et al. have a great article showing that you only need ~100 datapoints for better output structuring and customizing the tone. Lots of great stuff incl. cost/latency analysis. Check it out! https://t.co/nKU2ebHKSE
2
1
19
@barry_zyj
Barry Zhang
2 years
"Enough to gauge your incompetence"
0
0
2
@barry_zyj
Barry Zhang
2 years
This works on many levels
0
0
1
@barry_zyj
Barry Zhang
2 years
How to break GPT-3.5 through fine-tuning... more coming soon
0
0
1
@swyx
swyx
2 years
Q: What's better than Mixture of Experts? A: Mixture of Mixture of Experts! Introducing GodMode: the AI Chat Browser https://t.co/uql5ZPkCTj Fast, Free, Access to ChatGPT, Bing, Bard, Claude, YouChat, Poe, Perplexity, Phind, and Local/GGML Models like Vicuna and Alpaca No
56
160
892
@johnjnay
John Nay
2 years
I founded a second AI company. We're pioneering an empirical, practical approach to AI alignment, leveraging law/policy & LLMs. Backed by top venture firms and AI researchers in the Bay Area & NYC, we're hiring 5 more Members of our Technical Staff. DM me your CV if interested
23
47
299
@barry_zyj
Barry Zhang
2 years
This is what we are using to evaluate LLMs? (QUASAR-T)
0
0
2