Mechanize

@MechanizeWork

Followers: 4K
Following: 0
Media: 16
Statuses: 29

We're a company building RL environments to power the full automation of the economy

San Francisco, CA
Joined April 2025
@MechanizeWork
Mechanize
3 months
Today we’re announcing Mechanize, a startup focused on developing virtual work environments, benchmarks, and training data that will enable the full automation of the economy. We will achieve this by creating simulated environments and evaluations that capture the full scope of.
85
57
640
@MechanizeWork
Mechanize
13 hours
xAI is probably the first to spend as much compute on RL as on pretraining. The easy gains from shifting compute to RL are now gone. With this arbitrage closed, RL scaling will slow. Progress will now come from the quality and realism of RL environments rather than mere scaling.
7
7
143
@MechanizeWork
Mechanize
15 hours
Read the blog post here:
0
0
16
@MechanizeWork
Mechanize
16 hours
The future is building software, not curating static datasets. Today’s AI systems learn best by interacting with digital environments, attempting tasks, and learning from outcomes. This demands dedicated, full-time specialists with strong expertise, not outsourced contractors.
1
0
25
@MechanizeWork
Mechanize
16 hours
Previously, AI progress relied heavily on monotonous, low-skill labeling from third-party contractors producing basic text, visual, and audio data at scale. But models have outgrown simple tasks, demanding richer context and deeper expertise. The era of sweatshop data is over.
8
7
164
@MechanizeWork
Mechanize
2 days
Read the blog post here:
0
0
7
@MechanizeWork
Mechanize
2 days
We call this approach replication training: tasking AI models with exactly replicating existing software using clear specs and references. This trains models to read precisely, execute reliably, and demonstrate resilience in complex, long-horizon tasks.
1
0
8
@MechanizeWork
Mechanize
2 days
Imagine if pretraining a language model meant manually creating the entire training corpus. Clearly, this would be impractical. Instead, we leverage vast content available online. We expect RL will follow a similar path, replicating abundant existing software as RL tasks.
3
2
64
@MechanizeWork
Mechanize
4 days
Beyond this point, we expect models trained via RL to acquire powerful, task-agnostic, few-shot capabilities on tasks that currently require painstaking, task-specific training, just as GPT-3 unlocked few-shot capabilities for language.
2
0
16
@MechanizeWork
Mechanize
4 days
How much bigger must RL get to have a GPT-3 moment? We expect this will soon require roughly 10,000 years of cumulative human-equivalent task time, comparable to GTA V or major operating systems.
2
4
56
@MechanizeWork
Mechanize
19 days
Read the blog post here:
0
2
22
@MechanizeWork
Mechanize
19 days
Replication tasks target critical skills current models lack: accurately interpreting instructions, reliably recovering from mistakes, and sustaining precise execution on tasks that humans take months to complete, moving us closer to reliable, capable AI agents.
1
0
28
@MechanizeWork
Mechanize
19 days
Replication training naturally extends existing AI trends by building tasks directly from abundant human-generated data already available online.
1
0
25
@MechanizeWork
Mechanize
19 days
We are proposing a new AI paradigm called replication training: tasking AIs to precisely replicate existing software. Abundant internet text unlocked powerful language models. Similarly, abundant software available today will enable massive-scale, efficient RL training.
9
32
405
@MechanizeWork
Mechanize
20 days
Read the blog post here:
1
1
36
@MechanizeWork
Mechanize
20 days
Before GPT-3, achieving good performance required specialized fine-tuning for each task. Today's RL is similar: models need to be carefully trained to handle tasks like deep research, web search, or coding. But we think RL will soon have its GPT-3 moment.
11
49
477
@MechanizeWork
Mechanize
30 days
RT @tamaybes: Most AI labs talk about merely “augmenting” humans at work. They say this because AI currently falls short, not out of some d…
0
17
0
@MechanizeWork
Mechanize
1 month
Read the full essay here:
0
1
34
@MechanizeWork
Mechanize
1 month
Software engineering tasks might be among the earliest automated, yet fully replacing software engineers will happen surprisingly late, likely after AI takes over a large share of white-collar jobs.
2
0
16
@MechanizeWork
Mechanize
1 month
AI-generated code continues a decades-long climb toward greater abstraction, from compilers to frameworks. Automation shifts what engineers do long before it erases the role.
1
1
17
@MechanizeWork
Mechanize
1 month
AI will soon write most lines of code. But just as compilers didn’t eliminate software engineers, this new wave of automation won’t erase the role immediately.
1
0
15