akira

@realmcore_

Followers
5K
Following
11K
Media
58
Statuses
2K

In search of emergence • @0xrandomlabs

elysium
Joined November 2021
@realmcore_
akira
1 month
Agents that can do large-scale migration tasks are *almost* here. This could take an engineer days by hand. We just posted an update to our blog on what this looks like, including the output of a long-running Python-to-TypeScript translation task. Find it in the blog post!
33
52
738
@justinstorre
Justin Torre
2 days
🚀 Announcing the Helicone AI Gateway - officially open to everyone! 1 API key. 100+ models. 0% markup fees. We spent the last 2 months ensuring the Helicone AI Gateway is stable and ready for our Product Hunt launch today! Here's what's been going on 👇
8
3
32
@jeffreyhuber
Jeff Huber
4 days
agent performance = LLM + prompt + tools (like retrieval) + agent harness. the agent harness is the most underrated right now
31
5
126
@0xblacklight
Kyle Mistele 🏴‍☠️
4 days
@jeffreyhuber harness engineering. lots of overlap between tools + harness, and prompts + harness, since the harness provides tools (and has opinions about how you add tools) and provides / modifies prompts
1
1
7
@realmcore_
akira
4 days
The harness itself is extremely underrated. It has been for a while
@jeffreyhuber
Jeff Huber
4 days
agent performance = LLM + prompt + tools (like retrieval) + agent harness. the agent harness is the most underrated right now
1
0
3
@realmcore_
akira
4 days
The basis for harness engineering for agents is mostly the following 3 questions: (1) What is the stack around the token sampling and parsing? (2) How does the agent interact with the world? (3) How does it know/see the env state?
0
0
2
@realmcore_
akira
4 days
"Context engineering" and "Prompt engineering" are basically the same, and unless you're designing model behavior you are just prompting. There is no system being engineered.
0
2
4
@realmcore_
akira
4 days
Just like calorie counting: IT'S ALL TOKENS IN, TOKENS OUT. Stop force-feeding your models.
0
0
7
@0xrandomlabs
Random Labs
5 days
This is the least autonomous Slate will ever be
0
2
5
@realmcore_
akira
5 days
just move to sf
@adriannalakatos
Adrianna Lakatos
5 days
how to go viral on sf twitter 101. step 1: take a photo of the golden gate bridge. step 2: caption it "just move to sf". step 3: congrats. enjoy your 500k views and 200 replies debating the rising cost of rent. repeat weekly for optimal engagement
0
0
8
@realmcore_
akira
5 days
I keep seeing long-horizon task discourse pop up on the timeline, and this is something we do extremely well. If you're tired of having agent sessions that die on you, you know where to look.
@0xrandomlabs
Random Labs
5 days
Imagine if your coding agent didn't have to compact its memory to run for hours. Stop imagining. It's real, and it's in Slate.
0
0
7
@0xrandomlabs
Random Labs
5 days
Imagine if your coding agent didn't have to compact its memory to run for hours. Stop imagining. It's real, and it's in Slate.
@0xrandomlabs
Random Labs
7 days
Slate - Changelog 0.0.21. Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in Slate. Additionally: image support; ctrl+z (not cmd+z), etc. Video related.
0
1
5
@realmcore_
akira
5 days
We def do not need more advanced RL to do this. A better agent harness is all you really need. Codebase maintenance is a process/awareness problem for 90% of cases.
@samsja19
samsja
6 days
It's time to teach models to maintain codebases and not just to write code. We need more advanced RL environments
0
0
5
@realmcore_
akira
6 days
To anyone considering working on something of their own: the work won't feel like work if you genuinely care about your craft and making good things
0
0
14
@realmcore_
akira
6 days
what if we launched nvidia-smi but for agents?
0
0
1
@realmcore_
akira
7 days
Lots of agents suffer from weird conversational flows due to the nature of the data they are trained on vs how we are trying to use them. This leads to early stopping or difficult back and forth. Slate feels MUCH smoother to work with now in the new update.
@0xrandomlabs
Random Labs
7 days
Slate - Changelog 0.0.21. Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in Slate. Additionally: image support; ctrl+z (not cmd+z), etc. Video related.
1
0
13
@0xrandomlabs
Random Labs
7 days
Slate - Changelog 0.0.21. Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in Slate. Additionally: image support; ctrl+z (not cmd+z), etc. Video related.
3
3
12
@realmcore_
akira
7 days
Pretty sure Claude just got a performance bump. Feels like something happened at Anthropic, maybe infra related. 20% chance it's a solution space I haven't explored, which might be proportional to time spent with the model. But the generalized behavior improvement feels like an RL gain.
3
0
9
@realmcore_
akira
7 days
The whole SWE-1.5 / Composer-1 thing is really simple, guys. You want to be in the loop? Speed is higher priority (with minimum intelligence). You want to be out of the loop? Speed is STILL important! But latency doesn't hurt UX, enabling smarter models.
0
0
7
@realmcore_
akira
8 days
It's clear to me that some players want to win by advancing the space and some want to win by being the best at driving everything to zero. Guess who. But in the end it's all a scale and commodity game.
1
0
1
@realmcore_
akira
8 days
Everyone is stuck building single-loop coding agents and optimizing for short trajectories. This doesn't scale.
0
0
1