akira
@realmcore_
Followers
5K
Following
11K
Media
58
Statuses
2K
In search of emergence • @0xrandomlabs
elysium
Joined November 2021
Agents that can do large-scale migration tasks are *almost* here. A migration like this could take an engineer days by hand. We just posted an update to our blog on what this looks like, including the output of a long-running Python-to-TypeScript translation task. Find it in the blog post!
33
52
738
🚀 Announcing the Helicone AI Gateway - officially open to everyone! 1 API key. 100+ models. 0% markup fees. We spent the last 2 months ensuring the Helicone AI Gateway is stable and ready for our Product Hunt launch today! Here's what's been going on 👇
8
3
32
agent performance = LLM + prompt + tools (like retrieval) + agent harness. The agent harness is the most underrated right now.
31
5
126
@jeffreyhuber harness engineering. Lots of overlap between tools + harness, and prompts + harness, since the harness provides tools (and has opinions about how you add tools) and provides / modifies prompts.
1
1
7
The basis for harness engineering for agents is mostly the following 3 questions: What is the stack around the token sampling and parsing? How does the agent interact with the world? How does it know/see the env state?
0
0
2
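The three questions above can be made concrete with a toy loop. This is only a minimal sketch under stated assumptions, not how Slate or any real harness works: `parse_action`, `run_harness`, `scripted_model`, and the one-JSON-object-per-turn format are all hypothetical names and choices made up for illustration, and the "model" is a scripted stand-in rather than an LLM.

```python
import json
from typing import Callable, Dict, List

# All names here are hypothetical; nothing comes from a real agent library.
Tool = Callable[[str], str]


def parse_action(raw: str) -> dict:
    """Question 1: the stack around sampling and parsing.
    Assumes the model emits one JSON object per turn."""
    try:
        action = json.loads(raw)
    except json.JSONDecodeError:
        return {"tool": "invalid", "arg": raw}
    if not isinstance(action, dict) or "tool" not in action:
        return {"tool": "invalid", "arg": raw}
    return action


def run_harness(
    sample: Callable[[List[str]], str],
    tools: Dict[str, Tool],
    task: str,
    max_steps: int = 5,
) -> List[str]:
    transcript = [f"task: {task}"]
    for _ in range(max_steps):
        action = parse_action(sample(transcript))
        name = action["tool"]
        if name == "done":
            transcript.append(f"done: {action.get('arg', '')}")
            break
        # Question 2: the agent only touches the world through tools.
        tool = tools.get(name)
        if tool is None:
            observation = f"error: unknown tool {name!r}"
        else:
            observation = tool(str(action.get("arg", "")))
        # Question 3: env state reaches the agent as observations
        # appended to the transcript it sees on the next turn.
        transcript.append(f"{name} -> {observation}")
    return transcript


def scripted_model(transcript: List[str]) -> str:
    """Stand-in for an LLM: read one file, then stop."""
    if len(transcript) == 1:
        return json.dumps({"tool": "read_file", "arg": "notes.txt"})
    return json.dumps({"tool": "done", "arg": "finished"})


files = {"notes.txt": "hello"}
tools = {"read_file": lambda path: files.get(path, "error: no such file")}
result = run_harness(scripted_model, tools, "read the notes")
print(result)
```

Each of the three questions maps to one seam in the loop, which is roughly why harness choices overlap with both tools and prompts.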
"Context engineering" and "prompt engineering" are basically the same, and unless you're designing model behavior you are just prompting. There is no system being engineered.
0
2
4
Just like calorie counting: IT'S ALL TOKENS IN, TOKENS OUT. Stop force-feeding your models.
0
0
7
I keep seeing long-horizon task discourse pop up on the timeline, and this is something we do extremely well. If you're tired of having agent sessions that die on you, you know where to look.
Imagine if your coding agent didn't have to compact its memory to run for hours. Stop imagining. It's real, and it's in Slate.
0
0
7
Imagine if your coding agent didn't have to compact its memory to run for hours. Stop imagining. It's real, and it's in Slate.
Slate - Changelog 0.0.21 Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in Slate. Additionally: - Image support - ctrl+z (not cmd+z) etc. Video related.
0
1
5
To anyone considering working on something of their own: the work won't feel like work if you genuinely care about your craft and making good things.
0
0
14
Lots of agents suffer from weird conversational flows due to the nature of the data they are trained on vs how we are trying to use them. This leads to early stopping or difficult back and forth. Slate feels MUCH smoother to work with now in the new update.
Slate - Changelog 0.0.21 Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in Slate. Additionally: - Image support - ctrl+z (not cmd+z) etc. Video related.
1
0
13
Slate - Changelog 0.0.21 Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in Slate. Additionally: - Image support - ctrl+z (not cmd+z) etc. Video related.
3
3
12
Pretty sure Claude just got a performance bump. Feels like something happened at Anthropic, maybe infra related. 20% chance it's a solution space I haven't explored, might be proportional to time spent with model. But the generalized behavior improvement feels like an RL gain.
3
0
9
The whole SWE-1.5 / Composer-1 thing is really simple, guys. You want to be in the loop? Speed is higher priority (with minimum intelligence). You want to be out of the loop? Speed is STILL important! But latency doesn't hurt UX, enabling smarter models.
0
0
7
It's clear to me that some players want to win by advancing the space and some want to win by being the best at driving everything to zero. Guess who. But in the end it's all a scale and commodity game.
1
0
1
Everyone is stuck building single-loop coding agents and optimizing for short trajectories. This doesn't scale.
0
0
1