akira @realmcore_ X Profile

akira

@realmcore_

Followers

5K

Following

11K

Media

58

Statuses

2K

In search of emergence • @0xrandomlabs

elysium

Joined November 2021

akira

@realmcore_

1 month

Agents that can do large-scale migration tasks are *almost* here. This could take an engineer days by hand. We just posted an update to our blog on what this looks like, including the output of a long-running Python-to-TypeScript translation task. Find it in the blog post!

33

52

738

Justin Torre

@justinstorre

2 days

🚀 Announcing the Helicone AI Gateway - officially open to everyone! 1 API key. 100+ models. 0% markup fees. We spent the last 2 months ensuring the Helicone AI Gateway is stable and ready for our Product Hunt launch today! Here's what's been going on 👇

8

3

32

Jeff Huber

@jeffreyhuber

4 days

agent performance = LLM + prompt + tools (like retrieval) + agent harness. the agent harness is the most underrated right now

31

5

126

Kyle Mistele 🏴‍☠️

@0xblacklight

4 days

@jeffreyhuber harness engineering: lots of overlap between tools + harness, and prompts + harness, since the harness provides tools (and has opinions about how you add tools) and provides / modifies prompts

1

7

akira

@realmcore_

4 days

The harness itself is extremely underrated. It has been for a while

Jeff Huber

@jeffreyhuber

4 days

agent performance = LLM + prompt + tools (like retrieval) + agent harness. the agent harness is the most underrated right now

1

0

3

akira

@realmcore_

4 days

The basis for harness engineering for agents is mostly the following 3 questions: (1) What is the stack around the token sampling and parsing? (2) How does the agent interact with the world? (3) How does it know/see the env state?
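
One way to picture those three questions is a toy harness, sketched below under loud assumptions: `Harness`, `sample`, `parse`, `tools`, and `observe` are hypothetical names invented for illustration, the "model" is a stub function, and the one-line `tool arg` action format is made up. It is not how Slate or any real agent framework works, just a minimal shape with one slot per question.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Harness:
    # Question 1: the stack around sampling and parsing (sample + parse).
    sample: Callable[[str], str]            # hypothetical model call: prompt -> raw text
    # Question 2: how the agent interacts with the world (tools).
    tools: Dict[str, Callable[[str], str]]
    # Question 3: how the agent sees the environment state (observe).
    observe: Callable[[], str]
    history: List[str] = field(default_factory=list)

    def parse(self, raw: str) -> Tuple[str, str]:
        # Assumed action format: "<tool name> <argument text>".
        name, _, arg = raw.strip().partition(" ")
        return name, arg

    def step(self) -> bool:
        # Build the prompt from past actions plus a fresh view of the state.
        prompt = "\n".join(self.history + [f"STATE: {self.observe()}"])
        name, arg = self.parse(self.sample(prompt))
        if name == "done":
            return False
        tool = self.tools.get(name)
        result = tool(arg) if tool else f"unknown tool: {name}"
        self.history.append(f"{name} {arg} -> {result}")
        return True

    def run(self, max_steps: int = 10) -> int:
        # Returns the number of tool-using steps taken before stopping.
        steps = 0
        while steps < max_steps and self.step():
            steps += 1
        return steps


# Toy environment: an in-memory "file system".
files: Dict[str, str] = {}


def write(arg: str) -> str:
    path, _, text = arg.partition(" ")
    files[path] = text
    return "ok"


def stub_model(prompt: str) -> str:
    # Stand-in for an LLM: write one file, then stop once it shows up in the prompt.
    return "done" if "notes.txt" in prompt else "write notes.txt hello"


harness = Harness(
    sample=stub_model,
    tools={"write": write},
    observe=lambda: f"files={sorted(files)}",
)
steps = harness.run()
```

In this sketch, swapping the model only changes `sample`, while changing what the agent can do or see means touching `tools` or `observe`, which are harness decisions.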

0

2

akira

@realmcore_

4 days

"Context engineering" and "prompt engineering" are basically the same, and unless you're designing model behavior you are just prompting; there is no system being engineered

0

2

4

akira

@realmcore_

4 days

Just like calorie counting: IT'S ALL TOKENS IN, TOKENS OUT. Stop force-feeding your models.

0

7

Random Labs

@0xrandomlabs

5 days

This is the least autonomous slate will ever be

0

2

5

akira

@realmcore_

5 days

just move to sf

Adrianna Lakatos

@adriannalakatos

5 days

how to go viral on sf twitter 101
step 1: take a photo of the golden gate bridge
step 2: caption it "just move to sf"
step 3: congrats. enjoy your 500k views and 200 replies debating the rising cost of rent
repeat weekly for optimal engagement

0

8

akira

@realmcore_

5 days

I keep seeing long-horizon task discourse pop up on the timeline, and this is something we do extremely well. If you're tired of having agent sessions that die on you, you know where to look

Random Labs

@0xrandomlabs

5 days

Imagine if your coding agent didn't have to compact its memory to run for hours. Stop imagining. It's real, and it's in slate.

0

7

Random Labs

@0xrandomlabs

7 days

Slate - Changelog 0.0.21
Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in slate.
Additionally:
- Image support
- ctrl+z (not cmd+z) etc.
Video related.

0

1

5

akira

@realmcore_

5 days

We def do not need more advanced RL to do this. A better agent harness is all you really need. Codebase maintenance is a process/awareness problem in 90% of cases

samsja

@samsja19

6 days

It's time to teach models to maintain codebases and not just to write code. we need more advanced RL environments

0

5

akira

@realmcore_

6 days

To anyone considering working on something of their own: the work won't feel like work if you genuinely care about your craft and making good things

0

14

akira

@realmcore_

6 days

what if we launched nv-smi but for agents?

0

1

akira

@realmcore_

7 days

Lots of agents suffer from weird conversational flows due to the nature of the data they are trained on vs how we are trying to use them. This leads to early stopping or difficult back and forth. Slate feels MUCH smoother to work with now in the new update.

Random Labs

@0xrandomlabs

7 days

Slate - Changelog 0.0.21
Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in slate.
Additionally:
- Image support
- ctrl+z (not cmd+z) etc.
Video related.

1

0

13

Random Labs

@0xrandomlabs

7 days

Slate - Changelog 0.0.21
Many agents suffer from either not being conversational enough, or stopping tasks early. This has been mostly solved in slate.
Additionally:
- Image support
- ctrl+z (not cmd+z) etc.
Video related.

3

12

akira

@realmcore_

7 days

Pretty sure Claude just got a performance bump. Feels like something happened at Anthropic, maybe infra-related. 20% chance it's a solution space I haven't explored; might be proportional to time spent with the model. But the generalized behavior improvement feels like an RL gain

3

0

9

akira

@realmcore_

7 days

The whole SWE-1.5 / Composer-1 thing is really simple, guys. You want to be in the loop? Speed is higher priority (with minimum intelligence). You want to be out of the loop? Speed is STILL important! But latency doesn't hurt UX, enabling smarter models

0

7

akira

@realmcore_

8 days

It's clear to me that some players want to win by advancing the space and some want to win by being the best at driving everything to zero. Guess who. But in the end it's all a scale and commodity game

1

0

1

akira

@realmcore_

8 days

Everyone is stuck building single-loop coding agents and optimizing for short trajectories. This doesn't scale

0

1