
Tilde
@tilderesearch
~6/6~ We would like to credit this LessWrong post for inspiration. Big shoutout to the @NebiusAI solutions team for their assistance in testing on their platform.
And if you don’t like graph theory but do like interpretability, we have plenty of other fun problems, so feel free to email us at join@tilderesearch.com. We are doing a lot of applied interpretability work like this: which was the first application of.
Thank you to @ArthurConmy, @NeelNanda5, and @StephenLCasper for their comments and suggestions during the drafting process. This blog post is joint work with @a_karvonen, and the task is derived from Benchify.