Geodesic Research @GeodesResearch X Profile

Geodesic Research

@GeodesResearch

Followers: 36

Following: 17

Media: 4

Statuses: 16

We're behind https://t.co/qHdncaj3hn. Let's align some AIs.

https://t.co/PIqoFDekst

Cambridge, UK

Joined August 2025


Alex Turner

@Turn_Trout

1 month

Self-fulfilling alignment? (image credit: @QuintinPope5) https://t.co/VAho38TDgR

10

20

257

Geodesic Research

@GeodesResearch

12 days

We had a great time on @natolambert's new podcast talking about our new Alignment Pre-Training Research Agenda, where we're focusing on compute-intensive interventions through end-to-end training. https://t.co/Jsy5ssXL8c

0

13

20

Cam Tice

@cam_tice

16 days

https://t.co/3kRPnYDEDc

0

2

7

Geodesic Research

@GeodesResearch

25 days

p(simulation) is going up

0

1

Geodesic Research

@GeodesResearch

1 month

See here for the full post:

lesswrong.com

Background Deliberative alignment is a powerful post-training alignment technique that involves generating and training on re-contextualised supervis…

0

1

Geodesic Research

@GeodesResearch

1 month

We show this phenomenon persists despite filtering our dataset with methods commonly used in modern deliberative alignment training pipelines.

1

0

1

Geodesic Research

@GeodesResearch

1 month

Generalisation hacking is a process by which a model generates outputs via reasoning such that training on these reasoning–output pairs leads to a specific behaviour on a separate distribution.

1

0

Geodesic Research

@GeodesResearch

1 month

We show how deliberative alignment training can be undermined through a process we call 𝘨𝘦𝘯𝘦𝘳𝘢𝘭𝘪𝘴𝘢𝘵𝘪𝘰𝘯 𝘩𝘢𝘤𝘬𝘪𝘯𝘨.

1

4

Geodesic Research

@GeodesResearch

2 months

Find the full work here:

lesswrong.com

Introduction Current reasoning models have surprisingly monitorable chains-of-thought: they struggle to control their CoT without direct optimization…

0

Geodesic Research

@GeodesResearch

2 months

Open-weight labs apply pressure to the CoT in a variety of ways:
• Intense Cold Start SFT for reasoning
• Readability Incentives
• Preference training 𝙖𝙛𝙩𝙚𝙧 reasoning training is completed

1

0

1

Geodesic Research

@GeodesResearch

2 months

What can we learn from open-weight training practices? Maybe a lot: we've just released a blog post that surveys common open-weight training practices across 6 model families and reports their implications for chain-of-thought monitorability.

1

3

Geodesic Research

@GeodesResearch

2 months

TL;DR Open-weight labs apply pressure to the CoT in a variety of ways:
• Intense Cold Start SFT for reasoning
• Readability Incentives
• Preference training 𝙖𝙛𝙩𝙚𝙧 reasoning training is completed

0

1

Geodesic Research

@GeodesResearch

4 months

See https://t.co/1coPiQPc3f for more information on our current projects.

geodesicresearch.org

0

Geodesic Research

@GeodesResearch

4 months

Apart will provide institutional support to our four cohorts built through MARS 3.0. These projects include work on establishing clear metrics for chain-of-thought health, novel AI control protocols, and fine-tuning procedures that increase the monitorability of LLM reasoning.

1

0

Geodesic Research

@GeodesResearch

4 months

We’re partnering with Apart Research because of their track record in facilitating high-quality AI Safety research in their Apart Studio and Fellowship programs. https://t.co/4dBjNQ9IgH

1

0

Geodesic Research

@GeodesResearch

4 months

Geodesic Research would like to announce its partnership with @apartresearch to assist in the development of Geodesic’s fellowship programs!

1

0

1