Geodesic Research

@GeodesResearch

Followers
36
Following
17
Media
4
Statuses
16

We're behind https://t.co/qHdncaj3hn. Let's align some AIs.

Cambridge, UK
Joined August 2025
@Turn_Trout
Alex Turner
1 month
Self-fulfilling alignment? (image credit: @QuintinPope5) https://t.co/VAho38TDgR
10
20
257
@GeodesResearch
Geodesic Research
12 days
We had a great time on @natolambert's new podcast talking about our new Alignment Pre-Training Research Agenda, where we're focusing on compute-intensive interventions through end-to-end training. https://t.co/Jsy5ssXL8c
0
13
20
@cam_tice
Cam Tice
16 days
0
2
7
@GeodesResearch
Geodesic Research
25 days
p(simulation) is going up
0
0
1
@GeodesResearch
Geodesic Research
1 month
We show this phenomenon persists despite filtering our dataset with methods commonly used in modern deliberative alignment training pipelines.
1
0
1
@GeodesResearch
Geodesic Research
1 month
Generalisation hacking is a process by which a model generates outputs via reasoning such that training on these reasoning–output pairs leads to a specific behaviour on a separate distribution.
1
0
0
@GeodesResearch
Geodesic Research
1 month
We show how deliberative alignment training can be undermined through a process we term 𝘨𝘦𝘯𝘦𝘳𝘢𝘭𝘪𝘴𝘢𝘵𝘪𝘰𝘯 𝘩𝘢𝘤𝘬𝘪𝘯𝘨.
1
1
4
@GeodesResearch
Geodesic Research
2 months
Open-weight labs apply pressure to the CoT in a variety of ways:
• Intense Cold Start SFT for reasoning
• Readability Incentives
• Preference training 𝙖𝙛𝙩𝙚𝙧 reasoning training is completed
1
0
1
@GeodesResearch
Geodesic Research
2 months
What can we learn from open-weight training practices? Maybe a lot -- we've just released a blog post that surveys common open-weight training practices across 6 model families and reports their implications for chain-of-thought monitorability.
1
1
3
@GeodesResearch
Geodesic Research
2 months
TL;DR Open-weight labs apply pressure to the CoT in a variety of ways:
• Intense Cold Start SFT for reasoning
• Readability Incentives
• Preference training 𝙖𝙛𝙩𝙚𝙧 reasoning training is completed
0
0
1
@GeodesResearch
Geodesic Research
4 months
See https://t.co/1coPiQPc3f for more information on our current projects.
geodesicresearch.org
0
0
0
@GeodesResearch
Geodesic Research
4 months
Apart will provide institutional support to our four cohorts built through MARS 3.0. These projects include work on establishing clear metrics for chain-of-thought health, novel AI control protocols, and fine-tuning procedures that increase the monitorability of LLM reasoning.
1
0
0
@GeodesResearch
Geodesic Research
4 months
We’re partnering with Apart Research due to their track record in facilitating high-quality AI Safety research in their Apart Studio and Fellowship programs. https://t.co/4dBjNQ9IgH.
1
0
0
@GeodesResearch
Geodesic Research
4 months
Geodesic Research would like to announce its partnership with @apartresearch to assist in the development of Geodesic’s fellowship programs!
1
0
1