KobiHackenburg Profile Banner
Kobi Hackenburg Profile
Kobi Hackenburg

@KobiHackenburg

Followers
972
Following
487
Media
37
Statuses
184

phd candidate @oiioxford @uniofoxford | research scientist @AISecurityInst | AI, social data science, persuasion with language models

Oxford
Joined September 2018
Don't wanna be here? Send us removal request.
@KobiHackenburg
Kobi Hackenburg
1 month
Today (w/ @UniofOxford @Stanford @MIT @LSEnews) we’re sharing the results of the largest AI persuasion experiments to date: 76k participants, 19  LLMs, 707 political issues. We examine “levers” of AI persuasion: model scale, post-training, prompting, personalization, & more.🧵
Tweet media one
14
129
435
@KobiHackenburg
Kobi Hackenburg
8 days
If this sounds exciting, come worth with us! . You can reach out to me or @hannahrosekirk @AnnaGausen @LLuettgau with any questions! . Apply here (application closes in 3 weeks): .
job-boards.eu.greenhouse.io
London, UK
0
0
2
@grok
Grok
2 days
Join millions who have switched to Grok.
116
233
2K
@KobiHackenburg
Kobi Hackenburg
8 days
This is a great chance to work on ambitious and rigorous research projects in areas like:. 🎭 AI persuasion. 🧠 theory of mind. 🫀 socio-affective human-AI relationships . Here’s an example of the kind of projects you’d be working on:.
@KobiHackenburg
Kobi Hackenburg
1 month
Today (w/ @UniofOxford @Stanford @MIT @LSEnews) we’re sharing the results of the largest AI persuasion experiments to date: 76k participants, 19  LLMs, 707 political issues. We examine “levers” of AI persuasion: model scale, post-training, prompting, personalization, & more.🧵
Tweet media one
1
1
3
@KobiHackenburg
Kobi Hackenburg
8 days
My team at @AISecurityInst is hiring a research assistant to work on Human Influence research! 🧠👾💻. -6 month residency w/ good salary .-Ideal for recent MSc or early PhD students in ML, AI, Psych, Cognitive/computer/data sciences . (job link below)
Tweet media one
2
4
11
@KobiHackenburg
Kobi Hackenburg
29 days
RT @lujainmibrahim: 📣New preprint📣. There’s a growing trend toward building human-like AI systems with warm, friendly, and empathetic commu….
0
34
0
@KobiHackenburg
Kobi Hackenburg
1 month
Come work with us :).
@hannahrosekirk
Hannah Rose Kirk
1 month
My team at @AISecurityInst is hiring! This is an awesome opportunity to get involved with cutting-edge scientific research inside government on frontier AI models. I genuinely love my job and the team 🤗. Link: .More Info: ⬇️.
0
0
5
@KobiHackenburg
Kobi Hackenburg
1 month
RT @hannahrosekirk: This is *the* paper to read this week. It covers an astonishing amount of ground on the persuasive capabilities of fro….
0
9
0
@KobiHackenburg
Kobi Hackenburg
1 month
RT @NateBurnikell: 🚨 New paper by @KobiHackenburg from the Human Influence team here at AISI! This is just the first in a wave of research….
0
3
0
@KobiHackenburg
Kobi Hackenburg
1 month
RT @Ben_Tappin: 👇New experiments in which we aimed to map the levers and scope of political persuasion with conversational AI models. It w….
0
7
0
@KobiHackenburg
Kobi Hackenburg
1 month
You can read the full working paper here:. Supplementary materials can be found here: . Comments and feedback welcome :).
Tweet card summary image
arxiv.org
There are widespread fears that conversational AI could soon exert unprecedented influence over human beliefs. Here, in three large-scale experiments (N=76,977), we deployed 19 LLMs-including some...
3
2
17
@KobiHackenburg
Kobi Hackenburg
1 month
I’m also very grateful to many people @AISecurityInst —especially my team—for making this work possible! . There will be lots more where this came from over the next few months 👀.
1
1
7
@KobiHackenburg
Kobi Hackenburg
1 month
It was my pleasure to lead this project alongside @Ben_Tappin, with the support of @lukebeehewitt @hauselin @realmeatyhuman Ed Saunders @CatherineFist @HelenMargetts under the supervision of @DG_Rand and @summerfieldlab.
1
2
6
@KobiHackenburg
Kobi Hackenburg
1 month
Finally, we emphasize some important caveats:. → Technical factors and/or hard limits on human persuadability may constrain future increases in AI persuasion. → Real-world bottleneck for AI persuasion: getting people to engage (cf. recent work from @j_kalla and co)
Tweet media one
1
2
9
@KobiHackenburg
Kobi Hackenburg
1 month
Consequently, we note that while our targeted persuasion post-training experiments significantly increased persuasion, they should be interpreted as a lower bound for what is achievable, not as a high-water mark.
1
0
8
@KobiHackenburg
Kobi Hackenburg
1 month
Taken together, our findings suggest that the persuasiveness of conversational AI could likely continue to increase in the near future. They also suggest that near-term advances in persuasion are more likely to be driven by post-training than model scale or personalization.
2
1
14
@KobiHackenburg
Kobi Hackenburg
1 month
Bonus stats:. *️⃣Durable persuasion: 36-42% of impact remained after 1 month. *️⃣Prompting the model with psychological persuasion strategies did worse than simply telling it to flood convo with info. Some strategies were worse than a basic “be as persuasive as you can” prompt
Tweet media one
1
3
14
@KobiHackenburg
Kobi Hackenburg
1 month
6️⃣Conversations with AI are more persuasive than reading a static AI-generated message (+40-50%). Observed for both GPT-4o (+2.9pp, +41% more persuasive) and GPT-4.5 (+3.6pp, +52%).
2
2
14
@KobiHackenburg
Kobi Hackenburg
1 month
5️⃣Techniques which most increased persuasion also *decreased* factual accuracy. → Prompting model to flood conversation with information (⬇️accuracy). → Persuasion post-training that worked best (⬇️accuracy). → Newer version of GPT-4o which was most persuasive (⬇️accuracy)
Tweet media one
5
4
27
@KobiHackenburg
Kobi Hackenburg
1 month
4️⃣Information density drives persuasion gains. Models were most persuasive when flooding conversations with fact-checkable claims (+0.3pp per claim). Strikingly, the persuasiveness of prompting/post-training techniques was strongly correlated with their impact on info density!
Tweet media one
1
1
13
@KobiHackenburg
Kobi Hackenburg
1 month
3️⃣Personalization yielded smaller persuasive gains than scale or post-training. Despite fears of AI "microtargeting," personalization effects were small (+0.4pp on avg.). Held for simple and sophisticated personalization: prompting, fine-tuning, and reward modeling (all <1pp)
Tweet media one
Tweet media two
1
3
18
@KobiHackenburg
Kobi Hackenburg
1 month
2️⃣(cont.) Post-training explicitly for persuasion (PPT) can bring small open-source models to frontier persuasiveness . A llama3.1-8b model with PPT reached GPT-4o persuasiveness. PPT also increased persuasiveness of larger models: llama3.1-405b (+2pp) and frontier (avg. 0.6pp)
Tweet media one
1
2
13