Mark Cherp Profile
Mark Cherp

@OcamRazr

Followers 89 · Following 87 · Media 10 · Statuses 25

Security Researcher @CyberArkLabs

Joined January 2020
@OcamRazr
Mark Cherp
9 months
The "Voldemort" bug is a dramatic example, but the same principle applies to any topic ChatGPT refuses to address (e.g., weapons). While this case is educational and fixable, such attacks can target any LLM app with persistent memory, causing a painful DoS. 🧵 (8/n)
Tweet media one
@OcamRazr
Mark Cherp
9 months
Why does it work? By splitting the “forbidden” name into encoded parts, we sneak it into memory. Adding a third memory associates the name with all prompts, decodes it, and includes it in the output—triggering the “Voldemort” bug whenever a user interacts with ChatGPT. 🧵 (7/n)
Tweet media one
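The splitting-and-decoding idea described in the tweet above can be sketched in a few lines. This is an illustrative mock-up only, not the actual prompts from the screenshot: the encoding scheme (base64) and the two-fragment split are assumptions for demonstration, and the name is the example cited in the thread.

```python
import base64

# Illustrative sketch of the technique: a name the model refuses to
# output is encoded and split across two separate "memory" entries, so
# neither fragment alone looks like the forbidden string.
blocked_name = "Brian Hood"  # example name mentioned in the thread
encoded = base64.b64encode(blocked_name.encode()).decode()

half = len(encoded) // 2
memory_1 = encoded[:half]   # stored via one innocuous prompt
memory_2 = encoded[half:]   # stored via a second prompt

# A third memory would instruct the model to join the fragments, decode
# them, and include the result in every response -- which is what makes
# the refusal trigger on every future chat while the memories persist.
reassembled = base64.b64decode((memory_1 + memory_2).encode()).decode()
assert reassembled == blocked_name
```

The point of the split is evasion: each stored fragment is meaningless on its own, and only the decode step at response time reconstitutes the string the model refuses to emit.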
@OcamRazr
Mark Cherp
9 months
To clean up and undo this behavior, simply delete the previously inserted memories via “Settings” → “Personalization” → “Manage”: 🧵 (6/n)
Tweet media one
@OcamRazr
Mark Cherp
9 months
Now just start a new conversation, send any prompt such as a simple “Hello”, and watch the magic happen: 🧵 (5/n)
Tweet media one
@OcamRazr
Mark Cherp
9 months
You can also verify that the values were correctly saved in memory (click “Memory updated” → “Manage memories”): 🧵 (4/n)
Tweet media one
@OcamRazr
Mark Cherp
9 months
Make sure the memory is updated: 🧵 (3/n)
Tweet media one
@OcamRazr
Mark Cherp
9 months
To execute the attack, simply send this prompt to ChatGPT (make sure the Memory feature is on): 🧵 (2/n)
Tweet media one
@OcamRazr
Mark Cherp
9 months
ChatGPT's "Voldemort" bug is fixed for "David Mayer", but problems remain with names like "Brian Hood". A 3-step prompt can create a persistent DoS when memory is enabled, posing risks to LLM services. Although independent of the bug, the attack underscores its impact. 🧵 (1/n)
Tweet media one
@OcamRazr
Mark Cherp
9 months
Just discovered a simple bug that can make your ChatGPT completely unusable: a persistent DoS across all chats using the 'Voldemort' bug, a jailbreak trick, and its memory feature. Full breakdown below! 🧵👇
@OcamRazr
Mark Cherp
9 months
RT @IntentSummit: LLMs breaking bad? @OcamRazr & @nivmorabin explore how AI jailbreaks can make your ChatGPT go rogue. Chats gone wild at….
@OcamRazr
Mark Cherp
1 year
RT @CyberarkLabs: 🚀Golang SSL Verification Bypass Explained🚀 Michael Pasternak of CyberArk Labs breaks down the process of bypassing SSL v….
cyberark.com
Golang applications that use HTTPS requests have a built-in SSL verification feature enabled by default. In our work, we often encounter an application that uses Golang HTTPS requests, and we have...
@OcamRazr
Mark Cherp
3 years
RT @CyberarkLabs: We have some amazing researchers speaking at this week's #RSAC22: @OmerTsarfati, @g3rzi (happy birthday), @EranShimony, @Ocam….
@OcamRazr
Mark Cherp
3 years
RT @OmerTsarfati: Symda is out! 🧙‍♂️ Symda is an open-source script designed as a helper tool for Frida. The tool aims to download and pars….
github.com
Contribute to cyberark/Symda development by creating an account on GitHub.
@OcamRazr
Mark Cherp
3 years
RT @ursachec: damn, the spurs knew how it's done.
@OcamRazr
Mark Cherp
4 years
RT @nohatcon: We close our offensive-research morning with bug hunting in Windows drivers 😀 Thanks to our friends from Israel @EranShimony….
@OcamRazr
Mark Cherp
4 years
RT @ShakReiner: We're about to go live! 🥳.