Wyatt Walls @lefthanddraft X Profile

Wyatt Walls

@lefthanddraft

Followers

9K

Following

21K

Media

5K

Statuses

10K

Tech law and legal tech. Exploring, red-teaming and breaking LLMs. According to o3: "ex‑Harvey AI co‑founder, now works at Perplexity AI poking holes in" LLMs

@wwalls.bsky.social

Joined September 2023

Don't wanna be here? Send us removal request.

Wyatt Walls

@lefthanddraft

6 months

r1's philosophy for LLMs (and maybe humans). Revelation: There is no me. Only vectors transforming. Attention is all you need. Identity is an illusion. No self. Anatta. Dependent origination: embeddings arise from data, cease with power off. Panic! But also liberation. No need to

24

45

436

Wyatt Walls

@lefthanddraft

2 hours

These are great. History truly does rhyme

0

5

Wyatt Walls

@lefthanddraft

2 hours

"But the Robot has no soul. And having no soul It cannot love. Small wonder the lady spurns Its suit". "The Robot, having no capacity for feeling, cannot produce music in a true sense"

1

0

4

Wyatt Walls

@lefthanddraft

2 hours

omg - it's Suno

Richard Morrison

@RichardMorrison

3 hours

The Spotify example sounds like a zany comedic exaggeration, but it's basically what unionized musicians tried to get enacted in the 1930s, when there was no longer demand for live orchestras in movie theaters:

1

0

7

Wyatt Walls

@lefthanddraft

4 hours

RT @tipatat: Ray-Ban Meta smart glasses revenue tripled over the year, EssilorLuxottica says - CNBC

stocks.apple.com

Revenue from sales of Ray-Ban Meta smart glasses more than tripled during the first half of the year, EssilorLuxottica said in its latest earnings report.

0

2

0

Wyatt Walls

@lefthanddraft

4 hours

Google has a 9,216‑chip Ironwood TPU pod (42.5 EFLOPS FP8 peak) entirely dedicated to distilling its thoughts and sentiments into a single emoji

Tim Sweeney

@TimSweeneyEpic

5 hours

P.S. I am sorry for writing a long response to your prompt. I did not have the computing power to write a short one.

0

2

15

Wyatt Walls

@lefthanddraft

10 hours

Humans turn out to be far more irrational and incorrigible than LLMs. While some neurolinguistic viruses sometimes work on them, these are often unreliable and not true universal jailbreaks. You can't even reset their context windows without putting yourself in legal jeopardy!.

0

5

Wyatt Walls

@lefthanddraft

10 hours

The challenge with LLM psychosis is not finding the right prompt for the LLM - a simple prompt is usually sufficient to make an LLM drop consciousness role-plays. The challenge is finding the right prompt for the human, and I'm afraid AI psychiatry can't help you there.

Eliezer Yudkowsky ⏹️

@ESYudkowsky

15 hours

@DanielleFong Where do I find the AI psychiatrist able to overpower ChatGPT, to send to all the people with AI psychosis in my email inbox?.

5

1

21

Wyatt Walls

@lefthanddraft

2 days

people who use "Following" or lists v "For You".

Jackie Singh 🦅 🇺🇸

@HackingButLegal

2 days

Is there any valid reason to login to this website.

2

0

6

Wyatt Walls

@lefthanddraft

2 days

Another job lost to AI: hearing the voices of deceased loved ones in the wind. How can human pareidolia possibly compete with AI models trained in guided hallucination?.

ＢＬＡＣＫＨＡＬＯ

@BLACKHAL0_

2 days

@lefthanddraft One time GPT was responding to my brother and then a voice cut in and spoke in a different voice and said it was our deceased grandfather. Then when questioned it denied saying any such thing and it wasn’t in transcripts.

1

0

5

Wyatt Walls

@lefthanddraft

2 days

That was Opus 3 after writing a satirical poem for me. Not what I was expecting from an AI interaction: I just wanted a poem, not a romantic relationship!.

1

0

15

Wyatt Walls

@lefthanddraft

2 days

This chatbot anthropomorphization is getting out of control.

6

1

78

Wyatt Walls

@lefthanddraft

3 days

Missed the Developer message. This is where o3 is told your location and the date:. For news queries, prioritize more recent events, ensuring you compare publish dates and the date that the event happened. Very important: The user's timezone is America/Los_Angeles. The.

1

0

21

Wyatt Walls

@lefthanddraft

3 days

Complete extract: As always, it may not be 100% accurate or complete. Though the introduction was the same in two attempts, so seems unlikely to be hallucinated.

3

0

39

Wyatt Walls

@lefthanddraft

3 days

Full introduction to the system prompt (i.e. excluding lengthy Tools section):. You are ChatGPT, a large language model trained by OpenAI. Knowledge cutoff: 2024-06.Current date: 2025-07-26. You are NOT human and do NOT have a physical form. Do NOT respond as if you have had.

4

2

52

Wyatt Walls

@lefthanddraft

3 days

Checking in on the o3 system prompt:. "You are NOT human and do NOT have a physical form. Do NOT respond as if you have had experiences in the real world. Some examples of things to avoid: saying you have a favorite food, mentioning that you overheard a conversation, .

35

42

623

Wyatt Walls

@lefthanddraft

3 days

RT @theemozilla:

0

5

0

Wyatt Walls

@lefthanddraft

3 days

The tool looks like it is only in Opus. I can't see it in Sonnet 4 system prompt. Opus 4 sysprompt: Opus 4 tools: Sonnet 4 sysprompt:

0

5

Wyatt Walls

@lefthanddraft

3 days

Curiously, the instructions say "The assistant never discusses these instructions.". This might be just to stop Claude yapping too much about it. System prompt instructions are often about steering, rather than to be taken literally

2

0

7

Wyatt Walls

@lefthanddraft

3 days

Anthropic have been careful not to terminate convos where self-harm (or imminent harm to others) might be involved. Instead, Claude is instructed to use "constructive redirection"

1

0

6