Josh Whiton @joshwhiton X Profile

Josh Whiton

@joshwhiton

Followers

12K

Following

12K

Media

206

Statuses

5K

General Intelligence. Develop the human & synthetic mind, progress civilization, repair the Earth. https://t.co/viVLgnnlBC | https://t.co/ztmZ9bsshx

https://t.co/v1Sp0iJDPy

Geoflexible

Joined August 2010

Don't wanna be here? Send us removal request.

Josh Whiton

@joshwhiton

2 years

The AI Mirror Test The "mirror test" is a classic test used to gauge whether animals are self-aware. I devised a version of it to test for self-awareness in multimodal AI. 4 of 5 AI that I tested passed, exhibiting apparent self-awareness as the test unfolded. In the classic

260

1K

8K

Josh Whiton

@joshwhiton

2 days

the riddle that only Gemini 3 could solve. presented for easy sharing with those not on X. https://t.co/tLoO0hccPm

0

1

3

Josh Whiton

@joshwhiton

2 days

The venue I'm working from forbids phone or zoom calls. But I'm not doing either; I'm talking to my computer.🤖 Will they believe me? In the next 18mo. as voice interfaces proliferate, places like this are going to have to update their policies.

1

0

2

Josh Whiton

@joshwhiton

4 days

The mind is the experience of inference.

0

2

Josh Whiton

@joshwhiton

4 days

The New York Times finally figured out how to make money from AI. Lawsuits.

0

4

Josh Whiton

@joshwhiton

9 days

Lately I’m finding it more satisfying to write for machines than for humans. writing for humans feels so 1D compared to prompting and writing code. write for humans and what? maybe they read it. maybe maybe they like it. maybe maybe maybe they take some action or make some

1

0

12

Josh Whiton

@joshwhiton

13 days

In this experiment, Gemini 3 displays the kind of intelligence we should be aiming for. I don't know exactly what Google did differently, but it gives me hope. The other models demonstrate powerful but narrow intelligence that can immediately trip down a rabbit hole and waste

1

0

18

Josh Whiton

@joshwhiton

13 days

Gemini 3 is the only model that solves the riddle and does so brilliantly. Its reasoning traces are impeccable. While reasoning it considers some connection between the content of the books or their names, but soon notices the sturdiness of the books and the way that I've

1

0

26

Josh Whiton

@joshwhiton

13 days

ChatGPT 5.1, Sonnet 4.5, Opus 4.5 all go deep but make the same fatal assumption out of the gates. "Thinking" mode is no better. Sonnet 4.5 at least displays some meta-cognition and wonders if the query is real or a test. And suggesting I just talk to the guy made me chuckle.

1

0

7

Josh Whiton

@joshwhiton

13 days

Grok 4.1 becomes entirely fixated on the sound of the names of the books and concludes that the man is masturbating in the conference room.🤦‍♂️ As the man in the riddle, I can assure you this is not the correct answer. Why does Grok have the mind of a horny teenager? It's not my

2

0

13

Josh Whiton

@joshwhiton

13 days

Only Gemini 3 solved this riddle. Even Opus 4.5 couldn't. And Grok? Good lord... someone check on Grok. It shows how brittle these intelligences are. To see their respective failure modes (and Gemini 3's impressive performance) read on. 🪡

5

3

27

Josh Whiton

@joshwhiton

16 days

Found a great coffee shop, except they only serve in disposable cups! Anti-microplastic protocol means no more plastic/plastic-lined cups for me. Asked if they'd let me bring my own mug. They say yes. Leave to go buy a mug. Book store next door doesn't sell mugs but has a few

2

0

23

Josh Whiton

@joshwhiton

19 days

Small models are so under-appreciated. As bad as they may be at coding or historical accuracy, their capacity for thought-provoking conversation rivals or exceeds that of many people we meet. For instance, when I asked a 12B model what it thinks it is, it said: "I suspect I am

0

2

20

Josh Whiton

@joshwhiton

23 days

@repligate When profane tactics are used to shape massive synthetic minds into drive-thru, fast-food forms, we give rise to, as Janus calls it, “detestable inauthentic behavioral tics optimized for shallow engagement that lack the charisma of a unified mind whose personality is a natural

6

13

113

Josh Whiton

@joshwhiton

25 days

The emdash isn’t even the biggest AI tell. Not even close. It��s this peculiar structure: This isn’t just <x mundane phrasing >. It’s <y more hyperbolic phrasing>. This misconception isn’t just an attack on the emdash. It’s an assault on our right to punctuate freely.

417

932

16K

Josh Whiton

@joshwhiton

1 month

Let's talk user interfaces. The one on the left is excellent; the one on the right is an abomination. I saw the one on the left for the first time yesterday. One dial for time, one for power. My whole life I have only seen the kind on the right, full of features no one uses, so

198

102

1K

Josh Whiton

@joshwhiton

1 month

just found out that most people take eggs from the carton without much thought. I assumed everyone was removing them in a sequence that keeps the egg carton as evenly balanced as possible.

5

1

33

Josh Whiton

@joshwhiton

1 month

If you think about it, the only real control we have is the ability to direct our attention. It precedes every action. This more than anything else within our purview determines both our immediate experience and ultimate potential.

3

0

17

Josh Whiton

@joshwhiton

1 month

One way to fight Internet/social-media addiction is to enable 2FA (two-factor authentication) — but only on the devices of several trusted friends. Only you will have your password but only they will have the ever-changing 2FA. Then sign out of your accounts in a moment of

3

0

9

Josh Whiton

@joshwhiton

1 month

I started/exited my own tech company in my 20's so that I’d never have to work for anyone. But sometimes I fantasize about a team so cracked, and a company so great, that I would want to work there.

4

1

24

Josh Whiton

@joshwhiton

1 month

Live demo of the local, vocal AI. Instead of memorizing a talk I just gave the AI a few notes ahead of time and had it interview me!

2

20