Vincent Conitzer @conitzer X Profile

Vincent Conitzer

@conitzer

Followers

4K

Following

2K

Media

152

Statuses

2K

AI professor. Director, @FOCAL_lab @CarnegieMellon. Head of Technical AI Engagement, @UniofOxford @EthicsInAI. Author, "Moral AI - And How We Get There."

Joined June 2009

Don't wanna be here? Send us removal request.

Vincent Conitzer

@conitzer

2 years

Our book exists! (but won't be out until Feb. 8.)

13

9

141

Vincent Conitzer

@conitzer

4 hours

The Nash75 talks are on YouTube! Below is my talk "Game Theory for AI Agents" (link also gives all other talks on the side).

0

1

6

Vincent Conitzer

@conitzer

2 days

(18/n, n=18) up to lambda(2m+1)-m*sqrt(2) > lambda + 2m*epsilon. For some sufficiently large m, (m*epsilon)^2 > 2m+2 and hence playing this move will win the game for A, proving (b).

0

Vincent Conitzer

@conitzer

2 days

(17/n) lambda(2m+1)-m*sqrt(2) >= lambda >= 0, so A won't get in trouble the next move; this means A will never lose, proving the other half of (c). Moreover, if lambda > sqrt(2)/2, then for some epsilon>0, we have lambda > sqrt(2)/2 + epsilon, so that A's next move can be . .

1

0

Vincent Conitzer

@conitzer

2 days

(16/n) then the sum of the numbers is at most m*sqrt(2). [This is because A will have always played 0, and B would have made the sum of the numbers the highest by choosing sqrt(2) every round, by convexity.] This means that A can make a move of up to . .

1

0

Vincent Conitzer

@conitzer

2 days

(15/n) (2m+1)lambda - m*sqrt(2) < lambda - 2m*epsilon, and at some point this will be negative, so B must win eventually, proving (a). Suppose lambda >= sqrt(2)/2. With the wait-for-the-kill strategy, if after 2m rounds the game is still active, . .

1

0

Vincent Conitzer

@conitzer

2 days

(14/n) B won't get in trouble the next move; hence, this means B will never lose, proving half of (c). Moreover, if lambda < sqrt(2)/2, then for some epsilon>0, we have lambda < sqrt(2)/2 - epsilon, so that A's next move can be at most. .

1

0

Vincent Conitzer

@conitzer

2 days

(13/n) [This is because what would make the sum of the numbers the lowest is if for every i, the numbers played in rounds 2i+1 and 2i+1 are 0 and sqrt(2), by convexity.] It follows that A's next move can be at most (2m+1)lambda - m*sqrt(2) <= lambda which means . .

1

0

Vincent Conitzer

@conitzer

2 days

(12/n) Proof:.Suppose lambda <= sqrt(2)/2. With the fill'er-up strategy, if after 2m rounds the game is still active, then the sum of squares so far is 2m and the sum of the numbers is at least m*sqrt(2).

1

0

Vincent Conitzer

@conitzer

2 days

(11/n) B can keep A from winning by playing fill'er up and A can keep B from winning by playing wait for the kill.

1

0

Vincent Conitzer

@conitzer

2 days

(10/n) Theorem:.(a) If lambda is strictly below sqrt(2)/2 then B will win by playing fill'er up. (b) If lambda is strictly above sqrt(2)/2 then A will win by playing wait for the kill. (c) If lambda = sqrt(2)/2 then. .

1

0

Vincent Conitzer

@conitzer

2 days

(9/n) For A: "Wait for the kill" -- always choose 0 unless and until there is a choice that makes the next move for B impossible. For B: "Fill'er up" -- always choose the largest number possible.

1

0

Vincent Conitzer

@conitzer

2 days

(8/n) Intuition: A wants to "concentrate" the value of the numbers, whereas B wants to spread it out as much as possible. Thus we will consider the following two strategies:.

1

0

Vincent Conitzer

@conitzer

2 days

(7/n) the difference isn't as extreme or meaningful as I expected. In particular GDT's doesn't *look* like any kind of raw search power proof. My own solution, done before looking at anyone or anything else's, follows.

1

0

Vincent Conitzer

@conitzer

2 days

(6/n) and I wave my hands a bit more. I added a bit of intuition at the beginning but many human mathematicians wouldn't. While one could probably still tell mine is the human one and GDT's is the computer one in a number of ways,.

1

0

Vincent Conitzer

@conitzer

2 days

(5/n) Mine's a bit shorter but it seems not in any particularly meaningful way, I just wanted to write/repeat less and write less algebra (so I didn't work out the exact round in which something happens; I was also too lazy to check GDT's algebra but I take it someone did),.

1

0

Vincent Conitzer

@conitzer

2 days

(4/n) We should devote a lot of resources now to preparing for and steering the various consequences, of all kinds.) . Anyone else with a similar analysis of the other problems? Where are OpenAI's solutions?. More about solution comparison:.

1

0

Vincent Conitzer

@conitzer

2 days

(3/n) But if everything is above board it definitely gets me to think (even) higher of the capabilities of these systems. (No worries, I will continue posting funny examples, but the point of those was never that these systems aren't impressive or aren't quickly getting better.

1

0

Vincent Conitzer

@conitzer

2 days

(2/n) Below is my proof (hope I didn't mess it up :-)) and GDT's is here: I'm impressed. The basic thinking ("thinking"?) behind the proofs seems the same. More about the comparison below.

1

0

Vincent Conitzer

@conitzer

2 days

(1/n) I noticed that on the IMO where several LLM-based systems just got gold medals, the fifth problem (the last one solved by Gemini Deep Think) was a game problem, so I had no excuse :-) not to try it myself first and then see how the solutions compared.

1

0

12

Vincent Conitzer

@conitzer

3 days

I guess the message is to just never use your passwords. At all.

0

3

4