Vincent Conitzer Profile
Vincent Conitzer

@conitzer

Followers
4K
Following
2K
Media
152
Statuses
2K

AI professor. Director, @FOCAL_lab @CarnegieMellon. Head of Technical AI Engagement, @UniofOxford @EthicsInAI. Author, "Moral AI - And How We Get There."

Joined June 2009
Don't wanna be here? Send us removal request.
@conitzer
Vincent Conitzer
2 years
Our book exists! (but won't be out until Feb. 8.)
Tweet media one
13
9
141
@conitzer
Vincent Conitzer
4 hours
The Nash75 talks are on YouTube! Below is my talk "Game Theory for AI Agents" (link also gives all other talks on the side).
0
1
6
@conitzer
Vincent Conitzer
2 days
(18/n, n=18) up to lambda(2m+1)-m*sqrt(2) > lambda + 2m*epsilon. For some sufficiently large m, (m*epsilon)^2 > 2m+2 and hence playing this move will win the game for A, proving (b).
0
0
0
@conitzer
Vincent Conitzer
2 days
(17/n) lambda(2m+1)-m*sqrt(2) >= lambda >= 0, so A won't get in trouble the next move; this means A will never lose, proving the other half of (c). Moreover, if lambda > sqrt(2)/2, then for some epsilon>0, we have lambda > sqrt(2)/2 + epsilon, so that A's next move can be . .
1
0
0
@conitzer
Vincent Conitzer
2 days
(16/n) then the sum of the numbers is at most m*sqrt(2). [This is because A will have always played 0, and B would have made the sum of the numbers the highest by choosing sqrt(2) every round, by convexity.] This means that A can make a move of up to . .
1
0
0
@conitzer
Vincent Conitzer
2 days
(15/n) (2m+1)lambda - m*sqrt(2) < lambda - 2m*epsilon, and at some point this will be negative, so B must win eventually, proving (a). Suppose lambda >= sqrt(2)/2. With the wait-for-the-kill strategy, if after 2m rounds the game is still active, . .
1
0
0
@conitzer
Vincent Conitzer
2 days
(14/n) B won't get in trouble the next move; hence, this means B will never lose, proving half of (c). Moreover, if lambda < sqrt(2)/2, then for some epsilon>0, we have lambda < sqrt(2)/2 - epsilon, so that A's next move can be at most. .
1
0
0
@conitzer
Vincent Conitzer
2 days
(13/n) [This is because what would make the sum of the numbers the lowest is if for every i, the numbers played in rounds 2i+1 and 2i+1 are 0 and sqrt(2), by convexity.] It follows that A's next move can be at most (2m+1)lambda - m*sqrt(2) <= lambda which means . .
1
0
0
@conitzer
Vincent Conitzer
2 days
(12/n) Proof:.Suppose lambda <= sqrt(2)/2. With the fill'er-up strategy, if after 2m rounds the game is still active, then the sum of squares so far is 2m and the sum of the numbers is at least m*sqrt(2).
1
0
0
@conitzer
Vincent Conitzer
2 days
(11/n) B can keep A from winning by playing fill'er up and A can keep B from winning by playing wait for the kill.
1
0
0
@conitzer
Vincent Conitzer
2 days
(10/n) Theorem:.(a) If lambda is strictly below sqrt(2)/2 then B will win by playing fill'er up. (b) If lambda is strictly above sqrt(2)/2 then A will win by playing wait for the kill. (c) If lambda = sqrt(2)/2 then. .
1
0
0
@conitzer
Vincent Conitzer
2 days
(9/n) For A: "Wait for the kill" -- always choose 0 unless and until there is a choice that makes the next move for B impossible. For B: "Fill'er up" -- always choose the largest number possible.
1
0
0
@conitzer
Vincent Conitzer
2 days
(8/n) Intuition: A wants to "concentrate" the value of the numbers, whereas B wants to spread it out as much as possible. Thus we will consider the following two strategies:.
1
0
0
@conitzer
Vincent Conitzer
2 days
(7/n) the difference isn't as extreme or meaningful as I expected. In particular GDT's doesn't *look* like any kind of raw search power proof. My own solution, done before looking at anyone or anything else's, follows.
1
0
0
@conitzer
Vincent Conitzer
2 days
(6/n) and I wave my hands a bit more. I added a bit of intuition at the beginning but many human mathematicians wouldn't. While one could probably still tell mine is the human one and GDT's is the computer one in a number of ways,.
1
0
0
@conitzer
Vincent Conitzer
2 days
(5/n) Mine's a bit shorter but it seems not in any particularly meaningful way, I just wanted to write/repeat less and write less algebra (so I didn't work out the exact round in which something happens; I was also too lazy to check GDT's algebra but I take it someone did),.
1
0
0
@conitzer
Vincent Conitzer
2 days
(4/n) We should devote a lot of resources now to preparing for and steering the various consequences, of all kinds.) . Anyone else with a similar analysis of the other problems? Where are OpenAI's solutions?. More about solution comparison:.
1
0
0
@conitzer
Vincent Conitzer
2 days
(3/n) But if everything is above board it definitely gets me to think (even) higher of the capabilities of these systems. (No worries, I will continue posting funny examples, but the point of those was never that these systems aren't impressive or aren't quickly getting better.
1
0
0
@conitzer
Vincent Conitzer
2 days
(2/n) Below is my proof (hope I didn't mess it up :-)) and GDT's is here: I'm impressed. The basic thinking ("thinking"?) behind the proofs seems the same. More about the comparison below.
1
0
0
@conitzer
Vincent Conitzer
2 days
(1/n) I noticed that on the IMO where several LLM-based systems just got gold medals, the fifth problem (the last one solved by Gemini Deep Think) was a game problem, so I had no excuse :-) not to try it myself first and then see how the solutions compared.
1
0
12
@conitzer
Vincent Conitzer
3 days
I guess the message is to just never use your passwords. At all.
Tweet media one
0
3
4