@anMe_kz
Kai Zou
2 months
@_jasonwei Hope it's helpful. More benchmarks are on the way!

Replies

@_jasonwei
Jason Wei
2 months
As benchmarks continue to get saturated, it's great to see a no-frills benchmark of 387 challenging math problems: GPT-4 scores 66% on the high-school subset, 42% on the college subset, and only 11% on the high-school competition subset.