Musk Viewer
About
Privacy Policy
Removal Request
Kai Zou
@anMe_kz
2 months
@_jasonwei
Hope it's helpful. More benchmarks are on the way!
0
4
61
Replies
Jason Wei
@_jasonwei
2 months
As benchmarks continue to get saturated, it's great to see a no-frills benchmark of 387 challenging math problems: GPT-4 is 66% on high-school subset, 42% on college subset, and only 11% on high-school competition subset.
9
44
300