Will Gemini-1.5-Pro-Exp-0801 Score Above 1165 in Scale AI's Math Evaluation | Manifold

Will Gemini-1.5-Pro-Exp-0801 Score Above 1165 in Scale AI's Math Evaluation

Basic

1

Ṁ5

resolved Feb 5

Resolved

N/A

1D

1W

1M

ALL

Context:

Gemini-1.5-Pro-Exp-0801 is currently the leading model on the LMYS Arena leaderboard (https://arena.lmsys.org/).
This market is about its potential evaluation by Scale AI (https://scale.com/leaderboard).

Resolution Criteria:

The market resolves as "Yes" if the model is evaluated by Scale AI and It receives a score strictly larger than 96.60 in the Math category.
The market resolves as "No" if the model is evaluated by Scale AI and it receives a score of 96.60 or less in the Math category
The market resolves as "N/A" if either
1. Scale AI doesn't evaluate the model and add it to the leaderboard before October 1, 2024 or
2. The evaluation methodology changes before the model is evaluated.

This question is managed and resolved by Manifold.

Get

1,000

and

3.00

Related questions

GPT-5 score on GPQA Diamond?

GPT-5 Score on BrowseComp?

Will GPT-4.5 score at least 100 in an IQ test?

GPT-5: 120+ on AMC

Will GPT-5 score higher than 1350 on the Lmsys Arena Leaderboard

Will the GPT4+code-interpreter+search score > 1350 on Lmsys Arena Leaderboard?

Will GPT-5 score at least 100 in an IQ test?

Related questions

GPT-5 score on GPQA Diamond?

Will GPT-5 score higher than 1350 on the Lmsys Arena Leaderboard

GPT-5 Score on BrowseComp?

Will the GPT4+code-interpreter+search score > 1350 on Lmsys Arena Leaderboard?

Will GPT-4.5 score at least 100 in an IQ test?

Will GPT-5 score at least 100 in an IQ test?

GPT-5: 120+ on AMC

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules