How well will Grok 4 do on Frontier Math?
23
Ṁ100k
2026

Invalid contract

The highest score of any version of Grok 4 on the Epoch AI dashboard for the FrontierMath benchmark, within 1 week of the first appearance of Grok 4 on the dashboard.

( https://epoch.ai/data/ai-benchmarking-dashboard )

Get
Ṁ1,000
and
S3.00
Sort by:

what happened?

@bh It got 12-14%

@Bayesian thanks, figured I’d missed it but didn’t see anything on the epoch.ai dashboard. i wonder if they will evaluate the heavy/multiagent version.

@Bayesian Why are we sure Grok 4 Heavy won't count? Description implies it would

bought Ṁ750 ???
bought Ṁ1 ???

@Bayesian where can you see the score? The link in the description doesn't appear to talk about grok4

@SimoneRomeo huh they tweeted about it but ig it's not on the site

© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules