Google's Gemini models lead trader sentiment at 61% implied probability due to repeated top scores on math and reasoning benchmarks including GPQA Diamond near 94% and strong AIME results from the 3.1 Pro and related variants. Recent enhancements to Google's Deep Think features and long-context multimodal tools have solidified this positioning ahead of the June resolution. Anthropic's Claude Opus series follows at 27% on the strength of consistent general reasoning performance, while OpenAI sits at 10.5% despite a May breakthrough solving an 80-year-old Erdős conjecture. Remaining labs trail amid benchmark saturation on simpler math tasks and ongoing model iterations.
Experimental AI summary based on Polymarket data. This is not trading advice and does not affect how this market resolves.
Which company has the best Math AI model end of June?
Google 61%
Anthropic 27%
OpenAI 10%
Z.ai 3.6%
$36,946 volume

Google 61%
Anthropic 27%
OpenAI 10%
Z.ai 4%
Alibaba 1%
xAI 1%
Baidu 1%
ByteDance <1%
Mistral <1%
Amazon <1%
Microsoft <1%
Meta <1%
Moonshot <1%
DeepSeek <1%
Meituan <1%
Results from the "Rank" column under the "Text Arena | Math" Leaderboard tab at https://arena.ai/leaderboard/text/math-no-style-control with style control off will be used to resolve this market.
Models will be ordered primarily by their leaderboard rank at the market’s check time. If two or more models are tied on rank, they will be ordered by their Arena score, including any underlying, unrounded, granular values reflected in the data below the leaderboard. If a tie still remains, alphabetical order of company names as listed in this market group will be used as a final tiebreaker (e.g., if the two models are tied by exact arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
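The ordering rule above (rank first, then unrounded Arena score, then alphabetical company name) can be sketched as a single sort key. This is only an illustration of the stated rule, not the resolver's actual procedure: the `Entry` type, its field names, and the helper functions are hypothetical, and the sketch assumes that a higher Arena score ranks ahead and that the alphabetical comparison ignores letter case.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row (hypothetical structure, for illustration only)."""
    company: str  # company name as listed in the market group
    rank: int     # value of the leaderboard "Rank" column (1 is best)
    score: float  # unrounded Arena score (assumed: higher is better)


def resolution_order(entries):
    """Order rows by rank, then by score (descending), then by company name."""
    return sorted(
        entries,
        key=lambda e: (e.rank, -e.score, e.company.casefold()),
    )


def winning_company(entries):
    """Return the company in first place under the ordering above."""
    return resolution_order(entries)[0].company
```

With two rows tied on both rank and exact score, one for "Google" and one for "xAI", `winning_company` returns "Google", which matches the tiebreaker example in the rules.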
Market opened: May 26, 2026, 6:36 PM ET
Resolver
0x69c47De9D...