FactCheckRadar Fact-check archive

Published fact-check

MiniMax M3 Claims Frontier Coding Performance

Supported

Claim checked

“MiniMax dropped M3 1M context 59% on SWE-Bench Pro close to opus 4.7”

Published

Updated

Verdict

Supported

The claim that MiniMax M3 scored 59% on SWE-Bench Pro and came close to Opus 4.7 is supported by the company's official announcement and multiple tech outlets. However, the evidence shows this is a vendor-reported figure that has not yet been independently verified on public leaderboards. MiniMax's own blog states M3 "approaches Opus 4.7" on SWE-Bench Pro, and third-party coverage from Apidog and Lushbinary confirms the 59% score while noting independent confirmation is still pending.

Reasoning

MiniMax officially released M3 on June 1, 2026, and its blog post explicitly states the model scores 59.0% on SWE-Bench Pro, surpassing GPT-5.5 and Gemini 3.1 Pro while approaching Claude Opus 4.7. This matches the X post's claim exactly. Multiple tech publications including Lushbinary, Apidog, and Pasquale Pillitteri's site all report the same 59% figure, consistently framing it as vendor-reported data. Apidog's comparison piece specifically warns that "most of the numbers behind that claim come from MiniMax itself" and that "independent leaderboard confirmation is still pending." The claim about being "close to Opus 4.7" is supported by MiniMax's own framing, though the exact Opus 4.7 score isn't provided in these sources for direct comparison. The 1M context window claim is also confirmed across all sources.

The evidence consists of MiniMax's official blog post and several tech publication reports all dated June 1, 2026. While multiple sources corroborate the 59% SWE-Bench Pro figure, they all trace back to MiniMax's own announcement. No independent leaderboard verification exists yet. The sources are recent and directly relevant to the claim.

Key checks

  • 59% SWE-Bench Pro score: MiniMax's official blog confirms M3 scores 59.0% on SWE-Bench Pro, matching the X post's claim exactly.

  • Close to Opus 4.7 performance: MiniMax states M3 'approaches Opus 4.7' on SWE-Bench Pro, but the exact Opus score isn't provided for direct comparison. The framing is vendor-reported.

  • Independent verification status: Multiple outlets including Apidog note these are vendor-reported benchmarks with independent leaderboard confirmation still pending.

Confidence

Medium

Was this useful?

Your vote helps us see which fact-checks deserve more attention.

4 reviewed sources behind this verdict.

Might interest you next