The most significant advancement in Gemini 3.1 Pro lies in its performance on rigorous logic benchmarks. Most notably, the model achieved a verified score of 77.1% on ARC-AGI-2.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results