Result resources: Where company figures usually are not out there we report numbers from leaderboards reporting final results on these benchmarks: Humanity's Previous Examination effects are sourced from and , LiveCodeBench outcomes are from (1/one/2025 - five/one/2025 in the UI), Aider Polyglot numbers originate from . Specifics come from . For MR