|
2026年2月3日、ARC-AGI-2 Leaderboardに、 Johan Land氏がGPT-5.2に手を加えたシステムが、ARC-AGI-2(Abstraction and Reasoning Corpus for AGI - Version 2)において72.9%という驚異的な正答率を記録したことが正式に記載されました。 ARC-AGI-2は、人間にとっては簡単でも、現在のAIシステムにとっては非常に難しいタスクで構成された抽象的推論能力(蓄積された知識ではなく、新しい状況で推論し、問題を解決する能力=流動的知性)を測定するベンチマークの新しいバージョンで、この記録は、直前のState-of-the-Art(SOTA)であったGPT-5.2のスコア(54.2%)を大幅に上回るものであり、AIが特定領域のスキルだけでなく、未知の問題に対する適応能力において人間レベル(平均的な人間は約60-66%とされる)を超えたことを示しています。 このARC-AGI-2で72.9%達成という情報について、生成AIに深掘りさせました。さらに、結果をNotebookLMでインフォグラフィック、スライド資料にさせました。 なお、生成AIによる調査・分析結果は、公開された情報からだけの分析であり、必ずしも実情を示したものではないこと、誤った情報も含まれていることについてはご留意されたうえで、ご参照ください。 ARC-AGI-2 Leaderboard https://arcprize.org/leaderboard Johan Land Achieves 72.9% on ARC-AGI-2 On February 3, 2026, the ARC-AGI-2 Leaderboard officially recorded that a system developed by Johan Land, based on a modified version of GPT-5.2, achieved an astonishing accuracy of 72.9% on ARC-AGI-2 (Abstraction and Reasoning Corpus for AGI – Version 2). ARC-AGI-2 is a new version of a benchmark designed to measure abstract reasoning ability—that is, fluid intelligence, the capacity to reason and solve problems in novel situations rather than relying on accumulated knowledge. While the tasks are relatively easy for humans, they are extremely challenging for current AI systems. This result significantly surpasses the previous state-of-the-art score of 54.2%, achieved by GPT-5.2 itself, and suggests that AI has exceeded average human-level performance in adaptability to unfamiliar problems (with average human scores generally estimated at around 60–66%), not merely in narrow, domain-specific skills. I conducted an in-depth analysis of this 72.9% ARC-AGI-2 result using generative AI, and further converted the findings into infographics and slide materials using NotebookLM. Please note that the investigations and analyses conducted by generative AI are based solely on publicly available information and may not fully reflect real-world conditions. They may also contain inaccuracies, so readers are advised to consult the results with appropriate caution. Your browser does not support viewing this document. Click here to download the document. Your browser does not support viewing this document. Click here to download the document. Your browser does not support viewing this document. Click here to download the document. Your browser does not support viewing this document. Click here to download the document. Your browser does not support viewing this document. Click here to download the document. Your browser does not support viewing this document. Click here to download the document.
0 Comments
Leave a Reply. |
著者萬秀憲 アーカイブ
January 2026
カテゴリー |
RSS Feed