Chinese LLMs Struggling on ARC-AGI-2

ARC-AGI-2, released by the ARC Prize Foundation in March 2025, is a benchmark designed to measure an AI's "fluid intelligence" (its ability to adapt to novel situations). As of February 2026, Chinese-developed LLMs such as DeepSeek and Qwen score on par with Western frontier models on conventional benchmarks like SWE-Bench, AIME, and GPQA-Diamond. On ARC-AGI-2, however, their scores remain significantly lower.

I asked a generative AI system to dig into why Chinese LLMs are struggling on ARC-AGI-2, then used NotebookLM to turn the results into an infographic and slide materials.

Please note that the generative AI's research and analysis are based solely on publicly available information and do not necessarily reflect actual conditions; they may also contain inaccuracies. Please keep this in mind when consulting the materials.
Author: 萬秀憲
Archives
January 2026