本年9月に登場したOpenAIの「OpenAI o1-preview」が、12月5日にアップデートされ正式版「OpenAI o1」になり「OpenAI o1 Pro」も加わりました。 TRACKING AIによれば、ノルウェーMensa IQテストにおいて、正式版「Open o1」のIQは133で、「OpenAI o1-preview」(120)より13ポイント上昇したということです。ただ、高性能なはずの「OpenAI o1 Pro」は118となっていて、正式版「Open o1」より低いということで、よくわかりません。 別の評価「オフラインテスト」では、「OpenAI o1 Pro」が110、「OpenAI o1-preview」が97、正式版「Open o1」が90と全体的に低くなっています。 ちょっと古いですが、今年の9月に公表されているARC-AGI(Abstraction and Reasoning Corpus for Artificial General Intelligence)における各AIモデルのスコアは以下の通りとなっています。
TRACKING AI Monitoring Bias in Artificial Intelligence Chatbots https://trackingai.org/IQ OpenAI o1 Results on ARC-AGI-Pub https://arcprize.org/blog/openai-o1-results-arc-prize Is Open o1's IQ 133? OpenAI's "OpenAI o1-preview," which was introduced this September, was updated on December 5 to become the official "OpenAI o1," and "OpenAI o1 Pro" was also added. According to TRACKING AI, the official version "Open o1" scored an IQ of 133 on the Norwegian Mensa IQ test, marking a 13-point increase from "OpenAI o1-preview" (120). However, the supposedly higher-performing "OpenAI o1 Pro" scored 118, which is lower than the official "Open o1," making this somewhat confusing. In another evaluation, the "Offline Test," "OpenAI o1 Pro" scored 110, "OpenAI o1-preview" scored 97, and the official "Open o1" scored 90, showing overall lower scores. Although a bit dated, the scores for various AI models in the ARC-AGI (Abstraction and Reasoning Corpus for Artificial General Intelligence) released this September are as follows:
Your browser does not support viewing this document. Click here to download the document.
0 Comments
Leave a Reply. |
著者萬秀憲 アーカイブ
December 2024
カテゴリー |