OpenAI Claims Its New Model Achieves Human-Level Performance on 'General Intelligence' Test
On December 20, OpenAI's o3 system scored 88% on the ARC-AGI benchmark, well above the previous AI best of 60% and roughly on par with the average human score. The system also performed strongly on a challenging mathematics test.