A groundbreaking artificial intelligence system developed by Google DeepMind, one of the top AI labs globally, can solve complex geometry problems at a level comparable to a human gold medalist in the prestigious International Mathematical Olympiad (IMO).
Named AlphaGeometry, this innovative system merges two distinct approaches: a neural language model that generates intuitive ideas and a symbolic deduction engine that validates these ideas using formal logic. The language model is built on the same technology that powers Google’s renowned search engine and natural language processing systems. The deduction engine draws inspiration from a method created by Chinese mathematician Wen-Tsün Wu in 1978.
Researchers tested AlphaGeometry on 30 rigorous geometry problems from the IMO, which are considered challenging even for expert mathematicians. Remarkably, the system successfully solved 25 of these problems within the standard time limit of 4.5 hours, matching the average score of human gold medalists. In contrast, the previous leading system, based on Wu’s method, only solved 10 problems.
The findings, published in Nature, indicate that AI can not only reason logically but also unearth new mathematical insights. Mathematics, particularly geometry, has historically posed challenges for AI researchers due to the need for both creativity and precision. Unlike text-based models, which can access vast amounts of internet data, mathematical data is more symbolic and domain-specific, making it rarer. Furthermore, solving math problems requires robust logical reasoning, an area where most current AI models struggle.
To tackle these challenges, the researchers applied a novel neuro-symbolic approach, leveraging the strengths of both neural networks and symbolic systems. While neural networks excel at pattern recognition and predicting outcomes, they often lack explanatory power. In contrast, symbolic systems operate on strict formal logic, enabling them to correct and justify the decisions made by the neural network.
The researchers likened their approach to the concept of “thinking, fast and slow” popularized by Nobel laureate Daniel Kahneman, where one system offers quick, intuitive ideas while the other engages in more deliberate, logical reasoning. Together, these systems collaborate to tackle complex mathematical challenges.
Moreover, AlphaGeometry demonstrated the ability to generalize to new problems, proving theorems not explicitly stated in the problem statements. For instance, it successfully proved a theorem regarding the angle bisector of a triangle, which was neither given as a premise nor a goal.
The team hopes that by open-sourcing their system, they will inspire further research and applications in mathematics, science, and AI. They also recognize the current limitations, such as the need for more human-readable proofs and the scalability to more complex problems, alongside the ethical considerations surrounding AI in mathematics.
Although AlphaGeometry is currently focused on geometry, the researchers believe their synthetic data methodology could empower AI reasoning in mathematical and scientific fields where human-generated training data is limited. By automating the discovery and verification of new knowledge, machine learning has the potential to significantly accelerate human understanding across diverse disciplines.