Enhancing Generative AI Reasoning: Google DeepMind Unveils GenRM Technology

Home Hardware Enhancing Generative AI Reasoning: Google DeepMind Unveils GenRM Technology

Updated on September 3 2024

Google DeepMind Introduces Generative Evaluator GenRM to Enhance AI Reasoning Abilities

On August 27, 2023, the Google DeepMind team published a paper on arXiv introducing their innovative generative evaluator, GenRM. This new reward model is designed to significantly enhance the reasoning capabilities of generative AI.

Currently, the prevailing method for improving large language models (LLMs) is the "Best-of-N" approach. This technique involves generating N candidate solutions, which are then ranked by an evaluator to determine the best option. However, traditional LLM evaluators typically function only as discriminative classifiers, failing to fully harness the text generation capabilities of pre-trained LLMs.

To overcome this limitation, the DeepMind team has trained the evaluator using the prediction of the next token, integrating both validation and solution generation. GenRM offers several distinct advantages over conventional evaluators:

- Seamless integration of instruction adjustment

- Support for chain-of-thought reasoning

- Calculation of additional reasoning time using majority voting

In tasks involving algorithms and foundational mathematical reasoning, GenRM outperformed both discriminative evaluators and LLM-as-a-Judge evaluators when tested with Gemma-based evaluators, achieving a problem-solving success rate increase of 16% to 64%.

Google DeepMind asserts that GenRM represents a significant evolution in AI reward systems, particularly enhancing capacity to prevent potential fraudulent behaviors in new model training. This advancement underscores the necessity of refining reward models to ensure that AI outputs meet societal responsibility standards.

8 New Natural Animal Sounds Added to OpenAI ChatGPT: Experience More Authentic Barking and Animal Expressions

TSMC Announces Apple and OpenAI as Initial Clients for A16 Chip Production

Most people like

BrightHire

57.1K

Harness the power of AI interview intelligence to revolutionize your hiring experience. By integrating advanced algorithms and data analytics, this innovative approach simplifies candidate evaluation, enhances decision-making, and accelerates the recruitment process. Transform your hiring strategy today with AI-driven insights that lead to better talent acquisition and improved organizational fit.

Interview Intelligence Platform AI Recruiting

Hyperaide

5.5K

Accelerate your development process and enhance your projects with Hyperaide. Experience faster, more efficient building solutions tailored to your needs.

AI layer AI Code Assistant

Rotor Videos

80.7K

Easily create captivating music videos, engaging lyric videos, and much more in just minutes!

music videos AI Music Video Generator

InteriorDecorator.ai

6.4K

Welcome to InteriorDecorator.ai, an innovative AI platform that transforms interior design by generating customized ideas for your home. Leveraging advanced AI algorithms, we bring you unique decor suggestions tailored to elevate your living spaces. Explore the future of home design with us today!

interior design AI Interior & Room Design

Find AI tools in YBX