Intel's Gaudi 3 Debuts: A New Contender in the AI Chip Market with an Open Ecosystem to Compete with Nvidia

Intel has unveiled its next-generation AI processing chip, the Gaudi 3 AI accelerator, designed to enhance AI development by streamlining workflows, simplifying infrastructure, and accelerating enterprise workloads.

The Gaudi 3 retains the architecture of its predecessor but offers significantly improved performance: four times the AI compute, double the network bandwidth, and 1.5 times the high-bandwidth memory (HBM) bandwidth, enabling it to keep pace with the growing demands of large language models (LLMs).

Although it is a purpose-built accelerator rather than a graphics processing unit (GPU), the Gaudi 3's parallel processing capabilities and multi-tile architecture make it well suited to AI workloads. This launch is part of Intel's strategy to compete with Nvidia and AMD in the AI accelerator market.

Intel CEO Pat Gelsinger, who previewed the Gaudi 3 at the AI Everywhere event, announced that although the chip officially launches today, general availability is set for the third quarter of 2024, with some customers already receiving samples.

According to Jeni Barovian, Intel’s vice president for data center AI solutions, “Generative AI represents a foundational transformation of compute.” She emphasized that Gaudi 3 will deliver the performance, scalability, and efficiency required to build future AI systems.

Intel Gaudi 3: Specifications and Performance

Eitan Medina, COO of Intel’s Habana Labs, describes the Gaudi 3 as featuring a heterogeneous compute architecture that includes 64 fifth-generation Tensor Processor Cores, 8 Matrix Math Engines, 128 GB of HBM capacity with 3.7 TB/s of bandwidth, and 24 200 GbE RoCE ports.

Building solutions with Gaudi 3 is designed to be as straightforward as with Gaudi 2. Intel has doubled the network bandwidth per accelerator, allowing for extensive cluster configurations based on workload needs—be it inference, fine-tuning, or training.
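As a rough illustration of what the doubled network bandwidth means per accelerator, the short Python sketch below multiplies out the port figures. The Gaudi 3 numbers (24 ports at 200 Gb/s) come from the specification quoted above; the Gaudi 2 configuration of 24 ports at 100 Gb/s is an assumption not stated in this article, and the function and constant names are purely illustrative.

```python
def aggregate_bandwidth_gbps(ports: int, gbps_per_port: int) -> int:
    """Return the total raw Ethernet bandwidth of one accelerator in Gb/s."""
    return ports * gbps_per_port


# Gaudi 3 figures as quoted in the article: 24 ports of 200 GbE.
gaudi3_total_gbps = aggregate_bandwidth_gbps(ports=24, gbps_per_port=200)

# Assumption (not from the article): Gaudi 2 exposes 24 ports of 100 GbE.
gaudi2_total_gbps = aggregate_bandwidth_gbps(ports=24, gbps_per_port=100)

ratio = gaudi3_total_gbps / gaudi2_total_gbps

print(f"Gaudi 3 aggregate: {gaudi3_total_gbps / 1000:.1f} Tb/s")
print(f"Gaudi 2 aggregate (assumed): {gaudi2_total_gbps / 1000:.1f} Tb/s")
print(f"Ratio: {ratio:.1f}x")
```

These are raw line rates; usable throughput in a real cluster would depend on topology, protocol overhead, and workload.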

Comparison with Nvidia GPUs

Compared with Nvidia’s H100, a leading GPU for training large language models like Llama 2 and GPT-3, the Gaudi 3 is projected to be up to 1.7 times faster in training tasks. In inference tests using models like Llama-7B and Falcon 180B, Gaudi 3 reportedly performs 1.5 times faster than the H100 and 1.3 times faster than the newer H200. Notably, Intel also projects power efficiency up to 2.3 times that of the H100 in inference tasks.

Extensive Product Lineup

Alongside the Gaudi 3 chip itself, Intel is launching three complementary products:

1. Gaudi 3 AI Accelerator Card (HL-325L): OAM-compliant with 1,835 TFLOPS and 128 GB HBM2e.

2. Universal Baseboard (HLB-325): Offers 14.6 PFLOPS and over 1 TB HBM2e.

3. PCI Express Add-in Card: Features a dual-slot, passive cooling design with comparable performance metrics to its counterparts.
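The baseboard figures follow from the per-card figures if the board carries eight accelerator cards. That count is an assumption here, since the article does not state it. A minimal Python sketch of the arithmetic, with illustrative names:

```python
CARD_TFLOPS = 1835        # per-card compute quoted for the HL-325L
CARD_HBM_GB = 128         # per-card HBM2e capacity quoted for the HL-325L
CARDS_PER_BASEBOARD = 8   # assumption: not stated in the article


def baseboard_totals(cards: int, tflops_per_card: int, hbm_gb_per_card: int):
    """Return (total PFLOPS, total HBM in decimal TB) for one baseboard."""
    total_pflops = cards * tflops_per_card / 1000
    total_hbm_tb = cards * hbm_gb_per_card / 1000
    return total_pflops, total_hbm_tb


pflops, hbm_tb = baseboard_totals(CARDS_PER_BASEBOARD, CARD_TFLOPS, CARD_HBM_GB)
print(f"Baseboard compute: {pflops:.2f} PFLOPS")
print(f"Baseboard HBM: {hbm_tb:.3f} TB")
```

Under that assumption the totals come to about 14.68 PFLOPS and 1.024 TB, which is in line with the quoted 14.6 PFLOPS (if that figure is truncated rather than rounded) and "over 1 TB" of HBM2e.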

The Future of AI in Enterprises

Intel is also positioning Gaudi 3 to address enterprise-level concerns. Sachin Katti, senior VP for the network and edge group, asserts that we are entering an era of AI agents that can autonomously handle complex workflows. In the next phase of AI, these agents will draw on proprietary data, setting the stage for a significant transformation across industries.

Katti highlights the challenge of integrating proprietary data into AI systems: that data is often unstructured, scattered across various formats, and still CPU-dependent. He advocates a modular, secure ecosystem in which enterprises can choose from a range of compatible AI solutions, with a focus on responsible deployment to ensure trustworthiness and mitigate bias.

Intel aims to leverage Gaudi's enhanced capabilities to attract customers away from the Nvidia ecosystem, especially as AI costs rise. With the AI chip market projected to grow substantially, Intel is positioning itself as a viable alternative, emphasizing an open and collaborative approach to AI solutions.

Conclusion

As generative AI marks a pivotal moment in computing, Intel's Gaudi 3 introduces competitive performance and efficiency aimed at transforming enterprise AI deployment. The company’s commitment to open standards and system compatibility highlights its dedication to supporting the evolving AI landscape, promising to meet the needs of diverse enterprises seeking to harness the power of AI.
