Nvidia has recently become the world's most valuable company, generating significant revenue from its high-demand GPUs, which command premium prices due to resource scarcity and market dynamics. This situation raises critical questions: “How will we secure the revenue needed to purchase the GPUs, and what resources are required to support our workloads?”
Nvidia's CEO, Jensen Huang, now the 11th richest person globally, expresses concerns that many customers lack the necessary data centers and power to fully utilize the chips they've purchased. The company continues to regulate chip allocations to prevent stockpiling amidst limited supply. This has led to tensions with Microsoft, which is unhappy about Nvidia's influence over how it integrates GPUs into its data centers.
In response to market pressures, Dell CEO Michael Dell announced a partnership with Nvidia aimed at creating a new AI factory for Elon Musk’s startup, xAI. This initiative also focuses on assisting companies in building data centers. Additionally, Hewlett Packard Enterprise (HPE) has partnered with Nvidia to offer turnkey private-cloud AI solutions.
The escalating costs associated with scaling infrastructure are now a hot topic, particularly in light of the ongoing Chip Wars and the challenges in securing computing power. Will rising infrastructure costs hinder AI's potential? This critical question will be explored during Transform 2024, live in San Francisco. Industry leaders will delve into the current landscape and its implications for enterprises, as well as alternative technologies that are gaining traction.
Key speakers include Kirk Bresniker, Chief Architect at Hewlett Packard Labs; Dr. Jamie Garcia, Director of Quantum Algorithms and Partnerships at IBM; and Paul Roberts, Director of Strategic Accounts at AWS. They will discuss the race to scale AI workloads while managing infrastructure costs, the rise of alternative providers aimed at enhancing AI workload performance, and the reduction of costs and environmental impact.
Join us at VB Transform 2024, taking place live in San Francisco from July 9-11. The event will focus on scaling AI effectively, featuring practical generative AI case studies and insights from industry leaders. Register now!