Today, Snowflake unveiled Arctic, a large language model (LLM) crafted for complex enterprise tasks like SQL generation, code creation, and instruction adherence.
Marketed as the “most open enterprise-grade LLM,” Arctic employs a mixture-of-experts (MoE) architecture to deliver top-tier results on enterprise workload benchmarks with high efficiency. It also performs competitively against open models from Databricks, Meta, and Mistral in areas such as world knowledge, common-sense reasoning, and mathematical capability.
“This is a watershed moment for Snowflake, with our AI research team at the forefront of innovation,” said CEO Sridhar Ramaswamy. “By providing industry-leading intelligence and efficiency in an open manner, we’re expanding the potential of open-source AI. Our research with Arctic will significantly enhance our ability to deliver reliable and efficient AI to our customers.”
The launch serves as Snowflake's strategic move to compete with Databricks, which has been aggressive in its AI initiatives. Snowflake’s focus on AI has recently intensified after its acquisition of Neeva and Ramaswamy's appointment as CEO.
Arctic: Designed for Enterprise Workloads
As modern enterprises embrace generative AI, there’s a surge in developing applications like retrieval-augmented generation (RAG) chatbots, data copilots, and code assistants. While numerous models exist, few specifically target enterprise requirements—this is where Snowflake Arctic excels.
“We believe AI will enhance the development of end-to-end AI products. Our vision is to create an API that allows business users to directly interact with data, democratizing it across the enterprise. Arctic is a vital step towards realizing this vision,” said Ramaswamy in a recent briefing.
Arctic utilizes a Dense MoE hybrid architecture, segmenting parameters into 128 specialized expert subgroups. These experts process only the input tokens they are best equipped to handle, activating only 17 billion of the 480 billion parameters when responding to a query. This targeted approach ensures high performance with minimal computing power.
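The routing idea described above can be illustrated with a minimal sketch: a gate scores every expert for a given token, and only the top-scoring few are activated. This is a generic top-k MoE gate, not Snowflake's published routing code, and the choice of k=2 and the random stand-in scores are assumptions for illustration.

```python
import math
import random

def top_k_route(logits, k=2):
    """Pick the k highest-scoring experts for a token and
    softmax-normalize their scores into mixing weights."""
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    m = max(logits[i] for i in top)                 # subtract max for stability
    exps = [math.exp(logits[i] - m) for i in top]
    s = sum(exps)
    return top, [e / s for e in exps]

random.seed(0)
NUM_EXPERTS = 128  # Arctic's expert count
# Router scores for one token (random stand-ins for a learned gate).
scores = [random.gauss(0, 1) for _ in range(NUM_EXPERTS)]
experts, weights = top_k_route(scores)
# Only the chosen experts' parameters run for this token;
# the other experts stay idle, which is where the compute savings come from.
```

Because each token activates only a handful of the 128 experts, the per-token compute scales with the active subset rather than the full 480 billion parameters.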
Benchmarks indicate that Arctic handles enterprise tasks effectively, averaging 65% across these tests, close to the 70% enterprise average posted by both Llama 3 70B and Mixtral 8X22B.
In the Spider benchmark for SQL generation, Arctic scored 79%, surpassing Databricks' DBRX and Mixtral 8X7B, and coming close to Llama 3 70B and Mixtral 8X22B. For coding tasks, Arctic achieved 64.3%, outperforming Databricks and the smaller Mixtral model while falling short of Llama 3 70B and Mixtral 8X22B.
Notably, in the IFEval benchmark for instruction-following capabilities, Arctic scored 52.4%, outperforming most competitors except for the latest Mixtral model.
Efficiency and Cost-Effectiveness
Snowflake asserts that Arctic reached this level of enterprise intelligence with groundbreaking efficiency, on a training compute budget of under $2 million. By comparison, Llama 3 70B was trained with 17 times more compute. Arctic's activation of only 17 billion parameters per token further enhances its cost-effectiveness at inference time.
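To put that sparsity in perspective, the figures above imply that each token touches only a small fraction of Arctic's weights:

```python
TOTAL_PARAMS = 480e9    # Arctic's total parameter count
ACTIVE_PARAMS = 17e9    # parameters activated per token

active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
print(f"{active_fraction:.1%}")  # about 3.5% of weights active per token
```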
Availability Under the Apache 2.0 License
Snowflake is making Arctic accessible through Cortex, its LLM application development service, and across various model catalogs, including Hugging Face, Lamini, Microsoft Azure, Nvidia API catalog, Perplexity, and Together. Users can download Arctic's model weights and code from Hugging Face under an Apache 2.0 license, facilitating unrestricted personal, commercial, or research use.
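Pulling the weights from Hugging Face follows the usual transformers workflow. The sketch below is illustrative: the repo id is an assumption based on Snowflake's Hugging Face organization, the prompt wrapper is a made-up single-turn format rather than Arctic's official chat template, and actually loading a ~480B-parameter model requires a multi-GPU host.

```python
# Assumed repo id on Snowflake's Hugging Face org; verify before use.
MODEL_ID = "Snowflake/snowflake-arctic-instruct"

def build_prompt(user_message: str) -> str:
    """Wrap a user message in a simple single-turn prompt.
    (Illustrative format, not Arctic's official chat template.)"""
    return f"User: {user_message}\nAssistant:"

if __name__ == "__main__":
    # Heavy dependencies kept inside the guard; loading the full model
    # needs substantial GPU memory and is shown for shape only.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, device_map="auto", trust_remote_code=True
    )
    inputs = tokenizer(build_prompt("Write a SQL query that counts orders per customer."),
                       return_tensors="pt")
    out = model.generate(**inputs, max_new_tokens=128)
    print(tokenizer.decode(out[0], skip_special_tokens=True))
```

The Apache 2.0 license means this kind of local loading, fine-tuning, and redistribution is permitted without a gated access request.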
In conjunction with the model release, Snowflake is providing a data recipe for efficient fine-tuning on a single GPU and comprehensive research cookbooks detailing the model's design and training processes.
“The cookbook is crafted to accelerate the learning curve for anyone interested in world-class MoE models, offering both high-level insights and detailed technical specifications to empower users to construct efficient and cost-effective LLMs like Arctic,” stated Baris Gultekin, Snowflake’s head of AI, during the press briefing.