The well-funded French AI startup Mistral, renowned for its advanced open-source AI models, has launched two new large language models (LLMs): a math-focused model and a code-generation model for developers, the latter built on the Mamba architecture introduced by researchers late last year.
Mamba aims to improve on the efficiency of the transformer architecture used by most leading LLMs by replacing its attention mechanism with a selective state-space mechanism. This allows Mamba-based models to deliver faster inference and handle longer contexts than typical transformer models. Other companies, including AI21, have also released AI models built on this architecture.
Mistral’s new Codestral Mamba 7B is designed for rapid response times even with long input texts, making it well suited to local coding projects. Available through Mistral's la Plateforme API, it can handle inputs of up to 256,000 tokens, double the context window of OpenAI’s GPT-4o.
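As a rough illustration of what that looks like in practice, the sketch below sends a coding prompt to la Plateforme. It assumes the API follows Mistral's standard chat-completions format and that the model is exposed under an identifier such as "codestral-mamba-latest"; neither detail is confirmed by this article, so check Mistral's documentation before relying on it.

```python
# Hypothetical sketch: querying Codestral Mamba through Mistral's
# la Plateforme chat-completions endpoint. The model identifier
# "codestral-mamba-latest" and the exact request shape are assumptions
# based on Mistral's OpenAI-style API, not details confirmed in this article.
import os
import requests

API_URL = "https://api.mistral.ai/v1/chat/completions"

def complete_code(prompt: str) -> str:
    """Send a single coding prompt and return the model's reply."""
    response = requests.post(
        API_URL,
        headers={"Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}"},
        json={
            "model": "codestral-mamba-latest",  # assumed model name
            "messages": [{"role": "user", "content": prompt}],
            "temperature": 0.2,
        },
        timeout=60,
    )
    response.raise_for_status()
    return response.json()["choices"][0]["message"]["content"]

if __name__ == "__main__":
    print(complete_code("Write a Python function that reverses a linked list."))
```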
In benchmarking tests, Codestral Mamba outperformed rival open-source models such as CodeLlama 7B, CodeGemma 1.1 7B, and DeepSeek on the HumanEval evaluation.
Developers can modify and deploy Codestral Mamba from its GitHub repository and Hugging Face under the open-source Apache 2.0 license. Mistral asserts that the earlier Codestral model surpassed other code generators, including CodeLlama 70B and DeepSeek Coder 33B.
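For developers who want to run the weights locally rather than call the API, a minimal sketch might look like the following. It assumes the checkpoint is published on Hugging Face under a repository ID like "mistralai/Mamba-Codestral-7B-v0.1" and that a recent transformers release supports the Mamba-based architecture; the model card may recommend a different loading path (for example, Mistral's own inference library).

```python
# Hypothetical sketch: running Codestral Mamba locally from a Hugging Face
# checkpoint. The repository ID and transformers support for the Mamba-based
# architecture are assumptions; consult the model card for the official setup.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "mistralai/Mamba-Codestral-7B-v0.1"  # assumed repo name

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # keeps a 7B model within a single modern GPU
    device_map="auto",
)

prompt = "# Write a function that checks whether a string is a palindrome\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```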
AI-powered code generation and coding-assistant tools have become essential applications, with platforms such as GitHub Copilot, Amazon CodeWhisperer, and Codeium gaining traction.
Mistral's second release, Mathstral 7B, targets math-related reasoning and scientific discovery and was developed in collaboration with Project Numina. Mathstral has a 32k-token context window and is released under the Apache 2.0 open-source license. Mistral says it outperforms existing models designed for math reasoning and delivers "significantly better results" on benchmarks that allow more inference-time computation. Users can run it as is or fine-tune it for specific needs.
“Mathstral exemplifies the excellent performance-to-speed tradeoffs achievable when constructing models for specialized applications—a philosophy we are committed to in la Plateforme, particularly with its enhanced fine-tuning capabilities,” Mistral shared in a blog post.
Mathstral is accessible through Mistral's la Plateforme and Hugging Face.
Competing directly with industry leaders like OpenAI and Anthropic, Mistral recently secured $640 million in Series B funding, boosting its valuation to nearly $6 billion, with investments from tech giants including Microsoft and IBM.