Meta Launches Its First Open-Source AI Model That Can Process Images

Just two months after its last major AI model release, Meta has unveiled a significant update: Llama 3.2, its first open-source model that processes both images and text. This new model empowers developers to create more sophisticated AI applications, such as augmented reality experiences that offer real-time video analysis, content-based visual search engines, and document summarization tools.

Meta says integrating the model's multimodal capabilities will require minimal effort from developers. "Developers can easily showcase Llama's ability to process images and communicate," said Ahmad Al-Dahle, Vice President of Generative AI at Meta.

Other AI companies, including OpenAI and Google, launched multimodal models last year, so Meta is catching up; the capability is especially important as the company builds AI features into hardware such as its Ray-Ban Meta smart glasses.

Llama 3.2 includes two vision models, with 11 billion and 90 billion parameters, as well as two lightweight text-only models with 1 billion and 3 billion parameters. The smaller models are optimized for Qualcomm, MediaTek, and other Arm hardware, with an eye toward practical use on mobile devices.
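
As a rough illustration of what that "minimal effort" integration could look like, the sketch below asks the smaller vision model to summarize a scanned document. It assumes the weights are distributed through the Hugging Face transformers library (version 4.45 or later) under the id meta-llama/Llama-3.2-11B-Vision-Instruct; that model id, the MllamaForConditionalGeneration class, and the invoice.png input are illustrative assumptions rather than details confirmed in Meta's announcement.

```python
# Hypothetical sketch: prompting a Llama 3.2 vision model with an image.
# Assumes the weights are available via Hugging Face transformers (>= 4.45)
# and that access to the gated model repository has been granted.
import torch
from PIL import Image
from transformers import AutoProcessor, MllamaForConditionalGeneration

model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"  # assumed repo id

model = MllamaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

image = Image.open("invoice.png")  # placeholder: any local document image

# Chat-style prompt that pairs the image with a text instruction.
messages = [
    {"role": "user", "content": [
        {"type": "image"},
        {"type": "text", "text": "Summarize this document in two sentences."},
    ]}
]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(image, prompt, return_tensors="pt").to(model.device)

output = model.generate(**inputs, max_new_tokens=128)
print(processor.decode(output[0], skip_special_tokens=True))
```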

The earlier Llama 3.1, released in July, remains relevant: it includes a much larger 405-billion-parameter model that is expected to stay stronger at pure text generation.
