Meta Launches Its First Open AI Model Capable of Image Processing

Just two months after launching its latest AI model, Meta has unveiled a significant update: its first open-source model that can process both images and text. The new Llama 3.2 model empowers developers to build advanced AI applications, such as augmented reality apps that provide real-time video analysis, visual search engines that categorize images by content, and document analysis tools that summarize lengthy texts.

Meta emphasizes that integrating Llama 3.2 will be straightforward for developers. As Ahmad Al-Dahle, Meta's vice president of generative AI, noted, developers simply need to incorporate this “new multimodality” to allow Llama to interact with images.
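To give a sense of what that integration can look like in practice, here is a minimal sketch of sending an image plus a text prompt to the 11-billion-parameter vision model. It assumes the Hugging Face transformers release of Llama 3.2; the model ID, class names, and chat format below come from that community integration, not from Meta's announcement, and may differ depending on your setup.

```python
# Minimal sketch: image + text prompt to Llama 3.2 11B Vision,
# assuming the Hugging Face transformers integration (model ID and
# class names are from that release, not from this article).
import torch
from PIL import Image
from transformers import AutoProcessor, MllamaForConditionalGeneration

model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"

model = MllamaForConditionalGeneration.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained(model_id)

# Any local image works here; the path is a placeholder.
image = Image.open("photo.jpg")

# The chat template interleaves an image slot with the text question.
messages = [
    {"role": "user", "content": [
        {"type": "image"},
        {"type": "text", "text": "Describe what is in this photo."},
    ]}
]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(image, prompt, return_tensors="pt").to(model.device)

output = model.generate(**inputs, max_new_tokens=100)
print(processor.decode(output[0], skip_special_tokens=True))
```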

With competitors like OpenAI and Google already releasing multimodal models, Meta is catching up in this arena. The addition of vision support is crucial as Meta expands its AI capabilities, particularly on devices like its Ray-Ban Meta glasses.

Llama 3.2 features two vision models (11 billion and 90 billion parameters) alongside two lightweight text-only models (1 billion and 3 billion parameters). The smaller models are optimized to run on Qualcomm and MediaTek chips and other Arm-based hardware, reflecting Meta's push to bring its models to mobile devices.

There is still a role for the previous Llama 3.1 model, released in July, which includes a 405-billion-parameter version that likely remains the stronger choice for pure text generation.
