Google's Imagen 3: Enhanced Text-to-Image Foundation Model Now Available on Vertex AI


Updated on October 25 2024

Google's latest text-to-image foundation model, Imagen 3, is set to launch on the Vertex AI platform. It will be available in preview to select customers, offering developers faster image generation, better prompt comprehension, more photorealistic depictions of people, and better text rendering than previous versions.

Introduced at Google I/O in May, Imagen 3 first became available as a private preview in ImageFX for select creators. Google's announcement said the model would soon be accessible via Vertex AI.

Douglas Eck, senior research director at Google DeepMind, emphasized its capabilities, stating, “It’s our most capable image generation model yet. Imagen 3 is more photorealistic, richer in detail, and it minimizes visual artifacts. It comprehends prompts crafted in a natural, creative manner—detailed instructions yield the best results. Additionally, it excels at incorporating subtle details from longer prompts and improves text rendering, a persistent challenge in earlier image generation models.”

With the transition to Vertex AI, Imagen 3 introduces multi-language support, robust safety features such as Google DeepMind’s SynthID digital watermarking, and support for various aspect ratios.

Shutterstock, a leader in stock photography, has already integrated this model. Justin Hiza, vice president of data services at Shutterstock, remarked, “Since incorporating Imagen into our AI image generator, our users have created millions of images. We’re thrilled about the improvements Imagen 3 offers, allowing users to realize their ideas more quickly without compromising quality. This enhancement further solidifies Shutterstock’s commitment to an ethically-sourced AI image generator, ensuring safety and protection through Google Cloud’s indemnification for generative AI.”

While Google continues to develop Imagen, it has not said when Gemini will resume image generation, which was halted after criticism over inaccuracies. During a recent press briefing, Google Cloud CEO Thomas Kurian explained the difference between the two models: “Gemini is a multimodal model designed to process diverse types of input, including images, video, and audio, enabling reasoning across these modalities. In contrast, Imagen is a diffusion model focused solely on generating high-fidelity text-to-image outputs. They serve distinct purposes.”

Questions about the timeline for re-enabling Gemini’s image functionality remain unanswered.

