Google Unveils Imagen 3: A Major Upgrade in AI Image Generation with Enhanced Features and Accessibility
- Oct 12, 2024
- 155
In a remarkable turn of events, Google has unveiled a major enhancement for its proprietary artificial intelligence model, Gemini. This update introduces the Imagen 3 AI model for image generation, making it accessible to all users. This latest version stands as the most advanced creation from the tech giant in Mountain View, providing powerful image generation features. Furthermore, the new functionality is available not just within the Gemini app, but also through its API, enabling developers to create applications and experiences that leverage this upgraded capability.
Access to Imagen 3 is now granted to all Gemini users, including those utilizing the free version. An announcement on X (formerly Twitter) confirmed that this advanced model allows for image creation that showcases a high level of realism, improved adherence to user prompts, and the inclusion of fewer extraneous elements.
Verification from Gadgets 360 confirmed that image generation in the Gemini app employs the Imagen 3 model. To evaluate its performance against Meta AI, a direct comparison was conducted using the same prompt: "Create an image of a golden retriever dog sitting on a train berth, gazing out the window at the Alps. The train features a wooden interior with green seats, and all other passengers are animals, while one human conductor checks for tickets."
The images produced by both AI systems are displayed above. Though both models missed certain elements from the prompt, Gemini successfully incorporated more aspects than its counterpart. Moreover, while images generated by Meta AI have a resolution of 1280 x 1280, those from Imagen 3 boast a higher resolution of 2048 x 2048.
Imagen 3 supports a diverse array of artistic styles, ranging from photorealistic to textured oil paintings and claymation scenes. Additionally, users can specify to mimic photos captured with particular cameras, including Nikon DSLR, GoPro style, and wide-angle lenses.
Google has emphasized that its new AI model includes built-in protections designed to minimize the risk of misuse, such as deepfakes. Every image generated also carries a SynthID watermark, a unique feature that embeds an imperceptible AI label within the image’s pixels, ensuring it remains intact even in screenshots.