Quick Overview
Google released its new AI model for generating images, called Imagen 3. The company did not make a big announcement about this release. Instead, it made the model available to users without much fanfare. Along with the release, Google also published a research paper online explaining how the model works.
Availability and Features
Right now, Imagen 3 is only available to users in the US. There is no information yet on when it will be available in other countries. Users can access the model through Google’s AI Test Kitchen. This platform lets people sign up and start using Imagen 3 to create images.
Imagen 3 is the third version of Google’s Imagen model. It features improved texture generation, better word recognition, and stricter adherence to prompts. This means it should be better at creating images based on the descriptions given.
User Experience and Issues
Currently, there are some mixed reviews about Imagen 3. However, some users seems to have trouble with close-up images involving multiple people and poorly lit scenes. These were areas where the previous version of the model performed better.
Another issue reported is with generating images of limbs.
Users found that when they used prompts like “a guy holding a cup of coffee,” the model sometimes produced extra limbs or merged limbs with objects in unusual ways. The model also has very strict censorship rules for prompts, which can affect the results.
Technical Detail
Google’s research paper, published on arXiv, explains some of the technical aspects of Imagen 3. The model uses a technique called latent diffusion. This method is a variation of the diffusion model popularized by Stable Diffusion. Google also mentioned that they have implemented new methods to reduce potential harm from using the model.
Comparison with Other Models
Unlike Gemini’s free-tier chatbot, which can also generate images but uses different capabilities, Imagen 3 is built on a distinct architecture. Imagen 3 is trained mainly on image data, which helps it generate better and more accurate AI images.
Summary of Points about Imagen 3
- Google released its Imagen 3 AI model for image generation on Thursday.
- The model is currently available only to users in the US.
- Users can access Imagen 3 through Google’s AI Test Kitchen.
- Imagen 3 has improved texture, better word recognition, and stricter prompt adherence.
- Some users report issues with close-up images and underlit scenes.
- The model struggles with generating accurate limbs in images.
- Google’s research paper details the use of latent diffusion, a variant of diffusion models.
- Imagen 3 is different from Gemini’s image generation, focusing more on image-specific training.