is a news writer who covers the streaming wars, consumer tech, crypto, social media, and much more. Previously, she was a writer and editor at MUO.
OpenAI is rolling out the latest version of its AI-powered image generator with new “thinking capabilities” that let it search the web and create multiple images from a single prompt. In a blog post, OpenAI says ChatGPT Images 2.0 can now create more “sophisticated” images, with improvements to its ability to follow instructions, preserve details of your choosing, and generate text.
It’s powered by OpenAI’s new GPT Image 2 model, with new thinking capabilities available to ChatGPT Plus, Pro, Business, and Enterprise subscribers. When a thinking model is selected, the chatbot’s image generator can pull information from the web, create visual explainers based on files you upload, and “reason through the structure of the image before generating.”
Image: OpenAI
ChatGPT Images 2.0 can also create up to eight images at once with thinking enabled, all while maintaining the same characters, objects, and styles in each scene. OpenAI says this should make it easier to generate things like manga pages, a series of social graphics, or design plans for every room in a house.
All ChatGPT users can take advantage of updates that let ChatGPT Images 2.0 “better capture the defining characteristics of photos,” in addition to pixel art, manga, cinematic stills, and other types of images. It can now generate images with a resolution of up to 2K and in more aspect ratios, ranging from wider formats, such as 3:1, to taller ones like 1:3. And it’s not only better at rendering text in English and other Latin-script languages; OpenAI says Images 2.0 makes “significant gains” in creating images containing text in Japanese, Korean, Chinese, Hindi, and Bengali.