Date:

ChatGPT’s Image-Generation Feature Gets an Upgrade

OpenAI Unveils Major Upgrade to ChatGPT’s Image-Generation Capabilities

New Feature Leverages GPT-4o Model for Native Image and Photo Creation

During a livestream on Tuesday, OpenAI CEO Sam Altman announced the first major upgrade to ChatGPT’s image-generation capabilities in over a year. The company’s AI-powered chatbot platform has long been known for its ability to generate and edit text, but now, it can also natively create and modify images and photos.

How it Works

The new feature leverages OpenAI’s GPT-4o model, which has been used to power the company’s AI video-generation product, Sora. GPT-4o is capable of generating more accurate and detailed images than its predecessor, DALL-E 3, which it replaces. The model can also edit existing images, including those with people in them, by transforming or "inpainting" details like foreground and background objects.

Training Data

To power the new image feature, OpenAI trained GPT-4o on publicly available data, as well as proprietary data from its partnerships with companies like Shutterstock. This training data is a key component of the model’s ability to generate high-quality images, but it’s also a sensitive topic for many companies, who often keep it close to the chest to avoid potential IP-related lawsuits.

Policies and Guidelines

OpenAI has implemented policies to ensure the responsible use of its image-generation capabilities. The company respects the rights of artists and has an opt-out form for creators to request that their works be removed from its training datasets. Additionally, OpenAI respects requests to disallow its web-scraping bots from collecting training data, including images, from websites.

Competition and Controversy

The new feature follows on the heels of Google’s experimental native image output for Gemini 2.0 Flash, one of the company’s flagship models. While the feature went viral on social media, it was met with controversy due to a lack of guardrails, allowing users to remove watermarks and create images depicting copyrighted characters.

Conclusion

The upgrade to ChatGPT’s image-generation capabilities is a significant step forward for the company, offering users new creative possibilities and applications. As the technology continues to evolve, it’s essential for companies like OpenAI to prioritize responsible use and respect for the rights of artists and creators.

Frequently Asked Questions

  • Q: What is GPT-4o?
    A: GPT-4o is a model used by OpenAI to generate and modify images and photos.
  • Q: How does GPT-4o work?
    A: GPT-4o uses publicly available data and proprietary data from OpenAI’s partnerships to generate and modify images and photos.
  • Q: How does OpenAI ensure responsible use of its image-generation capabilities?
    A: OpenAI has implemented policies to ensure responsible use, including an opt-out form for creators and a prohibition on web-scraping bots collecting training data from websites.

Latest stories

Read More

LEAVE A REPLY

Please enter your comment!
Please enter your name here