OpenAI Enhances ChatGPT with Video Processing and Real-Time Visual Interactions
Posted: Sat Dec 14, 2024 12:03 am
OpenAI continues to expand the capabilities of its generative AI chatbot, ChatGPT, which already answers questions, generates code, and more from text and image prompts. Now OpenAI is going further by letting ChatGPT process video and interact with users in real time using visual input.
Video and Visual Recognition: The Next Big Step
During a live-streamed event, OpenAI unveiled the feature for ChatGPT. Users can now point their smartphone cameras at objects, and ChatGPT will provide insights based on what it observes. Think of it as ChatGPT merging the functionality of Google Lens with AI-driven conversation. With this update, the app can visually "see" the objects users point at and offer context-aware responses, according to a Bloomberg report.
Who Gets It and When?
The rollout begins this week for paid ChatGPT Plus and Pro subscribers. Educational and enterprise users can expect access by January 2025, aligning with OpenAI's vision to embed advanced AI tools across sectors.
Part of a Broader AI Push
The addition of video processing is part of OpenAI's ambitious 12-day series of product launches. Alongside this, OpenAI has introduced a premium ChatGPT Pro subscription and unveiled an AI video generation tool named Sora Turbo, which allows users to create high-quality videos from text prompts.
Sora Turbo: Redefining Video Creation
Sora was first shown as a research preview in February 2024; Sora Turbo, a faster version of the model, is now accessible to ChatGPT Plus and Pro users. The tool creates videos up to 20 seconds long at resolutions up to 1080p, in widescreen, vertical, and square formats.
With Sora Turbo, users can:
- Generate and remix videos from scratch or using personal assets.
- Customize their creations with a storyboard interface for frame-by-frame precision.
- Explore trending video creations through a dedicated community showcase.
Limitations and Future Plans
Currently, Sora Turbo is geographically restricted and unavailable in regions such as the EU, UK, and Switzerland. Users must be 18 or older to access this feature. Plus subscribers can generate up to 50 videos per month at 480p resolution, while Pro users enjoy higher production limits and resolutions.
Looking ahead, OpenAI plans to introduce flexible pricing models in 2025 to cater to a broader audience.