Last Updated on September 24, 2024 7:16 pm by Laszlo Szabo / NowadAIs | Published on September 24, 2024 by Laszlo Szabo / NowadAIs
OpenAI Unveils Advanced Voice Mode for ChatGPT, Avoids Scarlett Johansson Controversy – Key Notes
- OpenAI is launching the “Advanced Voice Mode” (AVM) feature for paying ChatGPT users, starting with Plus and Teams tiers.
- AVM introduces 5 new nature-inspired voices and improved voice recognition capabilities.
- OpenAI had to remove a previous voice option, “Sky,” due to a legal dispute with Scarlett Johansson over its similarity to her voice.
ChatGPT Gains Advanced Voice Mode for Paying Customers
Advanced Voice is rolling out to all Plus and Team users in the ChatGPT app over the course of the week.
While you’ve been patiently waiting, we’ve added Custom Instructions, Memory, five new voices, and improved accents.
It can also say “Sorry I’m late” in over 50 languages. pic.twitter.com/APOqqhXtDg
— OpenAI (@OpenAI) September 24, 2024
On Tuesday, OpenAI declared that it will be launching the Advanced Voice Mode (AVM) for a larger group of paying clients of ChatGPT. This new audio feature, which enhances the conversational experience with ChatGPT, will be first available to the Plus and Teams tiers of ChatGPT’s customers. Enterprise and Edu clients will gain access to this feature starting next week.
AVM Gets a Redesign
AVM is undergoing a redesign as part of its release. The presentation of the feature now features a blue animated sphere instead of the previous animated black dots that were showcased by OpenAI in May during the publication of the technology.
New Voice Options for ChatGPT
ChatGPT is introducing five additional voices for users to experiment with: Arbor, Maple, Sol, Spruce, and Vale. This brings the total number of voices on ChatGPT to nine, which is almost equal to the number of voices available on Google’s Gemini Live. These new voices, namely Breeze, Juniper, Cove, and Ember, all draw inspiration from nature, fitting with the overall goal of AVM to enhance the naturalness of using ChatGPT.
Scarlett Johansson Controversy and Removal of “Sky” Voice
One of the voices that is not included in this lineup is Sky, the voice that was displayed by OpenAI in its Spring Update. This caused a legal issue when Scarlett Johansson, who portrayed an AI system in the movie “Her”, claimed that Sky’s voice sounded too similar to her own. As a result, OpenAI quickly removed Sky’s voice and stated that they did not intend for it to resemble Johansson’s voice, despite several staff members referencing the movie in their tweets at the time.
Multimodal Capabilities Still Pending
The latest release of ChatGPT does not include the video and screen sharing feature that was introduced by OpenAI in their Spring update four months ago. This functionality was designed to allow GPT-4o to process both visual and audible data simultaneously. During the demonstration, a member of the OpenAI team showcased the ability to ask ChatGPT real-time questions about math written on paper or code displayed on a computer screen. However, there is currently no timeline for when these multimodal capabilities will be available.
Improvements and Limitations of AVM
According to OpenAI, some enhancements have been made to AVM after the initial release of its restricted alpha test. The voice function of ChatGPT is reportedly more proficient in comprehending accents, and the company asserts that conversations are now more seamless and efficient. While using AVM in our trials, we encountered occasional malfunctions, but the company assures that this has been addressed.
Expanded Customization Options for AVM
In addition, OpenAI is also broadening the scope of AVM’s customization options, such as Custom Instructions, which enables users to personalize their interactions with ChatGPT, and Memory, which enables ChatGPT to retain conversations for future reference.
Limited Regional Availability for AVM
According to a representative from OpenAI, the AVM is currently unavailable in various regions such as the EU, UK, Switzerland, Iceland, Norway, and Liechtenstein.
Descriptions:
Advanced Voice Mode (AVM): This is a new audio feature from OpenAI that enhances the conversational experience with ChatGPT. It allows users to interact with the AI assistant using natural voice commands, rather than just text-based interactions.
Voices: ChatGPT is introducing 5 additional voices for users to experiment with – Arbor, Maple, Sol, Spruce, and Vale. These new voices, along with the existing ones (Breeze, Juniper, Cove, and Ember), are all inspired by nature, aiming to make the voice interactions more lifelike.
Scarlett Johansson Controversy: One of the previous voice options, “Sky,” had to be removed by OpenAI due to a legal issue. Actress Scarlett Johansson, who portrayed an AI system in the movie “Her,” claimed the voice sounded too similar to her own. OpenAI stated they did not intend for the voice to resemble Johansson’s.
Multimodal Capabilities: The latest ChatGPT update does not include the previously announced video and screen sharing features. These were designed to allow the AI to process both visual and audible data simultaneously, enabling users to ask questions about written math or displayed code. However, a timeline for when these capabilities will be available is still unclear.
Customization Options: OpenAI is expanding the customization options for AVM, such as “Custom Instructions” (personalized user interactions) and “Memory” (retaining conversation history).
Regional Availability: AVM is currently unavailable in certain regions, including the EU, UK, Switzerland, Iceland, Norway, and Liechtenstein.
Frequently Asked Questions:
- What is the “Advanced Voice Mode” (AVM) in ChatGPT?
AVM is a new audio feature from OpenAI that enhances the conversational experience with ChatGPT. It allows users to interact with the AI assistant using natural voice commands, rather than just text-based interactions. - What new voice options has ChatGPT introduced?
ChatGPT is introducing 5 new voices for users to experiment with: Arbor, Maple, Sol, Spruce, and Vale. These new voices, along with the existing ones, are all inspired by nature to make the voice interactions more lifelike. - Why did OpenAI remove the “Sky” voice option?
The “Sky” voice option had to be removed due to a legal issue. Actress Scarlett Johansson, who portrayed an AI system in the movie “Her,” claimed the voice sounded too similar to her own. OpenAI stated they did not intend for the voice to resemble Johansson’s. - When will ChatGPT’s multimodal capabilities be available?
The latest ChatGPT update does not include the previously announced video and screen sharing features, which were designed to allow the AI to process both visual and audible data simultaneously. However, a timeline for when these capabilities will be available is still unclear. - Where is the “Advanced Voice Mode” currently available?
According to OpenAI, the AVM is currently unavailable in certain regions, including the EU, UK, Switzerland, Iceland, Norway, and Liechtenstein.