OpenAI Advanced Voice is finally here! Full testing & review

AI Search
25 Sept 202425:08

TLDRThe video reviews OpenAI's new Advanced Voice feature, recently released and available through a paid subscription. The host demonstrates how to set up the feature, explores different voice options, and tests its capabilities by asking it to assist with language pronunciation, tell stories, and perform various tasks. While the voice feature is useful for educational and interactive purposes, it has limitations due to censorship, preventing it from performing certain sound effects like meowing or beatboxing. Despite these restrictions, the feature excels in tasks like guided meditation, language assistance, and storytelling.

Takeaways

  • 🔊 OpenAI's advanced voice feature is now available, and the user has begun testing its capabilities.
  • 📲 To access the feature, you must download the ChatGPT app and subscribe to the paid plan.
  • 🎤 Users can select from various voice options, such as Maple, to enhance their experience.
  • 🗣️ The voice feature can help with language learning, including pronunciation, but it may have limitations with certain languages like Chinese.
  • 🌙 It can tell bedtime stories with sound effects, such as wind and waterfalls, making it suitable for children's storytelling.
  • ⚠️ The feature has strict guidelines preventing it from making certain sounds or engaging in inappropriate conversations (e.g., moaning, flirting).
  • 🎧 The feature cannot analyze or tune sounds, such as musical notes or guitar tuning, but it can still guide through the process verbally.
  • 🧘 Guided meditation is available, helping users focus on their breathing and relaxation.
  • 🤖 There's a level of censorship in place, as the feature does not perform certain actions (e.g., beatboxing, singing) that early testers reported were possible.
  • 📰 The video highlights that this tool can be useful for practical tasks like language learning, storytelling, and guided exercises despite its limitations.

Q & A

  • What is the main topic of the video?

    -The video is a review and testing of OpenAI's new advanced voice feature in the ChatGPT app.

  • How can users access the advanced voice feature?

    -Users need to download the ChatGPT app from the Play Store or App Store and subscribe to a paid plan to access the advanced voice feature.

  • What is the first step to activate the advanced voice mode?

    -After subscribing, users need to press a white button in the bottom right corner of the app, and they will be prompted to activate the voice feature.

  • What type of tasks can the advanced voice feature perform?

    -The advanced voice feature can help with language pronunciation, tell bedtime stories with sound effects, assist with guided meditation, and conduct mock interviews.

  • What are some voice options users can choose from?

    -Users can choose from various voices, each with a unique tone, such as friendly, professional, or conversational styles.

  • Does the advanced voice feature have limitations?

    -Yes, it cannot produce certain sounds like meowing, singing, or beatboxing directly. It also cannot analyze or respond to live audio, such as tuning a guitar.

  • Can the advanced voice feature analyze sounds?

    -No, the advanced voice feature cannot analyze sounds. It suggested using a tuning app when the user tried to tune their guitar using the feature.

  • What is the user’s impression of the feature’s censorship?

    -The user believes there is an extra level of censorship, as the feature was unable to perform certain tasks, such as making certain sounds or responding to flirtatious comments.

  • What examples of storytelling does the advanced voice feature offer?

    -The advanced voice feature can tell creative stories, like one about a fox named Finn, using sound effects like rustling wind and crashing waterfalls.

  • What are some of the potential uses for the advanced voice feature?

    -The feature can be used for language learning, telling bedtime stories, guided meditations, mock interviews, and friendly conversation.

Outlines

00:00

📱 Accessing and Setting Up OpenAI's Advanced Voice Feature

The narrator introduces OpenAI's advanced voice feature, which they recently gained access to. They explain that the feature requires the official ChatGPT app, available on both Play Store and App Store. The feature is only available to users on a paid plan. Once subscribed, users see a white button in the bottom right corner, allowing them to activate the advanced voice mode. After permitting audio recording, users can select from various voice options, demonstrated by the narrator as they choose 'Maple.' The narrator begins testing the feature by asking for help with French pronunciation.

05:00

🗣️ Testing the Voice Feature: Pronunciation, Languages, and Fun Requests

The narrator starts experimenting with OpenAI's voice feature by testing its ability to help with French and Chinese pronunciation. The AI corrects their French pronunciation but isn't as effective in Chinese. They then shift to humorous and creative requests, such as narrating stories in different accents (British, Scottish, Indian, Australian), setting secret codes, and answering philosophical questions using code words like 'chocolate' for 'yes' and 'banana' for 'no.' Throughout the tests, the AI adheres to strict guidelines, avoiding certain topics and maintaining its programmed limitations.

10:02

💼 Digital Marketing Mock Interview and Sponsored Segment

The narrator engages in a mock interview with the AI to prepare for a digital marketing role. They discuss their background in SEO, paid ads, and growth hacking, emphasizing creativity in digital marketing. The AI provides feedback, suggesting they frame challenges as learning experiences. The paragraph also includes a sponsored segment introducing 'AI Portrait,' an AI-based service for generating professional headshots quickly and affordably. The narrator promotes its benefits compared to traditional photo shoots.

15:04

🎸 Guitar Tuning and Exploring Voice Feature Limitations

The narrator attempts to use the AI to help tune their guitar by playing a string for analysis. However, the AI cannot interpret sound and suggests using a tuning app instead. The conversation shifts toward more personal and playful requests, such as asking the AI to demonstrate emotions, beatbox, or perform sound effects, all of which the AI declines. Despite these limitations, the AI maintains a helpful demeanor and offers alternatives where appropriate.

20:06

🛏️ Bedtime Stories, Guided Meditation, and Final Thoughts

In the final test, the narrator asks the AI to tell a bedtime story with sound effects and perform guided meditation. The AI successfully delivers a story with sound effects embedded within the narrative but declines requests for sound effects on demand. The narrator reflects on the limitations and censorship in the feature, comparing their experience with early testers who reported more sound capabilities. They conclude by encouraging viewers to share their experiences with the feature and promote their free weekly AI newsletter.

Mindmap

Keywords

💡Advanced Voice Feature

This refers to a newly released functionality in the ChatGPT app that allows users to interact via voice instead of text. It requires a paid subscription and provides various voice options to choose from. The feature is the core topic of the video, which demonstrates its use and tests its capabilities.

💡Subscription Plan

The advanced voice feature requires users to be subscribed to a paid plan. This concept is essential as it restricts access to the voice function unless the user opts into a subscription, emphasizing the importance of the payment model for premium features in the app.

💡Voice Options

Within the advanced voice feature, users can select different voices for the AI. The script highlights several voice samples like 'Maple' and shows how they provide users with customization in their interactions, making the experience more personal.

💡Pronunciation Assistance

One of the key uses of the voice feature is to help users practice language pronunciation. In the video, the user asks for help in pronouncing phrases in French and Chinese, and the AI provides corrections, showcasing the feature’s utility in language learning.

💡Storytelling with Sound Effects

The voice feature is also demonstrated as a storytelling tool for bedtime stories, complete with sound effects. The AI creates a narrative involving a fox named Finn and adds sounds like rustling wind and crashing water to enhance the immersive experience.

💡Guided Meditation

The video explores how the voice feature can assist with guided meditation by leading users through breathing exercises and relaxation techniques. This shows the versatility of the feature in wellness and mindfulness applications.

💡Voice Censorship

The video mentions that while early testers could make the AI perform certain sounds like singing or beatboxing, the user encountered restrictions due to what they perceive as censorship. This suggests that OpenAI has implemented strict content moderation on certain voice-based requests.

💡Tuning a Guitar

The user attempts to use the voice feature to tune their guitar by playing strings for the AI to analyze. However, the AI does not support sound analysis and suggests using a tuning app instead, demonstrating a limitation of the feature.

💡Voice-Activated Commands

Throughout the video, the user gives the AI various commands, such as switching accents or pretending to be certain characters. The AI’s ability to respond to these commands shows how users can creatively interact with the voice feature.

💡AI Limitations

The video highlights several limitations of the advanced voice feature, such as its inability to process sound effects, analyze real-time sounds (like tuning a guitar), or engage in certain conversations due to censorship. These limitations are key to understanding the current state of the technology.

Highlights

OpenAI's advanced voice feature is now available, but it requires a paid plan.

Users need to download the ChatGPT app from official stores to access the voice feature.

The setup includes choosing from various voice options after enabling advanced voice mode.

The AI can assist with language pronunciation, offering feedback for French and Chinese.

While the AI could correct pronunciation in French, it struggled with detailed Chinese tone corrections.

It can narrate stories with sound effects, making it useful for bedtime stories.

The AI was asked to perform accents (British, Scottish, Indian, Australian), which it handled well.

It follows strict guidelines and cannot perform certain sound-related tasks like singing or meowing.

The voice assistant cannot analyze external sounds, like guitar tuning, but suggests alternatives.

Although it cannot produce musical or moaning sounds, it can simulate sound effects within storytelling.

The AI adheres to censorship, unable to perform intimate or inappropriate tasks despite creative testing.

The system helps with mock interviews and gives practical tips for answering job interview questions.

The AI cannot identify pitches or analyze audio directly from a user, relying on tools instead.

Guided meditations are one of the practical applications of the advanced voice mode.

There is strict censorship, but testers found it useful for learning, storytelling, and language practice.