ChatGPT's sensational upgrade: Seeing, Hearing, and Conversing!

Published October 2nd, 2023 - 02:51 GMT
ChatGPT open AI
Ai tech, businessman show virtual graphic Global Internet connect Chatgpt Chat with AI (Shutterstock)

ALBAWABA - OpenAI has unveiled an expansion of capabilities for ChatGPT, their widely adopted chatbot. Since its debut last year, ChatGPT has become a versatile tool, finding applications in tasks ranging from document summarization to coding, sparking intense competition in the field of artificial intelligence among major tech players.

According to an official blog post by OpenAI, this recent update equips ChatGPT with the remarkable ability to engage in spoken conversations with users and interact using visual inputs.

OpenAI is touting these new voice and image-focused features as a way to integrate ChatGPT more seamlessly into everyday life. Users can now engage in live discussions with ChatGPT about various aspects of a location by simply taking a photograph while traveling. 

Similarly, at home, users can snap a picture of the contents of their refrigerator and request ChatGPT for a dinner recipe. Furthermore, users can leverage photos to have ChatGPT assist their children in solving math problems.

OpenAI also highlights that users can now initiate spoken conversations with ChatGPT, opening up possibilities for interactions on the go. Whether it's asking ChatGPT to spin a tale or involving it in dinner table conversations, the new voice feature provides an added dimension to user experiences.

To activate ChatGPT's voice feature on the mobile app, users need to navigate to the "Settings" menu in the "New Features" tab. From there, they can select their preferred voice from five distinct options using the headphone icon located in the upper right corner of the main screen.

The introduction of the voice feature is underpinned by a novel text-to-speech model capable of generating speech that closely resembles human conversation from textual input and brief conversational samples. OpenAI emphasizes that they collaborated with professional voice actors to ensure the quality of these voice options.

OpenAI has stated that these newly introduced voice and visual-oriented features will be gradually rolled out to ChatGPT Plus and Enterprise users over the next two weeks. Shortly after, these features will become accessible to various other user groups, including developers. Furthermore, the announcement clarifies that the voice feature will be available on both iOS and Android versions, while the visual feature will be accessible across all platforms.

Subscribe

Sign up to our newsletter for exclusive updates and enhanced content