OpenAI gives ChatGPT a voice to chat

ChatGPT is evolving into much more than a text-based search engine, with OpenAI announcing that it is adding new voice- and image-based capabilities.

The popular generative AI assistant has been one of the biggest technology success stories of recent times since its debut about nine months ago, allowing anyone to generate essays, poems and summaries from simple text-based prompts. Now ChatGPT is about to become much more interactive: users will also be able to hold a voice conversation with the chatbot.

The announcement comes as Amazon pledged to invest up to $4 billion in OpenAI rival Anthropic, a move that is one part of a battle over generative AI among the world's tech giants: Google is trying to catch up through its Bard chatbot, Meta is embracing a strong open source ethos to help it get ahead, and Microsoft is aligned closely with OpenAI.

Conversation starter

Today marks a notable evolution for the generative AI movement, with OpenAI combining the familiar world of voice-based assistants with its powerful large language models (LLMs).

For example, a user can verbally ask ChatGPT to make up a bedtime story on the spot, with some vocal cues to guide the narrative. Or the user can simply ask a question and ChatGPT will give the answer in spoken form.

Elsewhere, ChatGPT users will also be able to search for answers using images, for example by uploading an image of something and asking ChatGPT to explain what it is or provide instructions for completing an objective.

ChatGPT Image Search

The voice feature is powered by a new text-to-speech model that can generate human-like voices from text and a few seconds of sample speech. OpenAI said it partnered with established voice actors to create five different voices, with its open source Whisper speech recognition system used to transcribe spoken words into text.

Spotify was also introduced as a partner at the launch, with the music streaming giant unveiling a rather interesting new feature for podcasters that lets them sample their voice and translate their shows from English into Spanish, French or German while retaining their own original voice. However, it seems that OpenAI is being careful not to attract criticism, as it is not making this technology available to just anyone; it has worked specifically with podcasters such as Dax Shepard, Monica Padman, Lex Fridman, Bill Simmons and Steven Bartlett for the launch.

"New voice technology, capable of creating realistic synthetic voices from just a few seconds of real speech, opens the doors to many creative and accessibility-focused applications," the company wrote in a blog post. "However, these capabilities also present new risks, such as the possibility of malicious actors impersonating public figures or committing fraud."

The new features will begin rolling out to paid Plus and Enterprise subscribers soon. To activate voice features, users must go to the app's "settings" menu, then "new features" and opt for voice conversations. They must then tap the headset button in the upper right corner and select the voice they want.

Voice will initially be limited to the Android and iOS ChatGPT apps in an optional beta, while image search will come to all platforms by default.
