Welcome Guest! You are here: Home » Science & Technology

"AudioPaLM": This Google tool tackles speech generation tasks with greater accuracy

Upon evaluation, AudioPaLM outperformed existing systems in speech translation by a significant margin. Read More

Monday June 26, 2023 5:39 PM, ummid.com News Network

San Francisco: A team of researchers from Google has introduced AudioPaLM, a Large Language Model (LLM) that can tackle speech understanding and generation tasks with greater accuracy.

AudioPaLM combines the advantages of two existing models - PaLM-2 and AudioLM.

PaLM-2 and AudioLM

AudioPaLM represents a multimodal architecture that effectively brings together the strengths of two established models: PaLM-2 and AudioLM.

PaLM-2 excels in comprehending text based linguistic knowledge, making it a robust text oriented language model whereas AudioLM demonstrates exceptional proficiency in retaining paralinguistic details such as speaker identity and tone.

Through combination of these two models, AudioPaLM harnesses the linguistic expertise of PaLM-2 and the paralinguistic information preservation capabilities of AudioLM.

This results in a comprehensive understanding and generation of both text and speech.

"Shared Vocabulary"

To facilitate this integration, AudioPaLM employs a shared vocabulary that effectively represents both speech and text using a finite set of discrete tokens. This unification enables various tasks, including speech recognition, text-to-speech synthesis, and speech-to-speech translation, to be seamlessly integrated within a single architecture and training process.

Upon evaluation, AudioPaLM outperformed existing systems in speech translation by a significant margin. It demonstrated the ability to perform zero-shot speech-to-text translation for language combinations which means it can accurately translate speech into text for languages it has never encountered before, opening up possibilities for broader language support.

It is unclear when this technology will be implemented into final products, but we can see Google Translate and other apps getting major upgrades through this development.

For all the latest News, Opinions and Views, download ummid.com App.

Select Language To Read in Urdu, Hindi, Marathi or Arabic.

Top Headlines

Post Comments

Note: By posting your comments here you agree to the terms and conditions of www.ummid.com

"AudioPaLM": This Google tool tackles speech generation tasks with greater accuracy

Upon evaluation, AudioPaLM outperformed existing systems in speech translation by a significant margin. Read More

PaLM-2 and AudioLM

"Shared Vocabulary"

Top Headlines

0 Usability issues more in iOS than Android: Report

1 LinkedIn's new AI bot will generate 1st draft after you share brief outline

2 Human remains found at site where Julian Sands went missing

3 PM Modi claims no discrimination against Muslims in India; Really?

4 Outrage as 'Secular', 'Socialist' miss from Preamble in Telangana textbook

5 Viral Video: Pak Cricketer Rizwan spotted cleaning Mataf area of Haram

6 Manipur Violence: Over 65K Displaced, Students Worst Hit

7 India of Cong or of BJP/RSS, up to people to decide: Sam Pitroda

8 ‘Perspectives’: Google Search's new filter starts rolling out

9 Kabir Garg who helped run dark web child abuse site in UK jailed

Top Stories

Simmering conflict between Moscow's military leadership and Yevgeny Prigozhin has exploded

Also Read

Wagner mercenaries on rampage in Russia; Betrayal, says Putin

More Stories

Heavy exchanges of fire reported from two Manipur districts

Also Read

"Like Libya, Lebanon, Nigeria, Syria": Top Army Veteran on Manipur ethnic violence

Eid al Adha 2023 Moon Sighted in India, Pakistan and Bangladesh

Also Read

Eid al Adha 2023 in Saudi Arabia on June 28, Youm e Arafat on June 27