incPres
No Result
View All Result
No Result
View All Result
incPres
No Result
View All Result
Home Global

Introducing SeamlessM4T, a Multimodal AI Model for Speech and Text Translations

photoact by photoact
August 22, 2023
in Global
Reading Time: 2 mins read
0
Seamless-M4T-Translation-Model_Thumbnail-1.jpg
Share on FacebookShare on Twitter


The world we live in has never been more interconnected, giving people access to more multilingual content than ever before. This also makes the ability to communicate and understand information in any language increasingly important.

Today, we’re introducing SeamlessM4T, the first all-in-one multimodal and multilingual AI translation model that allows people to communicate effortlessly through speech and text across different languages. SeamlessM4T supports:

  • Speech recognition for nearly 100 languages
  • Speech-to-text translation for nearly 100 input and output languages
  • Speech-to-speech translation, supporting nearly 100 input languages and 36 (including English) output languages
  • Text-to-text translation for nearly 100 languages
  • Text-to-speech translation, supporting nearly 100 input languages and 35 (including English) output languages

In keeping with our approach to open science, we’re publicly releasing SeamlessM4T under a research license to allow researchers and developers to build on this work. We’re also releasing the metadata of SeamlessAlign, the biggest open multimodal translation dataset to date, totaling 270,000 hours of mined speech and text alignments.

Building a universal language translator, like the fictional Babel Fish in The Hitchhiker’s Guide to the Galaxy, is challenging because existing speech-to-speech and speech-to-text systems only cover a small fraction of the world’s languages. But we believe the work we’re announcing today is a significant step forward in this journey. Compared to approaches using separate models, SeamlessM4T’s single system approach reduces errors and delays, increasing the efficiency and quality of the translation process. This enables people who speak different languages to communicate with each other more effectively.

SeamlessM4T builds on advancements we and others have made over the years in the quest to create a universal translator. Last year, we released No Language Left Behind (NLLB), a text-to-text machine translation model that supports 200 languages, and has since been integrated into Wikipedia as one of the translation providers. We also shared a demo of our Universal Speech Translator, which was the first direct speech-to-speech translation system for Hokkien, a language without a widely used writing system. And earlier this year, we revealed Massively Multilingual Speech, which provides speech recognition, language identification and speech synthesis technology across more than 1,100 languages.

SeamlessM4T draws on findings from all of these projects to enable a multilingual and multimodal translation experience stemming from a single model, built across a wide range of spoken data sources with state-of-the-art results.

This is only the latest step in our ongoing effort to build AI-powered technology that helps connect people across languages. In the future, we want to explore how this foundational model can enable new communication capabilities — ultimately bringing us closer to a world where everyone can be understood.

Learn more about SeamlessM4T on our AI blog.





Source link

Tags: Meta Platforms
ShareTweetSendSharePinShareSend

Related News

Privacy-Matters-AI_Header.jpg

Privacy Matters: Meta’s Generative AI Features

by photoact
September 27, 2023
0

Today, we’re launching new generative AI features, and like other Meta products, we took steps to protect your privacy and...

SuperNova_Social-Share.png

Introducing the New Ray-Ban | Meta Smart Glasses

by photoact
September 27, 2023
0

Today at Meta Connect, in partnership with EssilorLuxottica, we announced our next-generation Ray-Ban Meta smart glasses collection. We redesigned these...

Quest-3-Consumer-Highlights_Social-Share.png

Meet Meta Quest 3, Our Mixed Reality Headset Starting at $499.99

by photoact
September 27, 2023
0

Big news from the Connect stage: Meta Quest 3 hits shelves October 10, and pre-orders are open now! The world’s...

Additional-Facebook-Profiles_Header.jpg

You Can Now Have Multiple Personal Profiles on Facebook

by photoact
September 25, 2023
0

Sometimes separate is simpler, so we’re making it easier to customize your experience on Facebook with multiple personal profiles. Whether...

Impact-Is-Real_Social-Share.jpg

For Those Using Metaverse Technologies, the Impact Is Real

by photoact
September 22, 2023
0

Imagine if doctors could practice surgery in a realistic, risk-free way – with virtual reality (VR) that’s already possible and...

Meta-Verified-Business-Expansion_Header.jpg

Expanding Meta Verified to Businesses

by photoact
September 22, 2023
0

Earlier this year, we introduced Meta Verified for creators, a subscription bundle to help creators more easily establish their presence...

ADANI Adani Enterprises Aditya Birla Group Airtel Ambuja Cement Apollo Hospitals Apple Inc Bank of America Cigna Corporation Cognizant India Dabur India Dalmia Bharat Exxon Mobil FedEx Corp General Electric Godrej Group Hindustan Zinc Home Depot ICICI Bank Indian Bank Infosys KEC Inter Kubota Corporation L&T Mahindra & Mahindra Meta Platforms Microsoft National Aluminium Company Nestle India Royal Dutch Shell Samsung Global Saudi Aramco Standard Chartered Bank Steel Authority of India Sun Pharma Target Corporation Tata Consumer Products Tata Motors Tata Power Tata Steel Tcs UltraTech UltraTech Limited Verizon Communications Voltas Limited

    © 2022 incPres.com

    No Result
    View All Result

      © 2022 incPres.com