
Speech to speech translation with translatotron : a state of the art review

dc.contributor.author: Kala, Jules R.
dc.contributor.author: Adetiba, Emmanuel
dc.contributor.author: Abayom, Abdultaofeek
dc.contributor.author: Dare, Oluwatobi E.
dc.contributor.author: Ifijeh, Ayodele H.
dc.date.accessioned: 2025-12-23T11:34:27Z
dc.date.available: 2025-12-23T11:34:27Z
dc.date.issued: 2025-2-9
dc.date.updated: 2025-03-06T13:08:44Z
dc.description.abstract: Cascade-based speech-to-speech translation has long been considered the benchmark, but it suffers from several issues, such as compound errors and the time taken to translate speech from one language to another. These issues arise because a cascade-based method chains several methods: speech recognition, speech-to-text translation, and finally text-to-speech translation. Translatotron, a sequence-to-sequence direct speech-to-speech translation model, was designed by Google to address the compound errors associated with the cascade model. Today there are three versions of the Translatotron model: Translatotron 1, Translatotron 2, and Translatotron 3. The first version was designed as a proof of concept to show that direct speech-to-speech translation was possible; it was found to be less effective than the cascade model but produced promising results. Translatotron 2 was an improved version of Translatotron 1, with results similar to the cascade model. Translatotron 3, the latest version of the model, is better than the cascade model in some respects. In this paper, a complete review of speech-to-speech translation will be presented, with a particular focus on all the versions of the Translatotron models. We will also show that Translatotron is the best model to bridge the language gap between African languages and other well-formalized languages.
dc.format.extent: 13 p
dc.identifier.citation: Kala, J.R. et al. 2025. Speech to speech translation with translatotron: a state of the art review. 1-13.
dc.identifier.uri: https://hdl.handle.net/10321/6323
dc.language.iso: en
dc.rights: Attribution 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: Translatotron
dc.subject: BLEU
dc.subject: Cascade
dc.subject: Speech-to-Speech
dc.title: Speech to speech translation with translatotron : a state of the art review
dc.type: Other
local.sdg: SDG09

Files

Original bundle

Name: 2502.05980v2.pdf
Size: 32 MB
Format: Adobe Portable Document Format
Description: Published version

Name: Kala_Adetiba et al_2025.pdf
Size: 32 MB
Format: Adobe Portable Document Format