Speech to speech translation with translatotron : a state of the art review
| dc.contributor.author | Kala, Jules R. | en_US |
| dc.contributor.author | Adetiba, Emmanuel | en_US |
| dc.contributor.author | Abayom, Abdultaofeek | en_US |
| dc.contributor.author | Dare, Oluwatobi E. | en_US |
| dc.contributor.author | Ifijeh, Ayodele H. | en_US |
| dc.date.accessioned | 2025-03-08T17:53:43Z | |
| dc.date.available | 2025-03-08T17:53:43Z | |
| dc.date.issued | 2025-02-09 | |
| dc.date.updated | 2025-03-06T13:09:23Z | |
| dc.description.abstract | A cascade-based speech-to-speech translation has been considered a benchmark for a very long time, but it is plagued by many issues, like the time taken to translate a speech from one language to another and compound errors. These issues are because a cascade-based method uses a combination of methods such as speech recognition, speech-to-text translation, and finally, text-to-speech trans lation. Translatotron, a sequence-to-sequence direct speech-to-speech translation model was designed by Google to address the issues of compound errors associated with cascade model. Today there are 3 versions of the Translatotron model: Trans latotron 1, Translatotron 2, and Translatotron3. The first version was designed as a proof of concept to show that a direct speech-to-speech translation was possible, it was found to be less effective than the cascade model but was producing promising results. Translatotron2 was an improved version of Translatotron 1 with results sim ilar to the cascade model. Translatotron 3 the latest version of the model is better than the cascade model at some points. In this paper, a complete review of speech to-speech translation will be presented, with a particular focus on all the versions of Translatotron models. We will also show that Translatotron is the best model to bridge the language gap between African Languages and other well-formalized languages. | en_US |
| dc.format.extent | 13 p | en_US |
| dc.identifier.citation | Kala, J.R. et al. 2025. Speech to speech translation with translatotron: a state of the art review. arXiv preprint arXiv:2502.05980. | en_US |
| dc.identifier.uri | https://hdl.handle.net/10321/5838 | |
| dc.language.iso | en | en_US |
| dc.subject | Translatotron | en_US |
| dc.subject | BLEU | en_US |
| dc.subject | Cascade | en_US |
| dc.subject | Speech-to-speech | en_US |
| dc.title | Speech to speech translation with translatotron : a state of the art review | en_US |
| dc.type | Other | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Kala_Adetiba et al_2025.pdf
- Size:
- 32 MB
- Format:
- Adobe Portable Document Format
- Description:
