Show simple item record

dc.contributor.author: Kvapilíková, Ivana
dc.date.accessioned: 2025-06-16T13:56:13Z
dc.date.available: 2025-06-16T13:56:13Z
dc.date.issued: 2025-06
dc.identifier.isbn: 9788024660783
dc.identifier.uri: http://hdl.handle.net/20.500.11956/198746
dc.description.abstract: For decades, machine translation between natural languages fundamentally relied on human-translated documents known as parallel texts, which provide direct correspondences between source and target sentences. The notion that translation systems could be trained on non-parallel texts, written independently in different languages, was long considered unrealistic. Fast forward to the era of large language models (LLMs), and we now know that, given sufficient computational resources, LLMs exploit incidental parallelism in their vast training data, i.e., they identify parallel messages across languages and learn to translate without explicit supervision. LLMs have since demonstrated the ability to perform translation tasks with impressive quality, rivaling systems trained specifically for translation. This monograph explores the fascinating journey that led to this point, focusing on the development of unsupervised machine translation. Long before the rise of LLMs, researchers were exploring the idea that translation could be achieved without parallel data. Their efforts centered on motivating models to discover cross-lingual correspondences through techniques such as the mapping of word embedding spaces, back-translation, and parallel sentence mining. Although much of the research described in this monograph predates the mainstream adoption of LLMs, the insights gained remain highly relevant: they offer a foundation for understanding how and why LLMs are able to translate. [en]
dc.language.iso: en
dc.publisher: Nakladatelství Karolinum [cs]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: linguistics [en]
dc.subject: translation [en]
dc.subject: language [en]
dc.subject: LLM [en]
dc.subject: machine translation [en]
dc.title: Unsupervised Machine Translation: How Machines Learn to Understand Across Languages [en]
dc.type: kniha [cs_CZ]
dc.type: book [en_US]
dcterms.accessRights: openAccess
dcterms.extent: 176
dc.publisher.publicationPlace: Praha [cs]
uk.internal-type: uk_publication
oaire.fundingReference.awardNumber: 19-26934X
oaire.fundingReference.funderName: Grantová agentura České republiky [cs]
oaire.fundingReference.fundingStream: Neural Representations in Multi-modal and Multi-lingual Modeling [en]
dc.identifier.isbnPDF: 9788024660844



Except where otherwise noted, this item's license is described as https://creativecommons.org/licenses/by/4.0/
