Textual Ciphers as a Tool for Better Understanding the Transformers

Provazník, Jan

Textové šifry jako nástroj pro lepší pochopení modelů Transformer

bachelor thesis (DEFENDED)

View/Open

Záznam o průběhu obhajoby (347.4Kb)

Permanent link

http://hdl.handle.net/20.500.11956/192072

Identifiers

Study Information System: 262108

Referee

Kasner, Zdeněk

Faculty / Institute

Faculty of Mathematics and Physics

Discipline

Computer Science with specialisation in Artificial Intelligence

Department

Institute of Formal and Applied Linguistics

Date of defense

28. 6. 2024

Publisher

Univerzita Karlova, Matematicko-fyzikální fakulta

Language

English

Grade

Excellent

Keywords (Czech)

Transformer|interpretovatelnost|NLP|deep learning|šifry

Keywords (English)

Transformer|interpretability|NLP|deep learning|ciphers

Architektura Transformer je velmi populární, takže může být potenciálně významné interpretovat, co ovlivňuje její výkon. Testujeme hypotézu, že mo- del se při práci s textem spoléhá na jeho lingvistické vlastnosti. Abychom eli- minovali vliv kultury na význam, používáme úlohu pracující na úrovni znaků s Transformer modelem ByT5. Dotrénujeme ByT5-small na dešifrování vět zašifrovaných pomocí textových šifer (Vigenère, Enigma). Anotujeme eva- luační dataset vět pomocí publikovaných nástrojů pro NLP. Na evaluačním datasetu zkoumáme vztahy mezi lingvistickými vlastnostmi a četností chyb dotrénovaného ByT5 při dešifrování vět. Analyzujeme korelace, trénujeme ML modely na predikci četnosti chyb věty z jijích lingvistických vlastností a interpretujeme důležitost vlastností pomocí SHAP. Nacházíme malé signifi- kantní korelace, ale predikce četnosti chyb z vlastností selhává. Dospíváme k závěru, že identifikované vlastnosti neposkytují vhled do výkonu Transfor- merů.

Abstract (English)

The Transformer architecture is very popular, so it is potentially im- pactful to interpret what influences its performance. We test the hypothesis that the model relies on the linguistic properties of a text when working with it. We remove interference with cultural aspects of meaning by using a character-level task with the ByT5 Transformer model. We train ByT5 to decipher sentences encrypted with text ciphers (Vigenère, Enigma). We annotate a sentence dataset with linguistic properties with published NLP tools. On this dataset, we study the relationships between the linguistic properties and the fine-tuned ByT5 decipherment error rate. We analyze correlations, train ML models to predict error rates from the properties and interpret them with SHAP. We find small significant correlations but can- not predict error rates from the properties. We conclude the properties we identified do not give much insight into the performance of the Transformer.

Citace dokumentu

Metadata

Show full item record