Vyhledávání frazémů na základě anomálie v distribuci tvarů

Dittrichová, Anna

Finding idioms based on anomalous word-form distribution

diploma thesis (DEFENDED)

View/Open

Záznam o průběhu obhajoby (334.4Kb)

Permanent link

http://hdl.handle.net/20.500.11956/188429

Identifiers

Study Information System: 254902

Referee

Bozděchová, Ivana

Faculty / Institute

Faculty of Arts

Discipline

Empirical and Comparative Linguistics

Department

Institute of Czech Language and Theory of Communication

Date of defense

5. 2. 2024

Publisher

Univerzita Karlova, Filozofická fakulta

Language

Czech

Grade

Excellent

Keywords (Czech)

Keywords (English)

frazémů, a to na základě dat poskytovaných aplikací GramatiKat, která dokáže identifikovat anomální distribucí tvarů. Cílem práce je zjistit, zda gr anomálie substantiv (tedy nezvykle vysoká frekvence jednoho či více tvarů v morfologického paradigmatu) ukazují na formálně anomální víceslovné lexémy (tedy frazémy), případně jak se tato souvislost liší v rámci jednotlivých pádů a jaké typy frazémů se jednotlivých pádech jednotného čísla objevují nejčastěji. Z analýzy, která využívá jazykové SYNv11, vyplynulo, že na jednom či více frazémech se podílí celkem % analyzovaných lemmat, přičemž nejčastěji se jedná o akuzativ (až 88 %), nejméně naopak %). Ve zkoumaných datech jsou zastoupeny skupiny frazémů se stejnými vlastnostmi, například v dativu se často jednalo o verbální frazémy, ve vokativu o

Abstract (English)

This diploma thesis deals with the search, description, classification, and dictionary comparison of idioms based on data provided by GramatiKat application, which can identify nominal lemmas with anomalous word-form distribution. The aim of the diploma thesis is to determine whether grammatical anomalies in nouns (unusually high frequencies of one or more forms within a morphological paradigm) indicate formally anomalous multi-word lexemes (idioms) and how this relationship varies across different cases. Additionally, it explores the types of idioms that most commonly appear in individual cases of the singular. The analysis, utilizing SYN2015 a SYNv11 corpora, revealed that 28 % of analyzed lemmas are part of one or more idioms. The most common case is the accusative (88 %), while the least common is the vocative (5 %). The analysis also identified various groups of idioms with similar characteristics. For instance, verbal idioms were frequently observed in the dative, contact idioms predominated in the vocative, and grammatical idioms were prevalent in the locative.

Citace dokumentu

Metadata

Show full item record