Discourse relations of the Prague Discourse Treebank in Universal Dependencies

Pribytkova, Olga

Diskurzní vztahy Pražského diskurzního korpusu v Universal Dependencies

bachelor thesis (DEFENDED)

View/Open

Záznam o průběhu obhajoby (347.1Kb)

Permanent link

http://hdl.handle.net/20.500.11956/200828

Identifiers

Study Information System: 277862

Consultant

Poláková, Lucie

Referee

Kuboň, Vladislav

Faculty / Institute

Faculty of Mathematics and Physics

Discipline

Computer Science with specialisation in Artificial Intelligence

Department

Institute of Formal and Applied Linguistics

Date of defense

20. 6. 2025

Publisher

Univerzita Karlova, Matematicko-fyzikální fakulta

Language

English

Grade

Excellent

Keywords (Czech)

diskurzní vztahy|Universal Dependencies|strojové učení

Keywords (English)

discourse relations|Universal Dependencies|machine learning

Tato diplomová práce navrhuje nový přístup k integraci anotace diskurzních vztahů Prague Discourse Treebanku do rámce Universal Dependencies. Spojením anotací PDiT transformovaných do formátu podobného PDTB se syntaktickými daty UD generovanými pomocí UDPipe jsme vytvořili jednotnou reprezentaci, která spojuje diskurzní vztahy s jejich odpovídajícími syntaktickými strukturami. Následně jsme provedli experimenty strojového učení v klasifikaci diskurzních typů s využitím tohoto nového formátu, při nichž jsme hodnotili příspěvky jednotlivých rysů a výkon modelů, zdůraznili jsme výhody a výzvy navrhovaného přístupu a položili základy pro další pokroky v automatické analýze diskurzu1 . 1 Tato česká verze abstraktu byla přeložená z anglického abstraktu pomocí strojového překladu s ručními úpravami.

Abstract (English)

This thesis introduces a novel approach for integrating discourse relation annotations from the Prague Discourse Treebank into the Universal Dependencies framework. By aligning PDiT annotations transformed into a PDTB-like format with UD's syntactic data generated by UDPipe, we have created a unified representation that links discourse relations to their corresponding syntactic structures. We then conducted machine learn- ing experiments in discourse type classification using this new format, evaluating feature contribution and performance, highlighting the benefits and challenges of the proposed approach and paving the way for further advancements in computational discourse anal- ysis.

Citace dokumentu

Metadata

Show full item record