Obsah a značkování diachronního korpusu češtiny
The Content and Annotation of the Diachronic Corpus of Czech
Vědecký článek
View/ Open
Permanent link
http://hdl.handle.net/20.500.11956/96551Identifiers
Collections
- Číslo 1 [8]
Issue Date
2015Publisher
Univerzita Karlova, Filozofická fakultaPraha
Source document
Časopis pro moderní filologii (Journal for Modern Philology) (web)ISSN: 2336-6591
Periodical publication year: 2015
Periodical Volume: 2015
Periodical Issue: 1
Link to license terms
https://creativecommons.org/licenses/by-nc-nd/2.0/Keywords (Czech)
diachronní korpus, korpusový manažer, vertikální text, frekvenceKeywords (English)
diachronic corpus, corpus manager, vertical format, frequencyThe paper discusses what kind of content and annotation should be included in the diachronic corpus of Old Czech. Based on his analysis of the current state of DIAKORP and the Old Czech Text Bank the author suggests solutions for how to treat the critical apparatus, foreign words in historical Czech texts and contemporaneous or later marginal or interlinear notes. He also discusses some aspects of the methodology of statistics computation in the diachronic corpus.