User Tools

Site Tools


01_corpus:02_preprocessing:06_pos

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revisionBoth sides next revision
01_corpus:02_preprocessing:06_pos [2020/04/16 17:50] – [Swiss German dialect] simone01_corpus:02_preprocessing:06_pos [2020/04/17 11:09] – [Italian] simone
Line 107: Line 107:
  
 ===== Italian ===== ===== Italian =====
-The Italian corpus is annotated with the [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/|TreeTagger]], too, but based on the original tokens, i.e. not manually normalized. In this sub-corpus, however, only some parts were manually normalized resulting in the following three annotations:+The Italian corpus is annotated with the [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/|TreeTagger]], too, but based on the original tokens, i.e. not manually normalized. 
  
-   * gloss: The manual normalization (often _UNGLOSSED_) 
    * tt_pos: Part of Speech annotation with TreeTagger    * tt_pos: Part of Speech annotation with TreeTagger
    * tt_lem: The lemma as assigned by TreeTagger    * tt_lem: The lemma as assigned by TreeTagger
01_corpus/02_preprocessing/06_pos.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki