User Tools

Site Tools


01_corpus:02_preprocessing:06_pos

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
01_corpus:02_preprocessing:06_pos [2020/04/16 17:48] simone01_corpus:02_preprocessing:06_pos [2020/04/17 11:09] – [Italian] simone
Line 48: Line 48:
 The [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/stts_guide.pdf|tagset]] uses the following tags: The [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/stts_guide.pdf|tagset]] uses the following tags:
  
-  * ADJA attributive adjective (including participles used adjectivally) //das große Haus die versunkene Glocke// +  * ADJA attributive adjective (including participles used adjectivally)  
-  * ADJD predicate adjective; adjective used adverbially //der Vogel ist blau er fährt schnell// +  * ADJD predicate adjective; adjective used adverbially  
-  * ADV adverb (never used as attributive adjective) //sie kommt bald// +  * ADV adverb (never used as attributive adjective)  
-  * APPR preposition left hand part of double preposition //auf dem Tisch an der Straße entlang// +  * APPR preposition left hand part of double preposition  
-  * APPRART preposition with fused article //am Tag// +  * APPRART preposition with fused article  
-  * APPO postposition //meiner Meinung nach// +  * APPO postposition  
-  * APZR right hand part of double preposition //an der Straße entlang// +  * APZR right hand part of double preposition  
-  * ART article (definite or indefinite) //die Tante; eine Tante// +  * ART article (definite or indefinite)  
-  * CARD cardinal number (words or figures); also declined //zwei; 526; dreier// +  * CARD cardinal number (words or figures); also declined  
-  * FM foreign words (actual part of speech in original language may be appended, e.g. FMADV/ FM-NN) //semper fidem// +  * FM foreign words (actual part of speech in original language may be appended, e.g. FMADV/ FM-NN)  
-  * ITJ interjection //Ach!// +  * ITJ interjection  
-  * KON co-ordinating conjunction //oder ich bezahle nicht// +  * KON co-ordinating conjunction  
-  * KOKOM comparative conjunction or particle //er arbeitet als Straßenfeger, so gut wie du// +  * KOKOM comparative conjunction or particle  
-  * KOUI preposition used to introduce infinitive clause //um den König zu töten// +  * KOUI preposition used to introduce infinitive clause  
-  * KOUS subordinating conjunction //weil er sie gesehen hat// +  * KOUS subordinating conjunction  
-  * NA adjective used as noun //der Gesandte// +  * NA adjective used as noun  
-  * NE names and other proper nouns //Moskau// +  * NE names and other proper nouns  
-  * NN noun (but not adjectives used as nouns) //der Abend// +  * NN noun (but not adjectives used as nouns)  
-  * PAV [PROAV] pronominal adverb //sie spielt damit// +  * PAV [PROAV] pronominal adverb  
-  * PAVREL pronominal adverb used as relative //die Puppe, damit sie spielt// +  * PAVREL pronominal adverb used as relative  
-  * PDAT demonstrative determiner //dieser Mann war schlecht// +  * PDAT demonstrative determiner  
-  * PDS demonstrative pronoun //dieser war schlecht// +  * PDS demonstrative pronoun  
-  * PIAT indefinite determiner (whether occurring on its own or in conjunction with another determiner) //einige Wochen, viele solche Bemerkungen// +  * PIAT indefinite determiner (whether occurring on its own or in conjunction with another determiner)  
-  * PIS indefinite pronoun //sie hat viele gesehen// +  * PIS indefinite pronoun  
-  * PPER personal pronoun //sie liebt mich// +  * PPER personal pronoun  
-  * PRF reflexive pronoun //ich wasche mich, sie wäscht sich// +  * PRF reflexive pronoun  
-  * PPOSS possessive pronoun //das ist meins// +  * PPOSS possessive pronoun  
-  * PPOSAT possessive determiner //mein Buch, das ist der meine/meinige// +  * PPOSAT possessive determiner  
-  * PRELAT relative depending on a noun //der Mann, dessen Lied ich singe […], welchen Begriff ich nicht verstehe// +  * PRELAT relative depending on a noun  
-  * PRELS relative pronoun (i.e. forms of der or welcher) //der Herr, der gerade kommt; der Herr, welcher nun kommt// +  * PRELS relative pronoun (i.e. forms of der or welcher)  
-  * PTKA particle with adjective or adverb //am besten, zu schnell, aufs herzlichste// +  * PTKA particle with adjective or adverb  
-  * PTKANT answer particle //ja, nein// +  * PTKANT answer particle  
-  * PTKNEG negative particle //nicht// +  * PTKNEG negative particle  
-  * PTKREL indeclinable relative particle //so// +  * PTKREL indeclinable relative particle  
-  * PTKVZ separable prefix //sie kommt an//+  * PTKVZ separable prefix
   * PTKZU infinitive particle zu   * PTKZU infinitive particle zu
-  * PWS interrogative pronoun //wer kommt?// +  * PWS interrogative pronoun  
-  * PWAT interrogative determiner //welche Farbe?// +  * PWAT interrogative determiner  
-  * PWAV interrogative adverb //wann kommst du?// +  * PWAV interrogative adverb  
-  * PWAVREL interrogative adverb used as relative //der Zaun, worüber sie springt// +  * PWAVREL interrogative adverb used as relative  
-  * PWREL interrogative pronoun used as relative //etwas, was er sieht// +  * PWREL interrogative pronoun used as relative  
-  * TRUNC truncated form of compound //Vor- und Nachteile// +  * TRUNC truncated form of compound  
-  * VAFIN finite auxiliary verb //sie ist gekommen// +  * VAFIN finite auxiliary verb  
-  * VAIMP imperative of auxiliary //sei still!// +  * VAIMP imperative of auxiliary  
-  * VAINF infinitive of auxiliary //er wird es gesehen haben// +  * VAINF infinitive of auxiliary  
-  * VAPP past participle of auxiliary //sie ist es gewesen// +  * VAPP past participle of auxiliary  
-  * VMFIN finite modal verb //sie will kommen// +  * VMFIN finite modal verb  
-  * VMINF infinitive of modal //er hat es sehen müssen// +  * VMINF infinitive of modal  
-  * VMPP past participle of auxiliary //sie hat es gekonnt// +  * VMPP past participle of auxiliary  
-  * VVFIN finite full verb //sie ist gekommen// +  * VVFIN finite full verb  
-  * VVIMP imperative of full verb //bleibt da!// +  * VVIMP imperative of full verb  
-  * VVINF infinitive of full verb //er wird es sehen// +  * VVINF infinitive of full verb  
-  * VVIZU infinitive with incorporated zu //sie versprach aufzuhören// +  * VVIZU infinitive with incorporated zu  
-  * VVPP past participle of full verb //sie ist gekommen//+  * VVPP past participle of full verb
  
 As in the French corpus, there are also combined tags such as //VAFIN+PPER// when a personal pronoun is agglutinated to a verb (//hätti// for 'hätte ich'). As in the French corpus, there are also combined tags such as //VAFIN+PPER// when a personal pronoun is agglutinated to a verb (//hätti// for 'hätte ich').
Line 107: Line 107:
  
 ===== Italian ===== ===== Italian =====
-The Italian corpus is annotated with the [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/|TreeTagger]], too, but based on the original tokens, i.e. not manually normalized. In this sub-corpus, however, only some parts were manually normalized resulting in the following three annotations:+The Italian corpus is annotated with the [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/|TreeTagger]], too, but based on the original tokens, i.e. not manually normalized. 
  
-   * gloss: The manual normalization (often _UNGLOSSED_) 
    * tt_pos: Part of Speech annotation with TreeTagger    * tt_pos: Part of Speech annotation with TreeTagger
    * tt_lem: The lemma as assigned by TreeTagger    * tt_lem: The lemma as assigned by TreeTagger
01_corpus/02_preprocessing/06_pos.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki