Some sub-corpora have been annotated for part of speech (PoS): WUS_DIALOG_GSW, WUS_FRA, WUS_FRA_DEMOG, WUS_ITA and WUS_ITA_DEMOG.
The whole French corpus has been annotated with MElt (Modified French TreeBank) using the CC tagset. The available annotations are "mftb_pos" (part of speech) and "mftb_lem" (lemma). The following tags are used:
ADJ  adjective
ADJWH  interrogative adjective
ADV  adverb
ADVWH  interrogative adverb
CC  coordinating conjunction
CLO  object clitic pronoun
CLR  reflexive clitic pronoun
CLS  subject clitic pronoun
CS  subordinating conjunction
DET  determiner
DETWH  interrogative determiner
ET  foreign word
I  interjection
NC  common noun
NPP  proper noun
P  preposition
P+D  preposition+determiner amalgam
P+PRO  preposition+pronoun amalgam
PONCT  punctuation mark
PREF  prefix
PRO  full pronoun
PROREL  relative pronoun
PROWH  interrogative pronoun
V  indicative or conditional verb form
VIMP  imperative verb form
VINF  infinitive verb form
VPP  past participle
VPR  present participle
VS  subjunctive verb form

Five chats of the Swiss German dialectal data (34,683 tokens) have been manually normalized and annotated for part of speech. The corresponding corpus is called WUS_DIALOG_GSW. Three annotations have been added to each token:
The tagset uses the following tags:
ADJA  attributive adjective (including participles used adjectivally)
ADJD  predicate adjective; adjective used adverbially
ADV  adverb (never used as attributive adjective)
APPR  preposition; left hand part of double preposition
APPRART  preposition with fused article
APPO  postposition
APZR  right hand part of double preposition
ART  article (definite or indefinite)
CARD  cardinal number (words or figures); also declined
FM  foreign words (actual part of speech in original language may be appended, e.g. FMADV/FM-NN)
ITJ  interjection
KON  co-ordinating conjunction
KOKOM  comparative conjunction or particle
KOUI  preposition used to introduce infinitive clause
KOUS  subordinating conjunction
NA  adjective used as noun
NE  names and other proper nouns
NN  noun (but not adjectives used as nouns)
PAV [PROAV]  pronominal adverb
PAVREL  pronominal adverb used as relative
PDAT  demonstrative determiner
PDS  demonstrative pronoun
PIAT  indefinite determiner (whether occurring on its own or in conjunction with another determiner)
PIS  indefinite pronoun
PPER  personal pronoun
PRF  reflexive pronoun
PPOSS  possessive pronoun
PPOSAT  possessive determiner
PRELAT  relative depending on a noun
PRELS  relative pronoun (i.e. forms of der or welcher)
PTKA  particle with adjective or adverb
PTKANT  answer particle
PTKNEG  negative particle
PTKREL  indeclinable relative particle
PTKVZ  separable prefix
PTKZU  infinitive particle zu
PWS  interrogative pronoun
PWAT  interrogative determiner
PWAV  interrogative adverb
PWAVREL  interrogative adverb used as relative
PWREL  interrogative pronoun used as relative
TRUNC  truncated form of compound
VAFIN  finite auxiliary verb
VAIMP  imperative of auxiliary
VAINF  infinitive of auxiliary
VAPP  past participle of auxiliary
VMFIN  finite modal verb
VMINF  infinitive of modal
VMPP  past participle of modal
VVFIN  finite full verb
VVIMP  imperative of full verb
VVINF  infinitive of full verb
VVIZU  infinitive with incorporated zu
VVPP  past participle of full verb

As in the French corpus, there are also combined tags such as VAFIN+PPER when a personal pronoun is agglutinated to a verb (hätti for 'hätte ich').
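
When querying the annotations, a combined tag can be broken back into its component tags. The following is only a minimal Python sketch, assuming tags are available as plain strings and that "+" is used solely as the joining character; the function name split_combined_tag is hypothetical and not part of any corpus tooling.

```python
def split_combined_tag(tag: str) -> list[str]:
    """Split a combined PoS tag such as 'VAFIN+PPER' into its parts.

    Assumption: '+' only ever joins component tags. A simple tag
    without '+' is returned as a one-element list.
    """
    return [part for part in tag.split("+") if part]


if __name__ == "__main__":
    print(split_combined_tag("VAFIN+PPER"))  # ['VAFIN', 'PPER']
    print(split_combined_tag("NN"))          # ['NN']
```

Note that applying the same split to the French amalgam tags P+D and P+PRO yields "P" and "D" or "PRO", so whether splitting is appropriate depends on the query.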
The Italian corpus is annotated with the TreeTagger, too, but based on the original tokens, i.e. not manually normalized.
The following PoS tagset was used:
ABR  abbreviation
ADJ  adjective
ADV  adverb
CON  conjunction
DET:def  definite article
DET:indef  indefinite article
FW  foreign word
INT  interjection
LS  list symbol
NOM  noun
NPR  name
NUM  numeral
PON  punctuation
PRE  preposition
PRE:det  preposition+article
PRO  pronoun
PRO:demo  demonstrative pronoun
PRO:indef  indefinite pronoun
PRO:inter  interrogative pronoun
PRO:pers  personal pronoun
PRO:poss  possessive pronoun
PRO:refl  reflexive pronoun
PRO:rela  relative pronoun
SENT  sentence marker
SYM  symbol
VER:cimp  verb conjunctive imperfect
VER:cond  verb conditional
VER:cpre  verb conjunctive present
VER:futu  verb future tense
VER:geru  verb gerund
VER:impe  verb imperative
VER:impf  verb imperfect
VER:infi  verb infinitive
VER:pper  verb participle perfect
VER:ppre  verb participle present
VER:pres  verb present
VER:refl:infi  verb reflexive infinitive
VER:remo  verb simple past
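
Many Italian tags consist of a coarse category followed by one or more subtypes separated by ":" (for example DET:def or VER:refl:infi). The following Python sketch shows one way to separate the two, for instance to count all verbs regardless of tense. It assumes ":" is used only as this separator; the function name split_tag is hypothetical.

```python
def split_tag(tag: str) -> tuple[str, list[str]]:
    """Split an Italian PoS tag into (coarse category, list of subtypes).

    Assumption: ':' only separates the coarse category from its subtypes.
    """
    coarse, *subtypes = tag.split(":")
    return coarse, subtypes


if __name__ == "__main__":
    print(split_tag("VER:refl:infi"))  # ('VER', ['refl', 'infi'])
    print(split_tag("ADV"))            # ('ADV', [])
```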