Some sub-corpora have been annotated with Part Of Speech annotations. This concerns WUS_DIALOG_GSW, WUS_FRA, WUS_FRA_DEMOG, WUS_ITA, WUS_ITA_DEMOG.
The whole French corpus has been annotated with MElt (Modified French TreeBank) using the tag set CC Tagset. Available annotations are "mftb_pos" (for part of speech) and "mftb_lem" (for the lemma). The following tags are used:
ADJ
adjectiveADJWH
interrogative adjectiveADV
adverbADVWH
interrogative adverbCC
coordinating conjunctionCLO
object clitic pronounCLR
reflexive clitic pronounCLS
subject clitic pronounCS
subordinating conjunctionDET
determinerDETWH
interrogative determinerET
foreign wordI
interjectionNC
common nounNPP
proper nounP
prepositionP+D
preposition+determiner amalgamP+PRO
prepositon+pronoun amalgamPONCT
punctuation markPREF
prefixPRO
full pronounPROREL
relative pronounPROWH
interrogative pronounV
indicative or conditional verb formVIMP
imperative verb formVINF
infinitive verb formVPP
past participleVPR
present participleVS
subjunctive verb formFive chats of the Swiss German dialectal data (34,683 tokens) have been manually normalized and annotated for Part of Speech. The according corpus is called WUS_DIALOG_GSW. Three annotations have been added to each token:
The tagset uses the following tags:
ADJA
attributive adjective (including participles used adjectivally) ADJD
predicate adjective; adjective used adverbially ADV
adverb (never used as attributive adjective) APPR
preposition left hand part of double preposition APPRART
preposition with fused article APPO
postposition APZR
right hand part of double preposition ART
article (definite or indefinite) CARD
cardinal number (words or figures); also declined FM
foreign words (actual part of speech in original language may be appended, e.g. FMADV/ FM-NN) ITJ
interjection KON
co-ordinating conjunction KOKOM
comparative conjunction or particle KOUI
preposition used to introduce infinitive clause KOUS
subordinating conjunction NA
adjective used as noun NE
names and other proper nouns NN
noun (but not adjectives used as nouns) PAV [PROAV]
pronominal adverb PAVREL
pronominal adverb used as relative PDAT
demonstrative determiner PDS
demonstrative pronoun PIAT
indefinite determiner (whether occurring on its own or in conjunction with another determiner) PIS
indefinite pronoun PPER
personal pronoun PRF
reflexive pronoun PPOSS
possessive pronoun PPOSAT
possessive determiner PRELAT
relative depending on a noun PRELS
relative pronoun (i.e. forms of der or welcher) PTKA
particle with adjective or adverb PTKANT
answer particle PTKNEG
negative particle PTKREL
indeclinable relative particle PTKVZ
separable prefix PTKZU
infinitive particle zuPWS
interrogative pronoun PWAT
interrogative determiner PWAV
interrogative adverb PWAVREL
interrogative adverb used as relative PWREL
interrogative pronoun used as relative TRUNC
truncated form of compound VAFIN
finite auxiliary verb VAIMP
imperative of auxiliary VAINF
infinitive of auxiliary VAPP
past participle of auxiliary VMFIN
finite modal verb VMINF
infinitive of modal VMPP
past participle of auxiliary VVFIN
finite full verb VVIMP
imperative of full verb VVINF
infinitive of full verb VVIZU
infinitive with incorporated zu VVPP
past participle of full verb As in the French corpus, there are also combined tags such as VAFIN+PPER when a personal pronoun is agglutinated to a verb (hätti for 'hätte ich').
The Italian corpus is annotated with the TreeTagger, too, but based on the original tokens, i.e. not manually normalized.
The following PoS tagset was used:
ABR
abbreviationADJ
adjectiveADV
adverbCON
conjunctionDET:def
definite articleDET:indef
indefinite articleFW
foreign wordINT
interjectionLS
list symbolNOM
nounNPR
nameNUM
numeralPON
punctuationPRE
prepositionPRE:det
preposition+articlePRO
pronounPRO:demo
demonstrative pronounPRO:indef
indefinite pronounPRO:inter
interrogative pronounPRO:pers
personal pronounPRO:poss
possessive pronounPRO:refl
reflexive pronounPRO:rela
relative pronounSENT
sentence markerSYM
symbolVER:cimp
verb conjunctive imperfectVER:cond
verb conditionalVER:cpre
verb conjunctive presentVER:futu
verb future tenseVER:geru
verb gerundVER:impe
verb imperativeVER:impf
verb imperfectVER:infi
verb infinitiveVER:pper
verb participle perfectVER:ppre
verb participle presentVER:pres
verb presentVER:refl:infi
verb reflexive infinitiveVER:remo
verb simple past