01_corpus:02_preprocessing:04_languages
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionLast revisionBoth sides next revision | ||
01_corpus:02_preprocessing:04_languages [2020/04/16 16:42] – simone | 01_corpus:02_preprocessing:04_languages [2020/05/04 13:51] – simone | ||
---|---|---|---|
Line 2: | Line 2: | ||
===== Languages and varieties per chat ===== | ===== Languages and varieties per chat ===== | ||
- | In order to assign a language tagging to each chat, we looked the first 250 messages and assigned two possible attributes per language: | + | In order to assign a language tagging to each chat, we looked |
* lang_100_and_more: | * lang_100_and_more: | ||
Line 22: | Line 22: | ||
For an overview over languages and varieties in the corpus consult: | For an overview over languages and varieties in the corpus consult: | ||
- | Ueberwasser, | + | Ueberwasser, |
- | ===== 1.3.5 Languages and varieties per message ===== | + | ===== Languages and varieties per message ===== |
The information of the main language of a message is saved in the annotation // | The information of the main language of a message is saved in the annotation // | ||
01_corpus/02_preprocessing/04_languages.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1