01_corpus:02_preprocessing:07_normalization
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionLast revisionBoth sides next revision | ||
corpus:04_annotations:03_normalization [2019/09/23 13:55] – ↷ Page name changed from corpus:04_annotations:02_normalization to corpus:04_annotations:03_normalization simone | 01_corpus:02_preprocessing:07_normalization [2020/04/17 11:16] – simone | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Normalization ====== | + | ====== |
+ | Normalization is the task of " | ||
+ | |||
+ | In the case of our corpus, we have manually normalized some data in the Swiss German dialect, resulting in the corpus WUS_DIALOG_GSW (5 chats, 34,683 tokens). | ||
01_corpus/02_preprocessing/07_normalization.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1