01_corpus:02_preprocessing:07_normalization

01_corpus:02_preprocessing:07_normalization [2020/04/16 16:36] – ↷ Page moved and renamed from 01_corpus:04_annotations:03_normalization to 01_corpus:02_preprocessing:07_normalization simone
 +====== Normalization ======
 +Normalization is the task of "translating" non-standard language into standard language. It can be performed manually or automatically with computational linguistic tools.
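
As a rough illustration of what word-level normalization means, the sketch below maps non-standard tokens to standard forms with a lookup table. It is only a minimal sketch: the lexicon entries and the names `LEXICON` and `normalize` are hypothetical, they are not taken from WUS_DIALOG_GSW, and this is not the method used in the project.

```python
from typing import Mapping

# Hypothetical lookup table: non-standard (Swiss German) form -> standard German form.
# The entries are illustrative only and do not come from the corpus.
LEXICON = {
    "isch": "ist",
    "nöd": "nicht",
    "viil": "viel",
}


def normalize(text: str, lexicon: Mapping[str, str] = LEXICON) -> str:
    """Replace each whitespace-separated token with its standard form, if known.

    Lookup is case-insensitive; unknown tokens are passed through unchanged.
    """
    tokens = text.split()
    return " ".join(lexicon.get(tok.lower(), tok) for tok in tokens)


print(normalize("das isch nöd viil"))  # prints "das ist nicht viel"
```

A plain lookup like this cannot handle unseen spellings or ambiguous forms, which is one reason normalization is done either manually or with trained models.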
  
 +In the case of our corpus, we have manually normalized some data in the Swiss German dialect, resulting in the corpus WUS_DIALOG_GSW.
 +
 +Another set of data was processed automatically. You can read more about that project in:
 +
 +Ruzsics, Tatiana; Lusetti, Massimo; Göhring, Anne; Samardžić, Tanja; Stark, Elisabeth (2019): Neural Text Normalization with Adapted Decoding and PoS Features. [[https://www.cambridge.org/core/journals/natural-language-engineering/article/neural-text-normalization-with-adapted-decoding-and-pos-features/474B380A32EF96CCED1708229848F3FB|Natural Language Engineering]].
 +
 +This data will be made available soon.
01_corpus/02_preprocessing/07_normalization.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1
