corpus:00_corpus

Differences

This shows you the differences between two versions of the page.

corpus:00_corpus [2019/09/22 10:34] – ↷ Links adapted because of a move operation simone
corpus:00_corpus [2019/09/23 13:48] (current) – removed simone
Line 1: Line 1:
-====== The corpus ====== 
- 
-The corpus consists of 617 chats that the Swiss population sent in during 2014 through a [[corpus:01_collection|fixed procedure]], which was publicized in the press to get people interested. Each chat was checked for [[corpus:02_permission|permission]] to use it, and chats that had to be [[corpus:removed|removed]] were identified. Furthermore, the provided [[corpus:demographics|demographic data]] were linked to the chats.
- 
-In a first step, the data underwent basic processing so that the project members could work with them. This included the [[corpus:03_anonymization|anonymization]] and the annotation of a [[corpus:languages|main language]] per chat, which in turn allowed the creation of [[subcorpora|subcorpora]].
- 
  
corpus/00_corpus.1569141279.txt.gz · Last modified: 2022/06/27 09:21 (external edit)
