User Tools

Site Tools


01_corpus:start

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
01_corpus:start [2020/05/04 07:04] simone01_corpus:start [2025/09/16 12:02] (current) Gabrielle Aguila-Multner
Line 8: Line 8:
   * Number of chats: 617   * Number of chats: 617
   * Number of messages (with permission to be used): 763’644   * Number of messages (with permission to be used): 763’644
 +  * Number of informants (who gave their permission): 944
   *  Number of tokens: 5'155'476 (without redactedQ.* (cf. [[01_corpus:02_preprocessing:02_without_permission|Messages without permission]]))   *  Number of tokens: 5'155'476 (without redactedQ.* (cf. [[01_corpus:02_preprocessing:02_without_permission|Messages without permission]]))
   * Number of emojis: 382'116   * Number of emojis: 382'116
Line 32: Line 33:
   * roh-vl: rumantsch vallader   * roh-vl: rumantsch vallader
   * roh-gr: rumantsch grischun    * roh-gr: rumantsch grischun 
 +
 +The main way to browse the corpus is through the [[https://lcp.linguistik.uzh.ch/manual/|LiRI Corpus Platform]] (LCP). UZH members can also browse it using [[https://corpus-tools.org/annis/|ANNIS]], which was developed and made available by Anke Lüdeling and her team:
 +
 +Krause, Thomas & Zeldes, Amir (2016): ANNIS3: A new architecture for generic corpus query and visualization. in: Digital Scholarship in the Humanities 2016 (31). [[http://dsh.oxfordjournals.org/content/31/1/118|http://dsh.oxfordjournals.org/content/31/1/118]]
  
  
  
01_corpus/start.1588575859.txt.gz · Last modified: (external edit)

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki