01_corpus:02_preprocessing:03_emojis
01_corpus:02_preprocessing:03_emojis [2020/04/22 12:56] – ↷ Links adapted because of a move operation simone | 01_corpus:02_preprocessing:03_emojis [2020/04/22 13:00] – simone | ||
Emojis are Unicode characters. WhatsApp uses its own fonts so that emojis look the same on all operating systems. In our corpus browsers, emojis can be displayed, but they are rendered in whatever font the user's system provides. It therefore cannot be guaranteed that an emoji in the original text looked the way it does on your screen.
Querying emojis is not an easy task. We decided to encode them in the messages, e.g. as
  * ''
  * ''
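Because emojis are ordinary Unicode characters, one way to make them queryable is to replace each one with a plain-text label. The corpus's actual encoding is not reproduced here, so the following is only a hypothetical sketch: the function name ''encode_emojis'' and the bracketed label format are assumptions for illustration, and the labels come from Python's standard Unicode character names rather than from the corpus scheme.

```python
import unicodedata


def encode_emojis(text: str) -> str:
    """Replace symbol characters with a bracketed, searchable label.

    Hypothetical illustration only; this is NOT the corpus's own encoding.
    Assumption: characters in the Unicode category "So" (Symbol, other)
    are treated as emojis. That category also covers non-emoji symbols,
    and it does not cover skin-tone modifiers or joiner characters.
    """
    out = []
    for ch in text:
        if unicodedata.category(ch) == "So":
            # Fall back to the code point if the character has no name.
            name = unicodedata.name(ch, f"U+{ord(ch):04X}")
            out.append("[" + name.lower().replace(" ", "_") + "]")
        else:
            out.append(ch)
    return "".join(out)


if __name__ == "__main__":
    print(encode_emojis("hi \U0001F600"))
```

A label of this kind is independent of the font used for display, so a query for the label matches the same emoji regardless of how it is rendered on a given screen.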
01_corpus/02_preprocessing/03_emojis.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1