User Tools

Site Tools


02_browsing:04_queries:04_combined_queries

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
02_browsing:04_queries:04_combined_queries [2020/04/21 11:24] – ↷ Page name changed from 02_browsing:04_queries:03_combined_queries to 02_browsing:04_queries:04_combined_queries simone02_browsing:04_queries:04_combined_queries [2020/04/29 21:35] – ↷ Links adapted because of a move operation 83.77.216.186
Line 1: Line 1:
-====== 2.4.Combined queries ======+====== 2.4.Combined queries ======
 As shown above, you can query for individual tokens. But what if you need more detail, maybe from different [[02_browsing:02_layers|layers]]? Let us consider the following examples: As shown above, you can query for individual tokens. But what if you need more detail, maybe from different [[02_browsing:02_layers|layers]]? Let us consider the following examples:
   - The Italian token //io// used by only male informants   - The Italian token //io// used by only male informants
Line 18: Line 18:
   - In the third example we look for two tokens, one directly following the other. Here, we could use one of the normalisations, too, i.e. **mftb_lem** (the [[01_corpus:02_preprocessing:06_pos|tagger]] used for French) or we could use the token. This choice depends on what we want to find. If we are after the spelling //est-ce que// used by the informant, we query for ''tok=/.../''. If, on the other hand, we want to include unconventional spellings like //sq//, we have to use ''mftb_lem=/.../''. Let us use the first option, which gives us the following query: ''tok="est-ce" & tok="que" & #1 . #2'', which we can read as: a first token //est-ce// and a second token //que//. The expression ''#1 . #2'' means the first token has to directly precede the second one.   - In the third example we look for two tokens, one directly following the other. Here, we could use one of the normalisations, too, i.e. **mftb_lem** (the [[01_corpus:02_preprocessing:06_pos|tagger]] used for French) or we could use the token. This choice depends on what we want to find. If we are after the spelling //est-ce que// used by the informant, we query for ''tok=/.../''. If, on the other hand, we want to include unconventional spellings like //sq//, we have to use ''mftb_lem=/.../''. Let us use the first option, which gives us the following query: ''tok="est-ce" & tok="que" & #1 . #2'', which we can read as: a first token //est-ce// and a second token //que//. The expression ''#1 . #2'' means the first token has to directly precede the second one.
  
-That much for the examples. But how can you remember all of these options? You do not have to, since ANNIS offers you lots of [[02_browsing:04_queries:05_support|support in creation the queries]].+That much for the examples. But how can you remember all of these options? You do not have to, since ANNIS offers you lots of [[02_browsing:04_queries:01_support|support in creation the queries]].
02_browsing/04_queries/04_combined_queries.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki