compute_intersection
Usage
Computes intersections of posting lists.
Usage: ../../../build/bin/compute_intersection [OPTIONS]
Options:
-h,--help Print this help message and exit
-e,--encoding TEXT REQUIRED Index encoding
-i,--index TEXT REQUIRED Inverted index filename
-w,--wand TEXT REQUIRED WAND data filename
--compressed-wand Needs: --wand
Compressed WAND data file
--tokenizer TEXT:{english,whitespace} [english]
Tokenizer
-H,--html Strip HTML
-F,--token-filters TEXT:{krovetz,lowercase,porter2} ...
Token filters
--stopwords TEXT Path to file containing a list of stop words to filter out
-q,--queries TEXT Path to file with queries
--terms TEXT Term lexicon
--weighted Weights scores by query frequency
-L,--log-level TEXT:{critical,debug,err,info,off,trace,warn} [info]
Log level
--config Configuration .ini file
--combinations Compute intersections for combinations of terms in query
--max-term-count,--mtc UINT Needs: --combinations
Max number of terms when computing combinations
--min-query-len UINT Minimum query length
--max-query-len UINT Maximum query length
--header Write TSV header
Description
Computes an intersection of posting lists given by the input queries.
It takes a file with queries and outputs the documents in the
intersection of the posting lists. See queries
for
more details on the input parameters.