compute_intersection

Usage

Computes intersections of posting lists. Usage: ../../../build/bin/compute_intersection [OPTIONS] Options: -h,--help Print this help message and exit -e,--encoding TEXT REQUIRED Index encoding -i,--index TEXT REQUIRED Inverted index filename -w,--wand TEXT REQUIRED WAND data filename --compressed-wand Needs: --wand Compressed WAND data file --tokenizer TEXT:{english,whitespace} [english] Tokenizer -H,--html Strip HTML -F,--token-filters TEXT:{krovetz,lowercase,porter2} ... Token filters --stopwords TEXT Path to file containing a list of stop words to filter out -q,--queries TEXT Path to file with queries --terms TEXT Term lexicon --weighted Weights scores by query frequency -L,--log-level TEXT:{critical,debug,err,info,off,trace,warn} [info] Log level --config Configuration .ini file --combinations Compute intersections for combinations of terms in query --max-term-count,--mtc UINT Needs: --combinations Max number of terms when computing combinations --min-query-len UINT Minimum query length --max-query-len UINT Maximum query length --header Write TSV header

Description

Computes an intersection of posting lists given by the input queries.

It takes a file with queries and outputs the documents in the intersection of the posting lists. See queries for more details on the input parameters.