kth_threshold

Usage

A tool for performing threshold estimation using the k-highest impact score for each term, pair or triple of a query. Pairs and triples are only used if provided with --pairs and --triples respectively.

    Usage: ../../../build/bin/kth_threshold [OPTIONS]

    Options:
      -h,--help                   Print this help message and exit
      -e,--encoding TEXT REQUIRED Index encoding
      -i,--index TEXT REQUIRED    Inverted index filename
      -w,--wand TEXT REQUIRED     WAND data filename
      --compressed-wand Needs: --wand
                                  Compressed WAND data file
      --tokenizer TEXT:{english,whitespace} [english]
                                  Tokenizer
      -H,--html                   Strip HTML
      -F,--token-filters TEXT:{krovetz,lowercase,porter2} ...
                                  Token filters
      --stopwords TEXT            Path to file containing a list of stop words to filter out
      -q,--queries TEXT           Path to file with queries
      --terms TEXT                Term lexicon
      --weighted                  Weights scores by query frequency
      -k INT REQUIRED             The number of top results to return
      -s,--scorer TEXT REQUIRED   Scorer function
      --bm25-k1 FLOAT Needs: --scorer
                                  BM25 k1 parameter.
      --bm25-b FLOAT Needs: --scorer
                                  BM25 b parameter.
      --pl2-c FLOAT Needs: --scorer
                                  PL2 c parameter.
      --qld-mu FLOAT Needs: --scorer
                                  QLD mu parameter.
      -L,--log-level TEXT:{critical,debug,err,info,off,trace,warn} [info]
                                  Log level
      --config                    Configuration .ini file
      -p,--pairs TEXT Excludes: --all-pairs
                                  A tab separated file containing all the cached term pairs
      -t,--triples TEXT Excludes: --all-triples
                                  A tab separated file containing all the cached term triples
      --all-pairs Excludes: --pairs
                                  Consider all term pairs of a query
      --all-triples Excludes: --triples
                                  Consider all term triples of a query
      --quantized                 Quantizes the scores
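
The estimation idea named in the description can be illustrated with a small sketch. It is not the tool's implementation: the function names, the in-memory `postings` dictionary, and the sample scores are all hypothetical, and it assumes non-negative impact scores. Under that assumption, the k-th highest impact score in a single term's posting list is a lower bound on the k-th highest score for the whole query, because every document in that list scores at least its impact for that term. The same reasoning applies to the summed scores of documents that contain both terms of a pair, so the estimate is the largest of these bounds. Triples would be handled the same way and are left out for brevity.

```python
import heapq
from itertools import combinations


def kth_highest(scores, k):
    """Return the k-th highest value in `scores`, or 0.0 if there are fewer than k."""
    top = heapq.nlargest(k, scores)
    return top[-1] if len(top) == k else 0.0


def estimate_threshold(query_terms, postings, k, use_pairs=False):
    """Estimate a lower bound on the k-th highest query score.

    `postings` is a hypothetical in-memory index: term -> {doc_id: impact score}.
    Scores are assumed to be non-negative.
    """
    terms = sorted(set(query_terms))
    threshold = 0.0

    # Single terms: the k-th highest impact score in each posting list.
    for term in terms:
        term_scores = postings.get(term, {}).values()
        threshold = max(threshold, kth_highest(term_scores, k))

    # Pairs: the k-th highest summed score over documents containing both terms.
    if use_pairs:
        for a, b in combinations(terms, 2):
            pa, pb = postings.get(a, {}), postings.get(b, {})
            shared = pa.keys() & pb.keys()
            pair_scores = (pa[doc] + pb[doc] for doc in shared)
            threshold = max(threshold, kth_highest(pair_scores, k))

    return threshold


# Made-up sample data for illustration only.
postings = {
    "a": {1: 3.0, 2: 1.0, 3: 2.0},
    "b": {2: 2.5, 3: 2.0, 4: 0.5},
}
terms_only = estimate_threshold(["a", "b"], postings, k=2)
with_pairs = estimate_threshold(["a", "b"], postings, k=2, use_pairs=True)
```

In this sample, adding pairs can only raise the estimate, which is why supplying pair or triple information may give a tighter threshold than single terms alone.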