Queries : bag of words, Set of documents D: a seq of words. For query , retrieved relevant documents ; Got full or partial orderings . Let . Maybe want to complete Q vs D relevence level matrix. Or, given docs D’ relevant for unseen query q, want to rank D'.
; D is usually . could be TF/IDF, common word count etc.. Let be scoring function; it induces a relevance order on D for every query ; let .
Let be ground truth relevance-ordering of D corresponding to query ; let y be induced by .
Loss function: measures deviation of from .\
Risk: . Want to find f which minimizes L. But Pr(q) usually unknown; so minimize empirical risk: . Assume that .
\tbc
Checks if this is violated: ; so . Widely used. Eg: SVM ordinal regression; ranknet (best pairwise function as of 2009); lambda-rank: .
Measure badness of a bigger part of the ranking, not just correctness of ranking of pairs of documents. uniquely identified by ; so can consider or .
Define prob distribution over permutations with good properties: favors permutations rated highly according to f to permutations rated lower, best permutation according to f has max probability. So find f whose is closest to . Can minimize cross entropy: (List net); or : thereby maximizing likelihood (List-MLE). These Listwise loss calculation is hard: may need to consider n! permutations: so instead consider only ranking of top docs induced by f. List-MLE known to be best Listwise loss function based ranking method (2009), also better than all pairsise loss function based methods.