PHP/ir

Information Retrieval and other interesting topics

Alternative Term Weighting

In: ranking, probability

11 Nov 2009

The term weighting and ranking function is at the core of any information retrieval system. The vector space model with the cosine similarity is maybe the best known and most widely used, but there are plenty of alternatives. We're looking at two here, the BM25 function based around a probabilistic model, and a function based around language modeling.