On Thu, Mar 1, 2012 at 6:15 PM, Mike O'Leary <tmole...@uw.edu> wrote: > In the Javadoc page for the Similarity class, it says, > > "Lucene combines Boolean model (BM) of Information Retrieval with Vector > Space Model (VSM) of Information Retrieval - documents "approved" by BM are > scored by VSM." > > Is the Vector Space Model that is referred to here different than the term > vectors that can optionally be stored in index fields?
Yes, it refers to http://en.wikipedia.org/wiki/Vector_space_model, which uses statistics stored in the index. Term vectors are not used here. Instead term vectors are really just like storing a separate individual inverted index for each document. For example, they are used by MoreLikeThis to retrieve the terms and frequencies from just that one document. -- lucidimagination.com --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org