Dear Wiki user, You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.
The "TextProfileSignature" page has been changed by JoelNothman: http://wiki.apache.org/solr/TextProfileSignature?action=diff&rev1=3&rev2=4 Comment: small additional caveat These properties all suggest that TextProfileSignature is brittle for short texts. - TextProfileSignature operates on raw text, without the filtering provided by Analyzers, and hence will fail to ignore HTML, normalize for diacritics, stem, or incorporate the relative importance of different tokens, etc. + TextProfileSignature operates on raw text, without the filtering provided by Analyzers, and hence will fail to ignore HTML, normalize for diacritics, word stem/semantics, or incorporate the relative importance of different tokens, etc. It also considers only the bag of words, ignoring any word order.