Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change 
notification.

The "TextProfileSignature" page has been changed by JoelNothman:
http://wiki.apache.org/solr/TextProfileSignature?action=diff&rev1=3&rev2=4

Comment:
small additional caveat

  
  These properties all suggest that TextProfileSignature is brittle for short 
texts.
  
- TextProfileSignature operates on raw text, without the filtering provided by 
Analyzers, and hence will fail to ignore HTML, normalize for diacritics, stem, 
or incorporate the relative importance of different tokens, etc.
+ TextProfileSignature operates on raw text, without the filtering provided by 
Analyzers, and hence will fail to ignore HTML, normalize diacritics, account 
for word stems or semantics, or incorporate the relative importance of 
different tokens, etc. It also considers only the bag of words, ignoring any 
word order.
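
The caveats above can be illustrated with a toy bag-of-words signature. This is
only a rough sketch and not Solr's TextProfileSignature implementation: the
function name toy_profile_signature, the tokenization regex, and the choice to
hash a sorted "token count" listing with MD5 are all assumptions made for
illustration. It shows that reordering words leaves such a signature unchanged,
while HTML tags, diacritics, and inflected forms each change it.

```python
import hashlib
import re
from collections import Counter


def toy_profile_signature(text: str) -> str:
    """Hypothetical, simplified bag-of-words signature (not Solr's code)."""
    # Split the raw text into lowercase runs of letters and digits.
    # No HTML stripping, diacritic folding, or stemming is applied.
    tokens = re.findall(r"[^\W_]+", text.lower())
    counts = Counter(tokens)
    # Order by descending count, then by token, so the profile depends
    # only on which words occur and how often, never on their order.
    profile = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    serialized = "\n".join(f"{tok} {n}" for tok, n in profile)
    return hashlib.md5(serialized.encode("utf-8")).hexdigest()


# Word order is ignored: these two texts collide.
same = toy_profile_signature("the dog bit the man") == toy_profile_signature(
    "the man bit the dog"
)
# Markup, diacritics, and inflection are not normalized: these differ.
html_differs = toy_profile_signature("<b>hello</b> world") != toy_profile_signature(
    "hello world"
)
accent_differs = toy_profile_signature("cafe") != toy_profile_signature("café")
stem_differs = toy_profile_signature("run") != toy_profile_signature("running")
```

In this sketch, texts with opposite meanings but identical word counts share a
signature, whereas near-duplicates that differ only in markup or accents do
not.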
  
