Better Highligther fragmenter?

Michael Imbeault Sat, 16 Sep 2006 12:04:49 -0700

I'm now using the excellent Hightlighter from within Solr and it worksvery well; except that the generated fragments sometimes begins withbad-looking characters (the "." of the end of the previous phrase, or a), /10, etc). The same is true for the fragments ends. I looked at boththe dev and user lucene list in search for a better Fragmenter class,but it seems that there's none right now (just the simple and nullfragmenters).

To me the 'simple' fragmenter is a bit too simple; anyone had success inimplementing a more intelligent one? I have no java coding experience,sadly, so I don't know where to begin on this one. I don't think fancyphrase recognition is needed; just a better boundary algorithm (avoidbeginning / ending fragments with bad looking characters) and theaddition of "..." at the end and beginning of the fragment iffragmentation of a phrase took place.

Also, is it required that the highlighted field is 'stored'? I'm prettysure it is, but just want confirmation.


Thanks,

--
Michael Imbeault
CHUL Research Center (CHUQ)
2705 boul. Laurier
Ste-Foy, QC, Canada, G1V 4G2
Tel: (418) 654-2705, Fax: (418) 654-2212


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Better Highligther fragmenter?

Reply via email to