On 1/4/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
If the HTMLStripReader simply replaced each stripped piece of HTML with spaces (the same length as the removed HTML), the positions for the highlighter would be correct. Most of the Tokenizers would be happy with this solution (except maybe the KeywordTokenizer).
Good idea, Mirko. That's probably a much easier fix than the one I envisioned.

-Yonik
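
For anyone following along, here is a rough sketch of the padding idea in Python. It is not the HTMLStripReader code, just an illustration. The names `blank_out_tags` and `whitespace_tokens` are made up for this example, and the regex only handles simple tags (no entities, comments, or script blocks). The point is that when every removed tag is replaced by the same number of spaces, token offsets computed on the stripped text still index correctly into the original markup.

```python
import re

# Hypothetical pattern for this sketch: anything that looks like a simple tag.
# Entities, comments and <script> bodies are deliberately ignored.
TAG_RE = re.compile(r"<[^>]*>")


def blank_out_tags(text):
    """Replace each tag with an equal number of spaces, preserving length."""
    return TAG_RE.sub(lambda m: " " * len(m.group(0)), text)


def whitespace_tokens(text):
    """Return (token, start, end) triples, like a basic whitespace tokenizer."""
    return [(m.group(0), m.start(), m.end()) for m in re.finditer(r"\S+", text)]


original = "<p>Hello <b>world</b></p>"
padded = blank_out_tags(original)

for token, start, end in whitespace_tokens(padded):
    # The offsets found in the padded text select the same token in the original.
    print(token, start, end, original[start:end] == token)
```

A tokenizer that splits on whitespace just skips the padding. One that treats the whole input as a single token would keep the padding inside that token, which is presumably the concern with the KeywordTokenizer.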
