OK this is now fixed.  Thanks Steve!

Mike

Steven A Rowe wrote:

Hi Mike,

On 3/9/2009 at 2:34 PM, Michael McCandless wrote:
See changes at http://lucene.apache.org/java/2_4_1/changes/Changes.html

Minor nit: the encoding of Christian Kohlschütter's name in the 2.4.1 section of CHANGES.txt appears to be Latin-1, but changes2html.pl assumes that CHANGES.txt is encoded as UTF-8, so the resulting Changes.html has an improperly encoded "ü" (lowercase "u" with an umlaut):

   14. LUCENE-1186: Add Analyzer.close() to free internal ThreadLocal
   resources.
   (Christian Kohlsch�tter via Mike McCandless)

For me, both in the web browser and in the excerpt from it that I've pasted above, instead of a lowercase "u" with an umlaut, I see a small white question mark on a black diamond background, indicating an invalid UTF-8 byte sequence: byte 0xFC, marking the beginning of a multi-byte sequence, but then no trailing bytes with the high bit set.

Anyway, I think the fix is simple: edit CHANGES.txt so that "Kohlschütter" is properly encoded as UTF-8, as the remainder of the file is, then regenerate Changes.html.

Steve



---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to