Hi All, I would like to share an issue regarding the encoding using Nutch 0.9.x.
When I'm indexing some sites, which contains lot of ISO-8859-2 characters, (these are mainly eastern-european sites, mainly hungarian ones) then at the search page I cannot see the characters correcty. Even at the cached view, the non-english characters like áéúő are visible as a question mark. If some of you, have an experience with this issue, I would be glad when some of You can help me. Many Thanks in Advance, Zsolt ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
