Does anybody have Nutch 2.0 and MySQL working successfully with languages other 
than English? I have Nutch 2.0 working successfully with MySQL but non-English 
languages only work with the BLOB columns and not the varchar columns. I have 
modified the default sql table generated by Nutch 2.0 to handle UTF8 as MySQL 
defaults to latin but still get only question marks for the varchar fields for 
the Japanese from UTF-8 websites. I have tested directly entering Japanese into 
those fields via SQL and that works fine so I suspect it is my Nutch set up. 
Perhaps I have not configured something correctly for crawling sites with 
languages other than English.

 

James Sullivan
Mobile: 080-1083-5463

トムソン・ロイター・マーケッツ株式会社 

〒107-6119 東京都港区赤坂5-2-20 赤坂パークビル18階

 

 

Reply via email to