I meet a very strange problem,my nutch8.1 can crawl http sites normally but 
while fetching ftp sites, it got messy code.

for example, in the root diretory of an ftp site, there is a subdiretory named 
in chinese"教学资源中心", a normal crawl result should be 
index of /教学资源中心
but when nutch fetch it, it become 
Index of /???¡ì¡Á???????/ 
this problem not apeared while feching diretories named in english.

can anyone tell me how to do ?

thanks in advance.

 

Reply via email to