hi all While using nutch0.8 to parse some chinese pdf files encoded in GBK,I always get errors message as:" Unknown encoding for 'GBK-EUC-H' " , should I change some settings or recomplie the parse-pdf plugin?
thanks
hi all While using nutch0.8 to parse some chinese pdf files encoded in GBK,I always get errors message as:" Unknown encoding for 'GBK-EUC-H' " , should I change some settings or recomplie the parse-pdf plugin?
thanks