Hello folks, I am working on adopting nutch for a vertical. I have been able to get it up and running in pretty basic scenarios. I need some help in getting up to speed in trying to crawl sites which has some weird encoding on the URLs. I am kind of lost, how to go about it? If some one can share some insights, it will be very helpful.
Has any one solved a problem in crawling sites with encoded URLS, please let me know.. It was asked by another nutch user, but no response.. so not sure, if that user was able to solve it.. Thanks Sudhi
