Hey,I am confused with the crawling with nutch.
as you know,there are some website which can not be accessed becaused
they are the "post"method,that means,even if you know the web site's
url,when you input the url into the address bar on the IE or Mozilla,the
website 's some important content has lost.
what should I do,should I do a plugin to extend the crawling ?
eg:
http://www.51job.com/hot/show_job_detail.php?id=100655204&jobiduni=(102344234)

Reply via email to