Hi, I have followed the tutorial to setup my Nutch, up and running. Currently it is able to crawl php files, but not the pdf files.
Can anyone please advise how can I setup or configure to make it crawl onto pdf and word docs? Thanks.
Hi, I have followed the tutorial to setup my Nutch, up and running. Currently it is able to crawl php files, but not the pdf files.
Can anyone please advise how can I setup or configure to make it crawl onto pdf and word docs? Thanks.