Hi,
Please give your suggestions for the following problem.
I have a web app with 3 pages: index.jsp, p1.jsp and p2.jsp.
Links to p1.jsp and p2.jsp are given in the index.jsp page.
I have crawled the site and started the Nutch web application.
I am able to search the contents of index.jsp, p1.jsp and p2.jsp.
Now I have renamed the page p2.jsp. This makes the link to p2.jsp in
index.jsp invalid, as p2.jsp no longer exists under that name. I have
done a recrawl and redeployed the Nutch web application.
Now if I search for content that was in p2.jsp, I still get
the link to p2.jsp. On clicking the link I get a 404 page not found,
which is expected as the file has been renamed.
How can I configure Nutch to make sure that p2.jsp is not displayed in the
results, as it is no longer on the site?
Regards,
Rinesh.
--
View this message in context:
http://www.nabble.com/Removal-of-deleted-pages-from-the-index-tp21229220p21229220.html
Sent from the Nutch - User mailing list archive at Nabble.com.