HI ,   
    Please give your suggestions for the following problem.
    I have a web appwith 3 pages index.jsp , p1.jsp and p2.jsp.
    links for p1.jsp and p2.jsp are give in the index.jsp page.
    I have crawled the site and have started the nutch web applciation.
    I am able to search contents in index.jsp,p1.jsp,p2,jsp.


    Now I have renamed the page p2.jsp.This makes the link for p2.jsp in
index.jsp invalid as p2.jsp is renamed.I have done recrawl and redeployed
the nutch web application.


   Now if I do a search for content which is there in  p2.jsp.I am getting
the link to p2.jsp.On clicking on the link I am getting 404 page not found
which is fine as the file is renamed.

How can I configure nutch to make sure that p2.jsp is not displayed in the
result as this is no more there in the site.
Regards,
Rinesh.
    
-- 
View this message in context: 
http://www.nabble.com/Removal-of-deleted-pages-from-the-index-tp21229220p21229220.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to