Hello Hany, You need to tell the indexer to delete those record. This will help:
<!-- delete gone and redirects --> <property> <name>indexer.delete</name> <value>true</value> </property> Regards, Markus Op ma 8 mrt. 2021 om 15:31 schreef Hany NASR <hany.n...@hsbc.com.invalid>: > Hi All, > > I'm using Nutch 1.15, and figure out that permeant redirect pages (301) > are still indexed and not removed in Solr. > > When I exported the crawlDB I found the page Status: 5 (db_redir_perm). > > How can I keep Solr index up to date and make Nutch clean these pages > automatically? > > Regards, > Hany > > ----------------------------------------- > SAVE PAPER - THINK BEFORE YOU PRINT! > > This E-mail is confidential. > > It may also be legally privileged. If you are not the addressee you may > not copy, > forward, disclose or use any part of it. If you have received this message > in error, > please delete it and all copies from your system and notify the sender > immediately by > return E-mail. > > Internet communications cannot be guaranteed to be timely secure, error or > virus-free. > The sender does not accept liability for any errors or omissions. >