Hello Jeff,

Sorry for the late reply.

On 02/09/2017 02:25 AM, Jeff Taylor wrote:
> I've been running Mnogo 3.3.13 under debian wheezy for a number of
> years, but it seems that old cached content is never removed. I
> normally run with HoldBadHrefs=7d, but I've even tried setting it to 1s,
> and the old content is still never removed. As an example, on a recent
> search I noticed pages that were last cached in Aug 2011 (which was
> probably when the site went offline), but it still comes up in searches.
>
> Help? I would even be happy with running a mysql query to remove all
> cached content that is more than 7 days old, but I didn't want to go
> blindly deleting things without knowing how the info in the tables might
> be cross-referenced. If there's a way to fix indexer.conf I would also
> be happy. Note that the old pages which should be removed are no longer
> referenced in server.list, and when I run indexer I get a long list of
> URLs that can't be reached. So what can I do to get these old entries
> removed from the database?

Which http status do these old documents have?

Can you please check statistics for a few old documents:

./indexer -S -u http://old1/
./indexer -S -u http://old2/
./indexer -S -u http://old3/

Or using this SQL query:

SELECT status, url FROM url
WHERE url IN ('http://old1/','http://old2/','http://old3/');

Also, the output from this command would be helpful:

./indexer -am -v6 -u http://old1/

Please also send your indexer.conf to b...@mnogosearch.org.

> _______________________________________________
> General mailing list
> General@mnogosearch.org
> http://lists.mnogosearch.org/listinfo/general