I would like to "groom" the crawldb. My guess is that it should be easy to build on the functions that remove entries with 404 status and duplicates. But where do I find these functions?
Thank you