What is the correct way to verify a pattern using URLFilterChecker after adding
it to conf/regex-urlfilter.txt ? I know I’ll need rerun ant to get the conf
change into the mapreduce job when the pattern excludes as I intend.
To conf/regex-urlfilter.txt before my whitelist I added:
Thank you your example was helpful and I found the archived thread mentioned.
However I still don’t understand if my filter is working based on the output.
Can someone clarify the meaning of URLFilterChecker’s output?
goal:
prevent crawling of all obituaries on http://www.cabinet.com
Search this mailing list archI've for 'URLFilterChecker documentation',
you'll find the following:
From: Markus Jelsma markus.jel...@openindex.io
Date: Dec 9, 2011 2:02 PM
Subject: Re: URLFilterChecker documentation
To: remi tassing tassingr...@gmail.com
Cc:
That's not stdin is it?
echo
3 matches
Mail list logo