How to verify URLFilterChecker

2015-02-09 Thread Scott Lundgren
What is the correct way to verify a pattern using URLFilterChecker after adding it to conf/regex-urlfilter.txt ? I know I’ll need rerun ant to get the conf change into the mapreduce job when the pattern excludes as I intend. To conf/regex-urlfilter.txt before my whitelist I added:

Re: How to verify URLFilterChecker

2015-02-09 Thread Scott Lundgren
Thank you your example was helpful and I found the archived thread mentioned. However I still don’t understand if my filter is working based on the output. Can someone clarify the meaning of URLFilterChecker’s output? goal: prevent crawling of all obituaries on http://www.cabinet.com

Re: How to verify URLFilterChecker

2015-02-09 Thread remi tassing
Search this mailing list archI've for 'URLFilterChecker documentation', you'll find the following: From: Markus Jelsma markus.jel...@openindex.io Date: Dec 9, 2011 2:02 PM Subject: Re: URLFilterChecker documentation To: remi tassing tassingr...@gmail.com Cc: That's not stdin is it? echo