./nutch org.apache.nutch.net.URLFilterChecker -allCombined < in.txt Checking combination of all URLFilters available +http://urlAlpha.docx
So it looks like it is a valid one, right? Any other testing tools to try? -- Chris On Mon, Dec 19, 2011 at 2:52 PM, Markus Jelsma <[email protected]> wrote: > You must feed URL's from stdin. > >> Does it normally take a long time to run? It's been going about 5 >> minutes... >> >> -- Chris >> >> >> >> On Mon, Dec 19, 2011 at 2:43 PM, Markus Jelsma >> >> <[email protected]> wrote: >> >> bin/nutch org.apache.nutch.net.URLFilterChecker -allCombined

