Thanx, but that gives me the whole DMOZ directory, all I need is a few
specific categories of it like SCI-Fi for example

-----Original Message-----
From: Kzjnet [mailto:[email protected]] 
Sent: Tuesday, August 13, 2013 9:03 PM
To: [email protected]
Subject: Re: Nutch DMOZ parser

Try this:
bin/nutch org.apache.nutch.tools.DmozParser content.rdf.u8 > dmoz/urls

"Ralf R. Kotowski" <[email protected]> wrote:

>Hi,
>
> 
>
>In the tutorial it gives an example on how to extract about 1000 RANDOM
urls
>from DMOZ to use as seed for Nutch.
>
> 
>
>However I would like to have ALL URLS from a specific category, is this
>possible?
>
> 
>
> 
>
>Thnx!
>

Reply via email to