ok this is how i did it. i created a class in the
org.apache.nutch.crawl package, the same package where the crawl class
(which is nutch's main class, called by the crawl command). in that
class, you create the crawl class with the appropriate parameter. just
look at the code once you import it into eclipse or the javadoc.

here's the link: http://wiki.apache.org/nutch/RunNutchInEclipse0.9

just a tip, try using google or nutch wiki, there's a lot of useful
stuff there :-)

On Wed, Mar 18, 2009 at 2:01 PM, n_developer <spoo_0...@yahoo.co.in> wrote:
>
> "you have a manual for importing nutch into eclipse in case you don't know
> how"
> can u pl mention the link...
> thanx in advance
>
>
>
>
> ripper07 wrote:
>>
>> well you can always write a bash script or a java class that does
>> this. writing a java class is probably better and easier. you have a
>> manual for importing nutch into eclipse in case you don't know how. i
>> needed a similar thing done and it turned out that using java really
>> is easier...
>>
>> On Wed, Mar 18, 2009 at 12:36 PM, MyD <myd.ro...@googlemail.com> wrote:
>>>
>>> Hi @ all,
>>>
>>> is it possible to set the next fetch schedule for a url in another crawl
>>> dir?
>>>
>>> Example:
>>>
>>> crawl.dir.A
>>> -> retrieve links and set the fetch schedule but this should go into the
>>> crawl.dir.B
>>>
>>> crawl.dir.B
>>>
>>> Thanks in advance
>>>
>>> Regards,
>>> MyD
>>> --
>>> View this message in context:
>>> http://www.nabble.com/Nutch-1.0-trunk-Fetch-Schedule-tp22577234p22577234.html
>>> Sent from the Nutch - User mailing list archive at Nabble.com.
>>>
>>>
>>
>>
>
> --
> View this message in context: 
> http://www.nabble.com/Nutch-1.0-trunk-Fetch-Schedule-tp22577234p22578664.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>

Reply via email to