Anurag
Thanks. It is my configuration problem.
Paul

--- On Sat, 12/18/10, Anurag <[email protected]> wrote:


From: Anurag <[email protected]>
Subject: Re: Nutch not fetching all urls from urlsdir
To: [email protected]
Received: Saturday, December 18, 2010, 10:09 AM



Check the congiguration files in Conf folder. It may be that domain
name of certain patterrns have been not allowed. ....eg.
urlfilter.txt, regex....files etc..

On 12/18/10, Chris Woolum [via Lucene]
<[email protected]> wrote:
>
>
>     Hello everyone,
>
> I have a list of urls that I am testing with.  There are currently 10
> urls that I am injecting. The problem though is that when I look through
> the log, I only see 3 or 4 of them being fetched. What would cause this?
> There are no errors that I can find. My understanding is that on the
> first pass of the fetch, all urls that were injected should be fetched.
> Am I correct in thinking this?
>
> Thanks,
> Chris
>
>
>
>     ______________________________________
>     View message @
> http://lucene.472066.n3.nabble.com/Nutch-not-fetching-all-urls-from-urlsdir-tp2109078p2109078.html
>     To start a new topic under Nutch - User, email
> [email protected]
>     To unsubscribe from Nutch - User, visit
> http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_code&node=603147&code=YW51cmFnLml0LmpvbGx5QGdtYWlsLmNvbXw2MDMxNDd8LTIwOTgzNDQxOTY=


-- 
Kumar Anurag


-----
Kumar Anurag

-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Nutch-not-fetching-all-urls-from-urlsdir-tp2109078p2109883.html
Sent from the Nutch - User mailing list archive at Nabble.com.


Reply via email to