Dear Amed,

Some time ago I stumbled on a similar problem and started a thread on the
Nutch users list:

http://www.mail-archive.com/[email protected]/msg14560.html

(The thread includes a fix for parse-pdf as well as for PDFBox.)

Maybe that's related, maybe not; it depends on the version of Nutch you use.
PDFBox is now (Nutch 1.1 and upwards) used via Tika. I haven't observed my
problem in recent versions of Nutch (1.2 and 1.3).

Sebastian


On 04/12/2011 04:41 PM, [email protected] wrote:
Hello,

Can anybody please tell me what the problem is? Nothing more happens in the
shell (it has stopped).

Here is my output:

-activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0
-activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0
-activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0
-activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0
-finishing thread FetcherThread, activeThreads=0
-activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0
-activeThreads=0

I have crawled a website with -depth 10 -topN 500, and the site has a lot of
PDFs (the parsing works perfectly).

Any help, please?

Amed
