Thank you. I changed the value, but no luck. (I changed it in
runtime/local/conf/nutch-default.xml.)

<property>
  <name>http.content.limit</name>
  <value>-1</value>
  <description>The length limit for downloaded content using the http

Output --------------------


andrew@andrew-ubuntu:~/nutch/branch/runtime/local$ bin/nutch inject ../../../web_seeds
InjectorJob: starting
InjectorJob: urlDir: ../../../web_seeds
InjectorJob: finished
andrew@andrew-ubuntu:~/nutch/branch/runtime/local$ bin/nutch generate
GeneratorJob: Selecting best-scoring urls due for fetch.
GeneratorJob: starting
GeneratorJob: filtering: true
GeneratorJob: done
GeneratorJob: generated batch id: 1329930779-110515839
andrew@andrew-ubuntu:~/nutch/branch/runtime/local$ bin/nutch fetch 1329930779-110515839
FetcherJob: starting
FetcherJob : timelimit set for : -1
FetcherJob: threads: 10
FetcherJob: parsing: false
FetcherJob: resuming: false
FetcherJob: batchId: 1329930779-110515839
Using queue mode : byHost
Fetcher: threads: 10
QueueFeeder finished: total 1 records. Hit by time limit :0
fetching http://www.q1a.com/
-finishing thread FetcherThread1, activeThreads=1
-finishing thread FetcherThread2, activeThreads=1
-finishing thread FetcherThread3, activeThreads=1
-finishing thread FetcherThread4, activeThreads=1
-finishing thread FetcherThread5, activeThreads=1
-finishing thread FetcherThread6, activeThreads=1
-finishing thread FetcherThread7, activeThreads=1
-finishing thread FetcherThread8, activeThreads=1
-finishing thread FetcherThread9, activeThreads=1
-activeThreads=1, spinWaiting=0, fetchQueues= 1, fetchQueues.totalSize=0
-finishing thread FetcherThread0, activeThreads=0
-activeThreads=0, spinWaiting=0, fetchQueues= 0, fetchQueues.totalSize=0
-activeThreads=0
FetcherJob: done
andrew@andrew-ubuntu:~/nutch/branch/runtime/local$ bin/nutch parse 1329930779-110515839
ParserJob: starting
ParserJob: resuming:    false
ParserJob: forced reparse:      false
ParserJob: batchId:     1329930779-110515839
Parsing http://www.q1a.com/
Skipping http://www.q1a.com/q1a; different batch id
    <-- Why does it say "Skipping" here?
ParserJob: success
andrew@andrew-ubuntu:~/nutch/branch/runtime/local$ bin/nutch updatedb
DbUpdaterJob: starting
DbUpdaterJob: done



************************ The first iteration ****************************

andrew@andrew-ubuntu:~/nutch/branch/runtime/local$ bin/nutch generate
GeneratorJob: Selecting best-scoring urls due for fetch.
GeneratorJob: starting
GeneratorJob: filtering: true
GeneratorJob: done
GeneratorJob: generated batch id: 1329930901-1268252438
andrew@andrew-ubuntu:~/nutch/branch/runtime/local$ bin/nutch fetch 1329930901-1268252438
FetcherJob: starting
FetcherJob : timelimit set for : -1
FetcherJob: threads: 10
FetcherJob: parsing: false
FetcherJob: resuming: false
FetcherJob: batchId: 1329930901-1268252438
Using queue mode : byHost
Fetcher: threads: 10
QueueFeeder finished: total 1 records. Hit by time limit :0
fetching http://www.q1a.com/q1a
-finishing thread FetcherThread1, activeThreads=1
-finishing thread FetcherThread2, activeThreads=1
-finishing thread FetcherThread3, activeThreads=1
-finishing thread FetcherThread4, activeThreads=1
-finishing thread FetcherThread5, activeThreads=1
-finishing thread FetcherThread6, activeThreads=1
-finishing thread FetcherThread7, activeThreads=1
-finishing thread FetcherThread8, activeThreads=1
-finishing thread FetcherThread9, activeThreads=1
-finishing thread FetcherThread0, activeThreads=0
-activeThreads=0, spinWaiting=0, fetchQueues= 0, fetchQueues.totalSize=0
-activeThreads=0
FetcherJob: done


I stopped here, since it is not doing what it is supposed to do. Please advise.
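
For anyone who wants to reproduce this, the command sequence above can be sketched as a small script. It is only a sketch under a few assumptions: the helper names extract_batch_id and run_cycle are made up for illustration and are not part of Nutch, and it assumes that bin/nutch generate prints the "GeneratorJob: generated batch id: ..." line exactly as in the output above (stderr is merged in case the line is logged there).

```shell
#!/usr/bin/env bash
# Sketch only: repeats the generate/fetch/parse/updatedb steps shown in the log.
# extract_batch_id and run_cycle are hypothetical helper names, not Nutch commands.

# Read GeneratorJob output on stdin and print the batch id, assuming the
# "GeneratorJob: generated batch id: <id>" line format seen in the log above.
extract_batch_id() {
  sed -n 's/^GeneratorJob: generated batch id: //p' | tail -n 1
}

# Run one crawl cycle; must be called from the runtime/local directory.
run_cycle() {
  local batch
  # Merge stderr into stdout in case the batch id line is written there.
  batch="$(bin/nutch generate 2>&1 | extract_batch_id)"
  if [ -z "$batch" ]; then
    echo "no batch id found in generate output" >&2
    return 1
  fi
  bin/nutch fetch "$batch" &&
    bin/nutch parse "$batch" &&
    bin/nutch updatedb
}

# Usage, from runtime/local (matches the two rounds in the log):
#   bin/nutch inject ../../../web_seeds && run_cycle && run_cycle
```

Passing the same batch id to both fetch and parse mirrors what was typed by hand above; nothing else about the run is changed.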

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Please-help-Nutch-fetch-command-not-fetching-data-tp3764751p3767125.html
Sent from the Nutch - User mailing list archive at Nabble.com.
