Hi Kshitij, On Mon, Feb 15, 2016 at 1:02 AM, <user-digest-h...@nutch.apache.org> wrote:
> From: Kshitij Shukla <kshiti...@cisinlabs.com> > To: Nutch User <user@nutch.apache.org>, Hbase User <u...@hbase.apache.org> > Cc: > Date: Mon, 15 Feb 2016 14:31:39 +0530 > Subject: [CIS-CMMI-3] ScannerTimeoutException: 157036ms passed since the > last invocation, timeout is currently set to 60000 > Hello everyone, > > During a very large crawl when indexing to Solr this will yield the > following exception: > > **********************************************************START > Parsing : > /home/c1/apache-nutch-2.3.1/runtime/deploy/bin/nutch parse -D > mapred.reduce.tasks=2 -D mapred.child.java.opts=-Xmx1000m -D > mapred.reduce.tasks.speculative.execution=false -D > mapred.map.tasks.speculative.execution=false -D > mapred.compress.map.output=true -D mapred.skip.attempts.to.start.skipping=2 > -D mapred.skip.map.max.skip.records=1 1455285944-9889 -crawlId 4 First thing is first, which version of HBase are you working with? The Nutch 2.3.1 search stack can be seen at the release announcement http://nutch.apache.org/#21-january-2016-nutch-231-release Once we can sort this out and confirm your software versions are compatible then we can move on to debugging the issue. Thanks Lewis