Hi stack
Regarding the missing block part, my bad: I didn't install this
cluster and didn't verify the ulimit first, and then a new problem arose
because of the number of xceivers. Regarding this property, I can't
find the default value in hadoop-default.xml; is this normal? I ask
only because some time ago you advised someone to increase the
dfs.datanode.max.xcievers property, but the exceptions actually refer
to an *Xceiver (notice the change in the i,e order). Anyway, the
problem seems to have gone away, so that probably fixed it.
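For anyone else who hits this, here is roughly what we ended up with; the
2048 below is just the value we tried, not a tuned recommendation, and the
property name really is spelled "xcievers" in the config even though the
exception classes say "Xceiver". In hadoop-site.xml on the datanodes:

    <property>
      <name>dfs.datanode.max.xcievers</name>
      <value>2048</value>
    </property>

We also now check the open-file limit for the hadoop/hbase user before
starting the daemons, e.g.:

    $ ulimit -n        # show the current limit
    $ ulimit -n 32768  # raise it in the startup shell (needs a matching
                       # hard limit in /etc/security/limits.conf)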
Regarding the hang part, the regionserver actually comes down (no more
JVM process); next time it happens I'll investigate further.
Btw, HBase has 2048 MB allocated; we have lots of columns but only
three CFs, and we use a home-grown crawler because we are crawling SVN,
CVS, and other exotic file systems.
Regards, and thanks for your prompt help, again :)
David Alves
On Nov 20, 2008, at 6:26 PM, stack wrote:
David Alves wrote:
Hi guys
We've got HBase (0.18.0, r695089) and Hadoop (0.18.0, r686010)
running for a while, and apart from the occasional regionserver
stopping without notice (and without explanation from what we can
see in the logs), a problem we solve easily just by restarting
it, we have now come to face a more serious problem of what I think
is data loss.
What do you think it is, David? A hang? We've seen occasional hangups
on HDFS. You could try thread-dumping and see if you can figure out
where things are blocked (you can do it in the UI on the problematic
regionserver or by sending QUIT to the JVM PID).
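For example (the PID below is made up; jstack is an alternative if the
box has a full JDK):

    $ kill -QUIT 12345                    # dump lands in the regionserver .out file
    $ jstack 12345 > /tmp/rs-threads.txt  # or capture it directly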
We use HBase as a links and documents database (similar to
Nutch) in a 3-node cluster (4GB of memory on each node); the links
database has 4 regions and the document database now has 200
regions, for a total of 216 (with meta and root).
How much RAM is allocated to HBase? Does each database have a single
family or more?
After the crawl task, which went OK (we now have 60GB/300GB
full in HDFS), we proceeded to do a full table scan to create the
indexes, and that's where things started to fail.
We are seeing a problem in the logs (at the end of this email).
This repeats until there's a RetriesExhaustedException and the task
fails in the map phase. The Hadoop fsck tool tells us that HDFS is OK.
I have yet to explore the rest of the logs for some kind of error;
I will post a new mail if I find anything.
Any help would be greatly appreciated.
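(For reference, the fsck check was run along these lines; the path and
flags below are an example invocation, not necessarily exactly what we
ran:

    $ ./bin/hadoop fsck / -files -blocks

It reported the filesystem as healthy.)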
Is this file in your HDFS:
hdfs://cyclops-prod-1:9000/hbase/document/153945136/docDatum/mapfiles/5163556575658593611/data?
If so, can you fetch it using ./bin/hadoop fs -get FILENAME?
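For example (the local destination /tmp/data is arbitrary):

    $ ./bin/hadoop fs -get hdfs://cyclops-prod-1:9000/hbase/document/153945136/docDatum/mapfiles/5163556575658593611/data /tmp/data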
What crawler are you using (out of interest)?
St.Ack