HBase question on HRegions server

2007-11-01 Thread Bin YANG
Hi, I am confused with some thing in HBase. 1. All data is stored in HDFS. Data is served to clients by HRegionServers. Is it allowed that the tablet T is on machine A, and served by a HRegionServers running on machine B? What information does the META table maintain? The map from T to the

Re: Very weak mapred performance on small clusters with a massive amount of small files

2007-11-01 Thread Enis Soztutar
Hi, I think you should try using MultiFileInputFormat/MultiFileInputSplit rather than FileSplit, since the former is optimized for processing large number of files. Could you report you numMaps and numReduces and the avarage time the map() function is expected to take. André Martin wrote:

HDFS filesystem for JackRabbit

2007-11-01 Thread Eugeny N Dzhurinsky
Hello! We would like to use HDFS for our software, which software will be extended to use a cluster later. For now we would like to just create an implementation of file system interface for JackRabbit. We found how can we implement this using Hadoop part for HDFS, however it's still not clear

Re: HBase question on HRegions server

2007-11-01 Thread Michael Stack
Bin YANG wrote: Hi, I am confused with some thing in HBase. 1. All data is stored in HDFS. Data is served to clients by HRegionServers. Is it allowed that the tablet T is on machine A, and served by a HRegionServers running on machine B? Yes, tablet T may be hosted in HDFS on machine A

RE: HBase question on HRegions server

2007-11-01 Thread Jim Kellerman
-Original Message- From: Bin YANG [mailto:[EMAIL PROTECTED] Sent: Thursday, November 01, 2007 3:06 AM To: hadoop-user@lucene.apache.org Subject: HBase question on HRegions server Hi, I am confused with some thing in HBase. 1. All data is stored in HDFS. Data is served to clients

Re: clarification on running a job locally

2007-11-01 Thread Jim the Standing Bear
Hi Chris, I meant Local Runner. From the sounds of your email, it seems that modifying the two fore-mentioned properties is not enough to get a cluster node to run as a local runner? Also, what about including .xml to the class path? Do I include them like jar files? My ant script has

Re: clarification on running a job locally

2007-11-01 Thread Ted Dunning
It is definitely easier to build a jar and use the hadoop script. You can do it yourself, though. Just duplicate the line in bin/hadoop that runs java and prefix it with echo to see what is happening. On 11/1/07 1:37 PM, Jim the Standing Bear [EMAIL PROTECTED] wrote: Hi Ted, It is funny

Re: clarification on running a job locally

2007-11-01 Thread Jim the Standing Bear
Thanks Ted. just as I thought. On 11/1/07, Ted Dunning [EMAIL PROTECTED] wrote: It is definitely easier to build a jar and use the hadoop script. You can do it yourself, though. Just duplicate the line in bin/hadoop that runs java and prefix it with echo to see what is happening. On

Re: NoSuchElementException when creating a table

2007-11-01 Thread Michael Stack
Thanks for the detail Holger. Helps. Reading it, it looks like the cluster hasn't started up properly; the NoSuchElementException would seem to indicate that the basic startup deploying the catalog meta tables hasn't happened or has gotten mangled somehow. Whats in your hbase master log

Re: NoSuchElementException when creating a table

2007-11-01 Thread Holger Stenzhorn
Hi, Of course I yest another newbie but at least I have read the 10 minutes introduction... :-) So I am running HBase on a local filesystem. Attached you can find the (hopefully) necessary part of the master-log-file. Does not look to bad, right? BUT in the regionserver-log I get the

Re: HBase question on HRegions server

2007-11-01 Thread Bin YANG
Thank you very much Michael and Jim! That means the master does not maintain the mapping from HRegion to HRegionServer. And the mapping from HRegion to HRegionServer is in the META and ROOT. Is it right? So if a client want to read a tablet, it should first find the ROOT, find corresponding

RE: HBase question on HRegions server

2007-11-01 Thread Jim Kellerman
-Original Message- From: Bin YANG [mailto:[EMAIL PROTECTED] Sent: Thursday, November 01, 2007 6:59 PM To: hadoop-user@lucene.apache.org Subject: Re: HBase question on HRegions server Thank you very much Michael and Jim! That means the master does not maintain the mapping from

RE: NoSuchElementException when creating a table

2007-11-01 Thread Jim Kellerman
C:\hadoop is my installation C:\workspace\hadoop-commit is my checked out SVN tree which is current with trunk. /cygdrive/c$ diff hadoop/conf/hadoop-site.xml workspace/hadoop-commit/conf 7,17c7 property namehadoop.tmp.dir/name valueC:\hadoop\tmp/value descriptionA base for other