When I run my test suite, I am seeing incorrect results from HBaseAdmin.tableExists() in both candidate 1 and candidate 2. It is sometimes returning false when it should return true. If I revert to 0.20.2, the tests run cleanly.
Paul

On Jan 18, 2010, at 9:57 AM, Jean-Daniel Cryans wrote:

> I think this is https://issues.apache.org/jira/browse/HBASE-2035 fixed
> in the upcoming 0.20.3. If you want to try it out, get the RC2 here
> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-2/
>
> J-D
>
> On Mon, Jan 18, 2010 at 3:29 AM, Gaurav Vashishth <vashgau...@gmail.com>
> wrote:
>>
>> Thanks a lot, your words have encouraged me that it is doable. I will
>> upgrade the system and re-run the test case.
>>
>> Though, I have one more query.
>>
>> When I insert the records in HBase through the Put command, I send the
>> row id as a long value like "80760057", but when I run the HBase shell
>> and scan the table I always see the value in this format:
>> \000\000\000\000\000\n\005+. Also, I can't get the value through this
>> row id even though the column qualifier has values.
>>
>>
>>
>> Ryan Rawson wrote:
>>>
>>> Hey,
>>>
>>> So there are 2 major problems here:
>>> - the setup is way off. There is no actual data duplication, for
>>> example: you will put every write to 1 machine, and when it fails,
>>> so goes your data.
>>> - These machines don't have enough ram. They must have at least
>>> 1gb/core, ideally 2gb/core or more. This means they should have 8 gb
>>> ram. crucial.com
>>>
>>> A better setup would be:
>>> - 1 "master" node, runs: hmaster, 1xzookeeper, namenode
>>> - 5 data/regionservers
>>>
>>> The key here to performance is to spread your workload over more
>>> machines. This is how clustered software works in a nutshell. Using
>>> only 1/3 of your machines for "regionservers" and 1/6th for data
>>> storage (datanode) is non-ideal.
>>>
>>> You really need to up the ram. I run:
>>> - dual quad i7s with hyper-threading, which gives 16 cores to the OS
>>> - 24 gb ram
>>> - 4 x 1tb disk
>>>
>>> My small end machines are:
>>> - dual quad xeons, 8 cores to the OS
>>> - 16 gb ram
>>> - 2 x 1tb disk
>>>
>>> For performance you really don't want to have less than 1-2gb ram per
>>> core. Without a lot of ram, you don't get effective disk caching. You
>>> can't run map-reduces on the same nodes, you may run into swap issues,
>>> etc. 4 gb ddr3 ram is about $150 usd.
>>>
>>> But given a reasonable machine set, doing 50k inserts/sec sustained
>>> over long periods of time is totally doable. You will need more than 6
>>> machines though! Don't forget your spares, since you really want to be
>>> able to operate on N-{1,2} machines so failures don't cripple you.
>>>
>>>
>>>
>>> On Mon, Jan 18, 2010 at 2:55 AM, Gaurav Vashishth <vashgau...@gmail.com>
>>> wrote:
>>>>
>>>> Using 6 machines, 8 core with 4 GB RAM, right now for setting up the
>>>> scenario.
>>>>
>>>> 2 region servers
>>>> 1 ZooKeeper
>>>> 1 Data Node
>>>> 2 Name Node
>>>>
>>>>
>>>>
>>>> Ryan Rawson wrote:
>>>>>
>>>>> How many machines do you have? I'd try at least 20+ late model boxes.
>>>>>
>>>>> On Jan 18, 2010 2:14 AM, "Gaurav Vashishth" <vashgau...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>
>>>>> I need to store live data which is about 40-50K records/sec. I have
>>>>> evaluated MySQL and am now trying HBase.
>>>>>
>>>>> Just read in docstoc that HBase insert performance, for a few 1000
>>>>> rows and 10 columns with 1 MB values, is 68ms/row. My scenario is
>>>>> similar: we need under 10k rows and 10-20 columns, which can have
>>>>> thousands of versions with values not greater than 300 bytes.
>>>>> Initially, I thought HBase could solve the purpose, but reading the
>>>>> docstoc article has put doubt in my mind.
>>>>>
>>>>> Can we get 40-50k records/sec insertion speed in HBase? Also, there
>>>>> would be thousands of users reading the database as well. Can HBase
>>>>> maintain that much speed?
>>>>>
>>>>> Thanks
>>>>> Gaurav
>>>>> --
>>>>> View this message in context:
>>>>> http://old.nabble.com/HBase-Insert-Performance-tp27208387p27208387.html
>>>>> Sent from the HBase User mailing list archive at Nabble.com.
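
On the row-key question quoted above: the escaped value shown in the shell is eight bytes long, which is what you would expect if the row id was written as a binary long rather than as text. Here is a minimal sketch of that difference, assuming the row was stored with an 8-byte big-endian encoding (the form HBase's `Bytes.toBytes(long)` produces) and looked up with the decimal string. The class and method names are hypothetical, it uses only `java.nio` so it runs without HBase on the classpath, and the `escape` helper is only a rough imitation of how a shell might print non-printable bytes.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration class; not part of HBase.
class RowKeyEncoding {
    // Encode a long as 8 big-endian bytes (ByteBuffer's default byte order).
    static byte[] longKey(long id) {
        return ByteBuffer.allocate(8).putLong(id).array();
    }

    // Encode the decimal text of the id, as a key typed in as a string would be.
    static byte[] stringKey(long id) {
        return Long.toString(id).getBytes(StandardCharsets.UTF_8);
    }

    // Printable ASCII is shown as-is; every other byte becomes a 3-digit octal escape.
    static String escape(byte[] key) {
        StringBuilder sb = new StringBuilder();
        for (byte b : key) {
            int v = b & 0xff;
            if (v >= 0x20 && v < 0x7f) {
                sb.append((char) v);
            } else {
                sb.append(String.format("\\%03o", v));
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        long id = 80760057L;
        System.out.println("long key:   " + escape(longKey(id)));
        System.out.println("string key: " + escape(stringKey(id)));
        // The two keys are different byte arrays, so a lookup with one
        // cannot find a row stored under the other.
        System.out.println("same bytes? " + Arrays.equals(longKey(id), stringKey(id)));
    }
}
```

If that assumption holds, the lookup has to use the same binary key that the write used. Typing the decimal digits as a string produces a different row key, so the row will not be found.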