Re: issues copying data from one table to another

2012-08-21 Thread Norbert Burger
On Fri, Aug 17, 2012 at 4:09 PM, anil gupta anilgupt...@gmail.com wrote: If you want to customize the bulkloader then you can write your own mapper to define the business logic for loading. You need to specify the mapper at the time of running importsv by using: Thanks, Anil. I had that seen

Re: issues copying data from one table to another

2012-08-21 Thread Norbert Burger
On Sat, Aug 18, 2012 at 7:14 AM, Michael Segel michael_se...@hotmail.com wrote: Thanks. Just out of curiosity, what would happen if you could disable the table, alter the table's max file size and then attempted to merge regions? Note: I've never tried this, don't know if its possible,

Re: issues copying data from one table to another

2012-08-18 Thread Michael Segel
Can you disable the table? How much free disk space do you have? Is this a production cluster? Can you upgrade to CDH3u5? Are you running a capacity scheduler or fair scheduler? Just out of curiosity, what would happen if you could disable the table, alter the table's max file size and then

issues copying data from one table to another

2012-08-17 Thread Norbert Burger
Hi folks -- we're running CDH3u3 (0.90.4). I'm trying export data from an existing table that has far too many regions (2600+ for only 8 regionservers) into one with a more reasonable region count for this cluster (256). Overall data volume is approx. 3 TB. I thought initially that I'd use the

Re: issues copying data from one table to another

2012-08-17 Thread anil gupta
Hi Norbert, If you want to customize the bulkloader then you can write your own mapper to define the business logic for loading. You need to specify the mapper at the time of running importsv by using: -Dimporttsv.mapper.class=my.Mapper property. Refer to this link: