The only delete I see is in InputSampler where it does a cleanup of
any existing partition sample files first before writing a new one.
Even then its doing an explicit file delete rather than a dir delete.

St.Ack

On Thu, Apr 28, 2011 at 4:28 PM, Stack <[email protected]> wrote:
> I took a look through the code and don't see any explicit removes and
> looking through history of changes to the file, I don't see any change
> of substance.
>
> Can you figure what is doing the delete? At what stage?  Is it as
> completebulkload runs?
>
> St.Ack
>
> On Thu, Apr 28, 2011 at 10:59 AM, Adam Phelps <[email protected]> wrote:
>> We were using a backup scheme for our system where we have map-reduce jobs
>> generating HFiles, which we then loaded using LoadIncrementalHFiles before
>> making a remote copy of them using distcp.
>>
>> However we just upgraded hbase (we're using cloudera's package, so we went
>> from CDH3B4 to CDH3U0, both of which are versions of 0.90.1), and discovered
>> that the HFiles now get deleted by the load operation.  Is this a recent
>> change?  Is there a configuration variable to revert this behavior?
>>
>> We can work around it by doing the copy before the load, but that is less
>> than optimal in our scenario as we'd prefer to have quicker access to the
>> data in HBase.
>>
>> - Adam
>>
>

Reply via email to