Interesting. Which exact 0.98 release are you using ?
Can you inspect logs to see when the duplicate HFiles were introduced (during one bulk load run or multiple bulk load runs) ? bq. Will a compaction eventually take care of this? I think so. Thanks On Wed, Dec 9, 2015 at 7:18 AM, Anthony Nguyen <[email protected]> wrote: > Hi all, > > Having duplicate HFiles within a region should result in no change to the > data, correct? The reason I ask is because I'm seeing duplicate HFiles > being created during a bulk load - they have the same row count, same size, > and same firstKey and lastKey. Is this normal behavior? Will a compaction > eventually take care of this? > > Thanks! >
