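First, you should define what you mean by duplicate data. Depending on your definition, HBase may already handle it. For example, if "duplicate" means writing the same row key, column family, and column qualifier more than once, those writes are stored as versions of a single cell, and a read returns only the latest version by default.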

On Jan 17, 2014, at 7:39 AM, Ted Yu <[email protected]> wrote:

> Can you tell us where the duplicate data resides - between column families or
> between columns in a single column family ?
>
> Cheers
>
> On Jan 17, 2014, at 4:46 AM, oc tsdb <[email protected]> wrote:
>
>> Hi all,
>>
>> We want to know if there is any option to remove duplicate data in Hbase
>> based on column family dynamically?
>>
>> Thanks,
>> OC
