At top of my head, all I think of is getting to the hbase shell and running some queries to remove the unwanted things from the "*crawlId_w*ebpage" table. I have never done this so cant vouch if it would work well.
On Tue, Apr 30, 2013 at 5:11 AM, Bai Shen <[email protected]> wrote: > Is there a way to remove the files that fetched files from HBase after > they've been parsed? I'm running things locally and don't have the storage > space to store all of the fetched files. > > Thanks. >

