For my scans I have caching disabled. There must be something else producing significant amounts of garbage.

----- Original Message -----
From: Dhruba Borthakur <[email protected]>
To: [email protected]
Cc: lars hofhansl <[email protected]>
Sent: Thursday, December 1, 2011 3:19 PM
Subject: Re: Size of KeyValue

Most of our "garbage" is from block cache, not directly from the KVs. Is that what you see?

thanks,
dhruba

On Thu, Dec 1, 2011 at 11:06 AM, Stack <[email protected]> wrote:

> On Thu, Dec 1, 2011 at 10:57 AM, lars hofhansl <[email protected]> wrote:
> > To try this out I changed the server side code to keep track of the
> > latest KV rather than the row, and also remove the caching from KV.
> > The difference for GC and runtime is within the noise. I have to
> > conclude that allocating KVs is just not a big problem compared to
> > other garbage being produced during scans.
> >
>
> Ain't all our garbage KVs? (Though I suppose sometimes its blocks of
> KVs whether mslab or blocks from hdfs).
> St.Ack
>

--
Subscribe to my posts at http://www.facebook.com/dhruba
