Put in spill() and getMemorySize() implementations in HCatArrayBag
------------------------------------------------------------------

                 Key: HCATALOG-82
                 URL: https://issues.apache.org/jira/browse/HCATALOG-82
             Project: HCatalog
          Issue Type: Improvement
            Reporter: Sushanth Sowmyan


HCatArrayBag is an implementation of the pig DataBag that can take a trivial 
array of objects and expose DataBag semantics on it, so as to allow it to be 
part of a Tuple, but yet not require opening and repackaging. It was intended 
to be a very thin, fast implementation, but the lack of proper 
spill()/getMemorySize() implementations means that Pig is unable to manage 
memory efficiently when dealing with these.

Given that we're trying to push HCat to be a default storage mechanism over all 
other earlier forms, we should ensure that these jagged edges are smoothed out 
- we need to put in a proper implementation here.

(Note also : There are a couple of other TODOs on this - they could use fixing 
too. :) )

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to