Hi Ryan, You can call setBatch() on the scan to get partial rows returned in your map task. (The Result object won't be guaranteed to have all cells for a given row.)
On Mon, May 11, 2015 at 6:24 PM, Webb, Ryan L. <[email protected]> wrote: > Looks like the duplicate made it through. > > Sorry about that! > > -----Original Message----- > From: Webb, Ryan L. [mailto:[email protected]] > Sent: Monday, May 11, 2015 11:15 AM > To: [email protected] > Subject: Mapping Over Cells > > Hello, > > We have a table in HBase that has very large rows and it goes OOM when the > table mapper attempts to read the entire row into a result. > > We would like to be able to map over each Cell in the table as a solution > and it is what we are doing in the map anyway. > Is this possible? Like the default behavior for Accumulo? > > We looked at the settings on Scan and didn't really see anything and the > source code of Result looks like it wraps an array of cells so the data is > already loaded at that point. > We are using HBase .98.1 and Hadoop 2 APIs > > Thanks > Ryan Webb >
