[
https://issues.apache.org/jira/browse/CRUNCH-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Micah Whitacre updated CRUNCH-534:
----------------------------------
Description:
With HBase's switch to ProtocolBuffers there is now an imposed restriction on
the size of data (64MB) which can be stored in protocol buffers and therefore
Cells/Puts/etc.
{quote}
Call to failed on local exception:
com.google.protobuf.InvalidProtocolBufferException: Protocol message was too
large. May be malicious. Use CodedInputStream.setSizeLimit() to increase the
size limit.
{quote}
While a single cell being that big seems extreme this limit also counts towards
Result objects so an entire row.
It is definitely an HBase problem[1] but Crunch also might have changes needed
since we use protobuf serialization in our HBaseTypes.[2] So might need to
make sure error isn't only on our side as they work through the issue or might
need to look at MOB support[3].
[1] - https://issues.apache.org/jira/browse/HBASE-13825
[2] -
https://github.com/apache/crunch/blob/d176778cf803374506cb7743069a05e28e07e2cf/crunch-hbase/src/main/java/org/apache/crunch/io/hbase/HBaseTypes.java#L33
[3] - https://issues.apache.org/jira/browse/HBASE-11339
was:
With HBase's switch to ProtocolBuffers there is now an imposed restriction on
the size of data which can be stored in protocol buffers and therefore
Cells/Puts/etc.
{quote}
Call to failed on local exception:
com.google.protobuf.InvalidProtocolBufferException: Protocol message was too
large. May be malicious. Use CodedInputStream.setSizeLimit() to increase the
size limit.
{quote}
It is definitely an HBase problem[1] but Crunch also might have changes needed
since we use protobuf serialization in our HBaseTypes.[2] So might need to
make sure error isn't only on our side as they work through the issue or might
need to look at MOB support[3].
[1] - https://issues.apache.org/jira/browse/HBASE-13825
[2] -
https://github.com/apache/crunch/blob/d176778cf803374506cb7743069a05e28e07e2cf/crunch-hbase/src/main/java/org/apache/crunch/io/hbase/HBaseTypes.java#L33
[3] - https://issues.apache.org/jira/browse/HBASE-11339
> Protobuf Size Limit Exception
> ------------------------------
>
> Key: CRUNCH-534
> URL: https://issues.apache.org/jira/browse/CRUNCH-534
> Project: Crunch
> Issue Type: Bug
> Components: IO
> Reporter: Micah Whitacre
>
> With HBase's switch to ProtocolBuffers there is now an imposed restriction on
> the size of data (64MB) which can be stored in protocol buffers and therefore
> Cells/Puts/etc.
> {quote}
> Call to failed on local exception:
> com.google.protobuf.InvalidProtocolBufferException: Protocol message was too
> large. May be malicious. Use CodedInputStream.setSizeLimit() to increase
> the size limit.
> {quote}
> While a single cell being that big seems extreme this limit also counts
> towards Result objects so an entire row.
> It is definitely an HBase problem[1] but Crunch also might have changes
> needed since we use protobuf serialization in our HBaseTypes.[2] So might
> need to make sure error isn't only on our side as they work through the issue
> or might need to look at MOB support[3].
> [1] - https://issues.apache.org/jira/browse/HBASE-13825
> [2] -
> https://github.com/apache/crunch/blob/d176778cf803374506cb7743069a05e28e07e2cf/crunch-hbase/src/main/java/org/apache/crunch/io/hbase/HBaseTypes.java#L33
> [3] - https://issues.apache.org/jira/browse/HBASE-11339
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)