[ https://issues.apache.org/jira/browse/CRUNCH-534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14626838#comment-14626838 ]
Esteban Gutierrez commented on CRUNCH-534:
------------------------------------------

Crunch is only importing those dependencies, and it should be agnostic about how HBase handles that internally. I have created HBASE-14076 to fix this in HBase.

> Protobuf Size Limit Exception
> -----------------------------
>
>                 Key: CRUNCH-534
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-534
>             Project: Crunch
>          Issue Type: Bug
>          Components: IO
>            Reporter: Micah Whitacre
>
> With HBase's switch to ProtocolBuffers there is now an imposed restriction on
> the size of data (64MB) which can be stored in protocol buffers and therefore
> in Cells/Puts/etc.
> {quote}
> Call to failed on local exception:
> com.google.protobuf.InvalidProtocolBufferException: Protocol message was too
> large. May be malicious. Use CodedInputStream.setSizeLimit() to increase
> the size limit.
> {quote}
> While a single cell being that big seems extreme, this limit also counts
> towards Result objects, and so applies to an entire row.
> It is definitely an HBase problem[1], but Crunch might also need changes
> since we use protobuf serialization in our HBaseTypes.[2] So we might
> need to make sure the error isn't only on our side as they work through the
> issue, or we might need to look at MOB support[3].
> [1] - https://issues.apache.org/jira/browse/HBASE-13825
> [2] - https://github.com/apache/crunch/blob/d176778cf803374506cb7743069a05e28e07e2cf/crunch-hbase/src/main/java/org/apache/crunch/io/hbase/HBaseTypes.java#L33
> [3] - https://issues.apache.org/jira/browse/HBASE-11339
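
As a rough illustration of why the 64MB limit described in the issue can be hit by a whole row even when no single cell is that large, here is a minimal sketch in plain Java. It does not use the protobuf, HBase, or Crunch APIs: `RowSizeCheck`, `rowSize`, and `exceedsLimit` are hypothetical names made up for this example, and the check ignores protobuf framing overhead, so the exact boundary behaviour is only an approximation.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper for illustration; not part of Crunch, HBase, or protobuf. */
final class RowSizeCheck {
    // 64 MB, the size limit cited in the issue description.
    static final long SIZE_LIMIT_BYTES = 64L * 1024 * 1024;

    /** Sums the serialized sizes (in bytes) of all cells in one row. */
    static long rowSize(List<Long> cellSizes) {
        long total = 0;
        for (long size : cellSizes) {
            total += size;
        }
        return total;
    }

    /** True if the row as a whole is larger than the limit (overhead ignored). */
    static boolean exceedsLimit(List<Long> cellSizes) {
        return rowSize(cellSizes) > SIZE_LIMIT_BYTES;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024;
        // Three cells, each well under 64 MB, totalling 70 MB for the row.
        List<Long> row = Arrays.asList(30 * mb, 30 * mb, 10 * mb);
        System.out.println(exceedsLimit(row)); // prints "true"
    }
}
```

A pre-flight check along these lines could, under those assumptions, flag rows likely to trigger the InvalidProtocolBufferException before they are serialized; it is not a substitute for the HBase-side fix.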