[
https://issues.apache.org/jira/browse/ARROW-740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rok Mihevc updated ARROW-740:
-----------------------------
External issue URL: https://github.com/apache/arrow/issues/15377
> FileReader fails for large objects
> ----------------------------------
>
> Key: ARROW-740
> URL: https://issues.apache.org/jira/browse/ARROW-740
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++
> Reporter: Philipp Moritz
> Priority: Major
> Fix For: 0.3.0
>
>
> Trying to serialize a large arrow array (around 2**30 entries) I get a
> non-success status when trying to use the FileReader to read the array:
> "Bad status: Invalid: flatbuffer size 0 invalid. File offset: 660, metadata
> length: 0"
> How to reproduce:
> Check out the branch arrow-large-objects from
> https://github.com/pcmoritz/ray-1, and follow
> http://ray.readthedocs.io/en/latest/install-on-ubuntu.html with that branch.
> Then run
> {{python test/jenkins_tests/multi_node_tests/large_memory_test.py}}
> in the ray root directory.
> Most likely there is some int32_t somewhere that overflows, but I haven't
> been able to track it down. The only int32_ts that are used by the FileReader
> seem to be for the flatbuffer metadata size, which should be small.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)