DrChainsaw opened a new issue, #435:
URL: https://github.com/apache/arrow-julia/issues/435

   I'm generating a bunch of Arrow files from the apache java implementation 
and many of them are not readable by Arrow.jl (but they are readable by the 
java implementation).
   
   When following the java decoding process in the debugger, it seems that both 
implementations agree up to the following line in the java implementation:
   
https://github.com/apache/arrow/blob/febd0ff144cfb8b2baffb1cb0be57ca40dc7cc77/java/vector/src/main/java/org/apache/arrow/vector/compression/AbstractCompressionCodec.java#L72-L75
   
   It seems like length == -1 is some kind of sentinel value for no compression 
(maybe the compressor gave up or something?) which does not seem to be handled 
in the corresponding function in Arrow.jl: 
   
https://github.com/apache/arrow-julia/blob/e893c327f177f5a4d5efeab831df0fe93ab4ec5b/src/table.jl#L521-L524
   
   I have verified that Arrow.jl indeed does read out len = -1 (which in turn 
causes an error saying `invalid Array dimensions` when creating the 
decodedbytes vector).
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to