[ 
https://issues.apache.org/jira/browse/ARROW-1389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16137457#comment-16137457
 ] 

Philipp Moritz commented on ARROW-1389:
---------------------------------------

For the use case I described above, almost anything would work, I think; I'm 
not familiar with which standard is best to use. There are also the really 
simple string-based formats that GMP has: 
https://gmplib.org/manual/I_002fO-of-Integers.html (which are probably less 
efficient than VLQ).
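
To make the comparison concrete, here is a minimal sketch of a VLQ-style 
encoding for non-negative integers (7 payload bits per byte, high bit set on 
every byte except the last). The function names vlq_encode and vlq_decode are 
made up for illustration and are not part of Arrow or GMP; negative values 
would need a separate sign convention that this sketch does not pick.

```python
def vlq_encode(n: int) -> bytes:
    """Encode a non-negative int as big-endian base-128 groups (VLQ style)."""
    if n < 0:
        raise ValueError("this sketch handles non-negative integers only")
    # The least significant group comes first and has no continuation bit.
    groups = [n & 0x7F]
    n >>= 7
    while n:
        # Every more significant group carries the continuation bit 0x80.
        groups.append((n & 0x7F) | 0x80)
        n >>= 7
    # Emit the most significant group first.
    return bytes(reversed(groups))


def vlq_decode(data: bytes) -> int:
    """Decode a single VLQ-encoded value produced by vlq_encode."""
    n = 0
    for b in data:
        n = (n << 7) | (b & 0x7F)
    return n
```

A decimal string spends a full byte per digit (a bit over 3 bits of 
information), while this layout carries 7 bits per byte, which is where the 
efficiency difference comes from.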

A stopgap solution, if we don't want to make decisions right now, is to use a 
StructArray of strings that encodes each integer as a string with whatever 
_PyLong_AsByteArray (see 
http://svn.python.org/projects/python/tags/r23a1/Include/longobject.h) returns.
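
As a rough Python-level illustration of that stopgap: int.to_bytes and 
int.from_bytes are the closest pure-Python counterparts to the C-level 
_PyLong_AsByteArray, so the round trip could look something like the sketch 
below. The helper names, the little-endian byte order, and the signed 
two's-complement layout are assumptions made for this example, not a settled 
format.

```python
import uuid


def int_to_bytes(n: int) -> bytes:
    """Serialize an arbitrary-precision int as signed little-endian bytes."""
    # bit_length() ignores the sign, so reserve one extra bit for it.
    length = (n.bit_length() + 8) // 8
    return n.to_bytes(length, byteorder="little", signed=True)


def int_from_bytes(data: bytes) -> int:
    """Inverse of int_to_bytes."""
    return int.from_bytes(data, byteorder="little", signed=True)


# A UUID is backed by a 128-bit integer, which is the motivating case here.
u = uuid.uuid4()
payload = int_to_bytes(u.int)
restored = uuid.UUID(int=int_from_bytes(payload))
```

Each payload is then just an opaque byte string, so it can be stored in any 
variable-length string or binary column until a proper integer type exists.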

> Support arbitrary precision integers
> ------------------------------------
>
>                 Key: ARROW-1389
>                 URL: https://issues.apache.org/jira/browse/ARROW-1389
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: Philipp Moritz
>            Priority: Minor
>
> For Python serialization it would be great if we had Arrow support for 
> arbitrary precision integers, see the comment in
> https://github.com/apache/arrow/blob/de7c6715ba244e119913bfa31b8de46dbbd450bf/python/pyarrow/tests/test_serialization.py#L183
> Long integers are for example used in the uuid python module and having this 
> would increase serialization performance for uuids and also make the code 
> cleaner.
> I wonder if this is more generally useful too, any thoughts?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
