[ 
https://issues.apache.org/jira/browse/ARROW-13028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397395#comment-17397395
 ] 

Nate Clark commented on ARROW-13028:
------------------------------------

>> Is this by design or accident?

> I would say by accident, but I'm not sure what you mean with "the precision 
> of the string is too much for a float". Strictly speaking, some very short 
> decimal numbers are not exactly representable in binary floating-point, for 
> example "0.3". Should we reject them?

I was thinking something like `3.78946546156984798497501e10` can be better 
represented as double than a float. But as you point out there are values which 
cannot be fully represented in either, so there might not be a good way to 
detect when double should be used instead of float.

> [C++] CSV add convert option to attempt 32bit number inferences
> ---------------------------------------------------------------
>
>                 Key: ARROW-13028
>                 URL: https://issues.apache.org/jira/browse/ARROW-13028
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Nate Clark
>            Assignee: Nate Clark
>            Priority: Major
>
> When types are being inferred by CSV the numbers are always 64 bit. For large 
> data sets it could be better to use 32 bit types to save over all memory. To 
> do this it would be useful to add an option to ConvertOptions to try 32 bit 
> numbers before 64 bit. By default this option would be disabled.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to