GitHub user falaki opened a pull request:
https://github.com/apache/spark/pull/11947
[SPARK-14143] Options for parsing NaNs, Infinity and nulls for numeric types
## What changes were proposed in this pull request?
1. Adds the following options for parsing type-specific nulls to the CSV data source:
* byteNullValue
* integerNullValue
* shortNullValue
* longNullValue
* floatNullValue
* doubleNullValue
* decimalNullValue
2. Adds the following options for parsing NaNs:
* floatNaNValue
* doubleNaNValue
3. Adds the following options for parsing infinity:
* floatNegativeInf
* floatPositiveInf
* doubleNegativeInf
* doublePositiveInf
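As a rough illustration of how the double-related options above could drive parsing, here is a minimal, self-contained sketch. It is not the code from this pull request: the `NumericOptionsSketch` object, the `DoubleOptions` case class, the `castToDouble` helper, and all default token values are hypothetical and chosen only for the example. Only the four option names are taken from the lists above.

```scala
// Hypothetical sketch, not the implementation in this pull request.
object NumericOptionsSketch {

  // Field names mirror the proposed option names; the defaults are assumptions.
  final case class DoubleOptions(
      doubleNullValue: String = "",
      doubleNaNValue: String = "NaN",
      doubleNegativeInf: String = "-Inf",
      doublePositiveInf: String = "Inf")

  // Returns None when the token is the configured null marker,
  // otherwise Some(parsed value). Any other malformed token makes
  // String.toDouble throw NumberFormatException.
  def castToDouble(datum: String, opts: DoubleOptions): Option[Double] =
    if (datum == opts.doubleNullValue) None
    else if (datum == opts.doubleNaNValue) Some(Double.NaN)
    else if (datum == opts.doubleNegativeInf) Some(Double.NegativeInfinity)
    else if (datum == opts.doublePositiveInf) Some(Double.PositiveInfinity)
    else Some(datum.toDouble)

  def main(args: Array[String]): Unit = {
    // Custom tokens, as a CSV file with non-standard markers might use.
    val opts = DoubleOptions(
      doubleNullValue = "--",
      doubleNaNValue = "nan",
      doubleNegativeInf = "-inf",
      doublePositiveInf = "+inf")
    Seq("1.5", "--", "nan", "-inf", "+inf").foreach { token =>
      println(s"$token -> ${castToDouble(token, opts)}")
    }
  }
}
```

The special tokens are checked before the generic numeric parse, so a file can use markers that would otherwise fail to parse as numbers. The other numeric types would presumably follow the same pattern with their own null options.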
## How was this patch tested?
`TypeCast.castTo` is unit tested, and an end-to-end test is added to
`CSVSuite`.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/falaki/spark SPARK-14143
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/11947.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #11947
----
commit 93ac6bb3eb63efb775b48af090a37a6cbe4f30c4
Author: Hossein <[email protected]>
Date: 2016-03-24T23:31:38Z
Added support for null, NaN and Inf options for numeric types
----