[ 
https://issues.apache.org/jira/browse/SPARK-6119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Reynold Xin updated SPARK-6119:
-------------------------------
    Description: 
Real world data can be messy. An important feature of data frames is support 
for missing data. We should figure out what we want to do here.

Some ideas:

1. Support replacing all null value for a column with a fixed value.

2. Support replacing all null value for all columns with a fixed value.


  was:
Real world data can be messy. An important feature of data frames is support 
for missing data. We should figure out what we want to do here.




> better support for working with missing data
> --------------------------------------------
>
>                 Key: SPARK-6119
>                 URL: https://issues.apache.org/jira/browse/SPARK-6119
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Reynold Xin
>              Labels: DataFrame
>
> Real world data can be messy. An important feature of data frames is support 
> for missing data. We should figure out what we want to do here.
> Some ideas:
> 1. Support replacing all null value for a column with a fixed value.
> 2. Support replacing all null value for all columns with a fixed value.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to