[ 
https://issues.apache.org/jira/browse/SPARK-6119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Reynold Xin updated SPARK-6119:
-------------------------------
    Summary: DataFrame.dropna support  (was: better support for working with 
missing data)

> DataFrame.dropna support
> ------------------------
>
>                 Key: SPARK-6119
>                 URL: https://issues.apache.org/jira/browse/SPARK-6119
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Reynold Xin
>              Labels: DataFrame
>
> Real-world data can be messy, so an important feature of data frames is
> support for missing data. We should decide what to support here.
> Some ideas:
> 1. Support replacing all null values in a column (or in all columns) with a
> fixed value.
> 2. Support dropping rows with null values (dropna).
> 3. Support replacing a set of values with another set of values (i.e. a map
> join).
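
The three ideas above can be sketched in plain Python over rows held as
dictionaries. This is only an illustration of the intended semantics, not
Spark code: the helper names fill_nulls, drop_nulls and replace_values, their
parameters, and the sample rows are all hypothetical, and None is assumed to
stand in for a SQL null.

```python
from typing import Any, Dict, List, Optional, Sequence

Row = Dict[str, Any]  # hypothetical row representation: column name -> value


def fill_nulls(rows: List[Row], value: Any,
               columns: Optional[Sequence[str]] = None) -> List[Row]:
    """Idea 1: replace None in the given columns (all columns if omitted)."""
    out = []
    for row in rows:
        targets = list(row.keys()) if columns is None else columns
        new_row = dict(row)  # copy so the input rows are left untouched
        for col in targets:
            if col in new_row and new_row[col] is None:
                new_row[col] = value
        out.append(new_row)
    return out


def drop_nulls(rows: List[Row],
               columns: Optional[Sequence[str]] = None) -> List[Row]:
    """Idea 2: keep only rows with no None in the given columns."""
    kept = []
    for row in rows:
        targets = list(row.keys()) if columns is None else columns
        # A column absent from the row is treated as null here (an assumption).
        if all(row.get(col) is not None for col in targets):
            kept.append(dict(row))
    return kept


def replace_values(rows: List[Row], mapping: Dict[Any, Any],
                   columns: Optional[Sequence[str]] = None) -> List[Row]:
    """Idea 3: map each value found in `mapping` to its replacement."""
    out = []
    for row in rows:
        targets = list(row.keys()) if columns is None else columns
        new_row = dict(row)
        for col in targets:
            if col in new_row:
                current = new_row[col]
                new_row[col] = mapping.get(current, current)
        out.append(new_row)
    return out


# Made-up sample data for illustration only.
rows = [
    {"name": "alice", "age": 30, "city": "NYC"},
    {"name": "bob", "age": None, "city": "N/A"},
    {"name": None, "age": 25, "city": "SF"},
]

filled = fill_nulls(rows, 0, columns=["age"])
complete = drop_nulls(rows)
cleaned = replace_values(rows, {"N/A": "unknown"}, columns=["city"])
```

Each helper returns new rows instead of mutating its input, which matches the
immutable style of data frame transformations. Whether a real dropna should
drop a row when any column is null or only when all are null is a design
choice this sketch does not settle; it uses the "any" rule.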
