GitHub user zhengruifeng opened a pull request:

    https://github.com/apache/spark/pull/19084

    [SPARK-20711][ML] MultivariateOnlineSummarizer incorrect min/max for NaN value

    ## What changes were proposed in this pull request?
    
    The current implementation of min/max ignores `NaN`.
    For a column containing only `NaN`, `min` returns `Double.MaxValue` and
    `max` returns `Double.MinValue`.
    
    min/max for a column containing `NaN` should return `NaN`.
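
To illustrate the behaviour described above, here is a minimal Python sketch. It is not the Spark code: the class names `NaiveMinMax` and `NanAwareMinMax` are hypothetical, and `sys.float_info.max` / `-sys.float_info.max` stand in for `Double.MaxValue` / `Double.MinValue`. It assumes the running min/max are updated with plain `<` / `>` comparisons, which are always false when one operand is `NaN`, so the initial sentinels survive.

```python
import math
import sys


class NaiveMinMax:
    """Running min/max using plain comparisons (hypothetical illustration)."""

    def __init__(self):
        # Sentinels analogous to Double.MaxValue and Double.MinValue.
        self.cur_min = sys.float_info.max
        self.cur_max = -sys.float_info.max

    def add(self, value):
        # Any comparison involving NaN is False, so NaN never updates either bound.
        if value < self.cur_min:
            self.cur_min = value
        if value > self.cur_max:
            self.cur_max = value
        return self


class NanAwareMinMax(NaiveMinMax):
    """Variant that propagates NaN, matching the behaviour the PR asks for."""

    def add(self, value):
        if math.isnan(value):
            # Once a NaN is seen, both bounds become NaN and stay NaN,
            # because later comparisons against NaN are always False.
            self.cur_min = math.nan
            self.cur_max = math.nan
            return self
        return super().add(value)


if __name__ == "__main__":
    naive = NaiveMinMax()
    aware = NanAwareMinMax()
    for v in [math.nan, math.nan]:
        naive.add(v)
        aware.add(v)
    print("naive:", naive.cur_min, naive.cur_max)
    print("aware:", aware.cur_min, aware.cur_max)
```

With an all-`NaN` column, the naive tracker still holds its initial sentinels, while the NaN-aware variant reports `NaN` for both bounds.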
    
    ## How was this patch tested?
    Existing tests.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zhengruifeng/spark min_max

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/19084.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #19084
    
----
commit f6c75b918dd4fd8848fc22989309e317fc101dc6
Author: Zheng RuiFeng <ruife...@foxmail.com>
Date:   2017-08-30T02:01:55Z

    create pr

----

