[
https://issues.apache.org/jira/browse/SPARK-13672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13672:
---
Shepherd: Nick Pentreath
Assignee: zhengruifeng
> Add python examples of BisectingKMeans
[
https://issues.apache.org/jira/browse/SPARK-13629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13629:
---
Shepherd: Nick Pentreath
> Add binary toggle Param to CountVectorizer
>
[
https://issues.apache.org/jira/browse/SPARK-13629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13629:
---
Assignee: yuhao yang
> Add binary toggle Param to CountVectorizer
>
[
https://issues.apache.org/jira/browse/SPARK-13706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath resolved SPARK-13706.
Resolution: Fixed
Fix Version/s: 2.0.0
Issue resolved by pull request 11547
[
https://issues.apache.org/jira/browse/SPARK-13430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13430:
---
Assignee: Bryan Cutler
> Expose ml summary function in PySpark for classification and
[
https://issues.apache.org/jira/browse/SPARK-12626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188774#comment-15188774
]
Nick Pentreath commented on SPARK-12626:
[~dbtsai] ok thanks - would like to take a look when
[
https://issues.apache.org/jira/browse/SPARK-12626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186985#comment-15186985
]
Nick Pentreath commented on SPARK-12626:
[~mengxr] [~josephkb]
I see this mentioned as a major
[
https://issues.apache.org/jira/browse/SPARK-13706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13706:
---
Assignee: Jeremy
> Python Example for Train Validation Split Missing
>
[
https://issues.apache.org/jira/browse/SPARK-13706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13706:
---
Issue Type: Improvement (was: Bug)
> Python Example for Train Validation Split Missing
>
[
https://issues.apache.org/jira/browse/SPARK-13600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186648#comment-15186648
]
Nick Pentreath commented on SPARK-13600:
Thanks, that's fine
> Use approxQuantile from DataFrame
[
https://issues.apache.org/jira/browse/SPARK-13600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15184717#comment-15184717
]
Nick Pentreath commented on SPARK-13600:
[~ocp] Could you update this ticket with something about
[
https://issues.apache.org/jira/browse/SPARK-10785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15184711#comment-15184711
]
Nick Pentreath commented on SPARK-10785:
Pending SPARK-13600, this would no longer be necessary,
[
https://issues.apache.org/jira/browse/SPARK-13629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15182081#comment-15182081
]
Nick Pentreath commented on SPARK-13629:
[~josephkb] what do you think about adding this param to
[
https://issues.apache.org/jira/browse/SPARK-13629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15179571#comment-15179571
]
Nick Pentreath commented on SPARK-13629:
Only the word count would be set to 1 (for non-zero
[
https://issues.apache.org/jira/browse/SPARK-12326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-12326:
---
Assignee: Seth Hendrickson
> Move GBT implementation from spark.mllib to spark.ml
>
[
https://issues.apache.org/jira/browse/SPARK-13639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15179447#comment-15179447
]
Nick Pentreath commented on SPARK-13639:
For SPARK-13568, we can take one of two approaches:
1.
[
https://issues.apache.org/jira/browse/SPARK-13568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177368#comment-15177368
]
Nick Pentreath commented on SPARK-13568:
Ok - the Imputer will need to compute column stats
[
https://issues.apache.org/jira/browse/SPARK-13600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176117#comment-15176117
]
Nick Pentreath commented on SPARK-13600:
[~ocp] do you plan to submit a PR? Since you worked on
[
https://issues.apache.org/jira/browse/SPARK-13568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15172328#comment-15172328
]
Nick Pentreath commented on SPARK-13568:
Sure, go ahead. However, taking a quick look at your
[
https://issues.apache.org/jira/browse/SPARK-12348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath resolved SPARK-12348.
Resolution: Not A Bug
> PySpark _inferSchema crashes with incorrect exception on an empty
[
https://issues.apache.org/jira/browse/SPARK-12348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171964#comment-15171964
]
Nick Pentreath edited comment on SPARK-12348 at 2/29/16 3:10 PM:
-
I'm not
[
https://issues.apache.org/jira/browse/SPARK-12348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171964#comment-15171964
]
Nick Pentreath commented on SPARK-12348:
I'm not sure this is a bug or even a big deal. The cause
[
https://issues.apache.org/jira/browse/SPARK-12806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-12806:
---
Description:
Use cases exist where a specific index within a {{VectorUDT}} column of a
[
https://issues.apache.org/jira/browse/SPARK-12684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171957#comment-15171957
]
Nick Pentreath commented on SPARK-12684:
[~srowen] should this be resolved as *Won't Fix*?
>
[
https://issues.apache.org/jira/browse/SPARK-13568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13568:
---
Priority: Minor (was: Major)
> Create feature transformer to impute missing values
>
[
https://issues.apache.org/jira/browse/SPARK-13517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171905#comment-15171905
]
Nick Pentreath commented on SPARK-13517:
Is this not a duplicate of SPARK-13430?
> Expose
[
https://issues.apache.org/jira/browse/SPARK-13568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13568:
---
Description:
It is quite common to encounter missing values in data sets. It would be useful
[
https://issues.apache.org/jira/browse/SPARK-13568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13568:
---
Description:
It is quite common to encounter missing values in data sets. It would be useful
Nick Pentreath created SPARK-13568:
--
Summary: Create feature transformer to impute missing values
Key: SPARK-13568
URL: https://issues.apache.org/jira/browse/SPARK-13568
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-12633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath resolved SPARK-12633.
Resolution: Fixed
Fix Version/s: 2.0.0
Issue resolved by pull request 11404
[
https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168619#comment-15168619
]
Nick Pentreath commented on SPARK-13289:
Master branch should be building now. Can you try again?
[
https://issues.apache.org/jira/browse/SPARK-13505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168529#comment-15168529
]
Nick Pentreath commented on SPARK-13505:
[~holdenk] [~bryanc] [~sethah] any interest in adding
[
https://issues.apache.org/jira/browse/SPARK-13489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15167120#comment-15167120
]
Nick Pentreath commented on SPARK-13489:
Do we want to focus on work within core, or also
[
https://issues.apache.org/jira/browse/SPARK-13340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13340:
---
Assignee: Grzegorz Chilkiewicz
> [ML] PolynomialExpansion and Normalizer should validate
[
https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15158446#comment-15158446
]
Nick Pentreath commented on SPARK-13289:
Yes the master build is currently failing as detailed in
[
https://issues.apache.org/jira/browse/SPARK-12379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-12379:
---
Assignee: Seth Hendrickson
> Copy GBT implementation to spark.ml
>
[
https://issues.apache.org/jira/browse/SPARK-13026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15156803#comment-15156803
]
Nick Pentreath commented on SPARK-13026:
[~holdenk] is this JIRA necessary, as it duplicates
[
https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15156785#comment-15156785
]
Nick Pentreath commented on SPARK-13289:
[~daiqi5477] could you try your experiments again
[
https://issues.apache.org/jira/browse/SPARK-13334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath resolved SPARK-13334.
Resolution: Fixed
Fix Version/s: 2.0.0
Issue resolved by pull request 11214
[
https://issues.apache.org/jira/browse/SPARK-13334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-13334:
---
Assignee: Yanbo Liang
> ML KMeansModel/BisectingKMeansModel should be set parent
>
[
https://issues.apache.org/jira/browse/SPARK-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath resolved SPARK-12632.
Resolution: Fixed
Fix Version/s: 2.0.0
Issue resolved by pull request 11186
[
https://issues.apache.org/jira/browse/SPARK-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-12632:
---
Assignee: Bryan Cutler (was: somil deshmukh)
> Make Parameter Descriptions Consistent for
[
https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-12247:
---
Assignee: Benjamin Fradet
> Documentation for spark.ml's ALS and collaborative filtering in
[
https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-12247:
---
Affects Version/s: (was: 1.5.2)
2.0.0
> Documentation for
[
https://issues.apache.org/jira/browse/SPARK-12296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath resolved SPARK-12296.
Resolution: Fixed
Fix Version/s: 2.0.0
> Feature parity for pyspark.mllib
[
https://issues.apache.org/jira/browse/SPARK-12296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15067725#comment-15067725
]
Nick Pentreath commented on SPARK-12296:
Issue resolved by pull request 10298
[
https://issues.apache.org/jira/browse/SPARK-11922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-11922:
---
Assignee: holdenk
> Python API for ml.feature.QuantileDiscretizer
>
[
https://issues.apache.org/jira/browse/SPARK-12182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-12182:
---
Assignee: Seth Hendrickson
> Distributed binning for trees in spark.ml
>
[
https://issues.apache.org/jira/browse/SPARK-12296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Pentreath updated SPARK-12296:
---
Assignee: holdenk
> Feature parity for pyspark.mllib StandardScalerModel
>
[
https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14970870#comment-14970870
]
Nick Pentreath commented on SPARK-7008:
---
Is this now going in 1.6 (as per SPARK-10324)? If so is
901 - 950 of 950 matches
Mail list logo