[
https://issues.apache.org/jira/browse/SPARK-3156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3156.
--
Resolution: Fixed
Fix Version/s: 1.2.0
DecisionTree: Order categorical features
Xiangrui Meng created SPARK-3443:
Summary: Update the default values of some decision tree parameters
Key: SPARK-3443
URL: https://issues.apache.org/jira/browse/SPARK-3443
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-3443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3443:
-
Priority: Minor (was: Major)
Target Version/s: 1.2.0
Update the default values of
[
https://issues.apache.org/jira/browse/SPARK-3249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14126191#comment-14126191
]
Xiangrui Meng commented on SPARK-3249:
--
I think we should point to the one with the
[
https://issues.apache.org/jira/browse/SPARK-3160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3160:
-
Assignee: Joseph K. Bradley
Simplify DecisionTree data structure for training
[
https://issues.apache.org/jira/browse/SPARK-3443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3443.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2322
Xiangrui Meng created SPARK-3459:
Summary: MulticlassMetrics is not serializable
Key: SPARK-3459
URL: https://issues.apache.org/jira/browse/SPARK-3459
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-3459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng closed SPARK-3459.
Resolution: Cannot Reproduce
MulticlassMetrics is not serializable
[
https://issues.apache.org/jira/browse/SPARK-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3494.
--
Resolution: Fixed
Assignee: Joseph K. Bradley
https://github.com/apache/spark/pull/2341
[
https://issues.apache.org/jira/browse/SPARK-3160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3160.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2341
[
https://issues.apache.org/jira/browse/SPARK-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3494:
-
Fix Version/s: 1.2.0
DecisionTree overflow error in calculating maxMemoryUsage
[
https://issues.apache.org/jira/browse/SPARK-2830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-2830.
--
Resolution: Fixed
Fix Version/s: 1.1.0
MLlib v1.1 documentation
[
https://issues.apache.org/jira/browse/SPARK-2838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-2838:
-
Target Version/s: 1.2.0 (was: 1.1.0)
performance tests for feature transformations
[
https://issues.apache.org/jira/browse/SPARK-3249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3249:
-
Target Version/s: 1.2.0 (was: 1.1.0)
Fix links in ScalaDoc that cause warning messages in
[
https://issues.apache.org/jira/browse/SPARK-3436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3436:
-
Assignee: Liquan Pei
[MLlib]Streaming SVM
-
Key:
[
https://issues.apache.org/jira/browse/SPARK-2838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-2838:
-
Assignee: (was: Xiangrui Meng)
performance tests for feature transformations
[
https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14131299#comment-14131299
]
Xiangrui Meng commented on SPARK-1405:
--
[~xusen] and [~gq] Thanks for working on LDA!
Xiangrui Meng created SPARK-3530:
Summary: Pipeline and Parameters
Key: SPARK-3530
URL: https://issues.apache.org/jira/browse/SPARK-3530
Project: Spark
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/SPARK-3396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3396.
--
Resolution: Fixed
Change LogistricRegressionWithSGD's default regType to L2
[
https://issues.apache.org/jira/browse/SPARK-3516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3516.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2349
[
https://issues.apache.org/jira/browse/SPARK-3516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3516:
-
Assignee: Joseph K. Bradley
DecisionTree Python support for params maxInstancesPerNode,
[
https://issues.apache.org/jira/browse/SPARK-3366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14134831#comment-14134831
]
Xiangrui Meng commented on SPARK-3366:
--
It is more about communication than
Xiangrui Meng created SPARK-3541:
Summary: Improve ALS internal storage
Key: SPARK-3541
URL: https://issues.apache.org/jira/browse/SPARK-3541
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14134860#comment-14134860
]
Xiangrui Meng commented on SPARK-3530:
--
[~srowen] Thanks for the comments!
The new
[
https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3181:
-
Priority: Major (was: Critical)
Add Robust Regression Algorithm with Huber Estimator
[
https://issues.apache.org/jira/browse/SPARK-3188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3188:
-
Priority: Minor (was: Critical)
Add Robust Regression Algorithm with Tukey bisquare weight
[
https://issues.apache.org/jira/browse/SPARK-3188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3188:
-
Affects Version/s: (was: 1.0.2)
Add Robust Regression Algorithm with Tukey bisquare weight
[
https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3181:
-
Affects Version/s: (was: 1.0.2)
Add Robust Regression Algorithm with Huber Estimator
[
https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3181:
-
Target Version/s: 1.2.0 (was: 1.1.1, 1.2.0)
Add Robust Regression Algorithm with Huber
[
https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1503:
-
Assignee: (was: Xiangrui Meng)
Implement Nesterov's accelerated first-order method
[
https://issues.apache.org/jira/browse/SPARK-3357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3357:
-
Assignee: (was: Xiangrui Meng)
Internal log messages should be set at DEBUG level instead of
[
https://issues.apache.org/jira/browse/SPARK-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3258:
-
Assignee: (was: Xiangrui Meng)
Python API for streaming MLlib algorithms
[
https://issues.apache.org/jira/browse/SPARK-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1486:
-
Assignee: Burak Yavuz (was: Xiangrui Meng)
Support multi-model training in MLlib
[
https://issues.apache.org/jira/browse/SPARK-2944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-2944.
--
Resolution: Cannot Reproduce
Closing this one now because I couldn't find an easy way to
[
https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3066:
-
Assignee: (was: Xiangrui Meng)
Support recommendAll in matrix factorization model
[
https://issues.apache.org/jira/browse/SPARK-3568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3568:
-
Priority: Minor (was: Major)
Add metrics for ranking algorithms
[
https://issues.apache.org/jira/browse/SPARK-3568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3568:
-
Assignee: Shuo Xiang
Add metrics for ranking algorithms
--
Xiangrui Meng created SPARK-3569:
Summary: Add metadata field to StructField
Key: SPARK-3569
URL: https://issues.apache.org/jira/browse/SPARK-3569
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-3569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3569:
-
Component/s: MLlib
ML
Add metadata field to StructField
Xiangrui Meng created SPARK-3572:
Summary: Support register UserType in SQL
Key: SPARK-3572
URL: https://issues.apache.org/jira/browse/SPARK-3572
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3573:
-
Shepherd: Michael Armbrust
Dataset
---
Key: SPARK-3573
[
https://issues.apache.org/jira/browse/SPARK-3569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3569:
-
Description:
Want to add a metadata field to StructField that can be used by other
applications
[
https://issues.apache.org/jira/browse/SPARK-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3270:
-
Issue Type: New Feature (was: Improvement)
Spark API for Application Extensions
[
https://issues.apache.org/jira/browse/SPARK-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139461#comment-14139461
]
Xiangrui Meng commented on SPARK-3403:
--
Sorry, it should be netlib-java, but the real
[
https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139600#comment-14139600
]
Xiangrui Meng commented on SPARK-3530:
--
[~eustache] The default implementation of
[
https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139600#comment-14139600
]
Xiangrui Meng edited comment on SPARK-3530 at 9/18/14 10:06 PM:
[
https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3573:
-
Description:
This JIRA is for discussion of ML dataset, essentially a SchemaRDD with extra
[
https://issues.apache.org/jira/browse/SPARK-3600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3600:
-
Summary: RDD[Double] doesn't use primitive arrays for caching (was:
RandomRDDs doesn't create
[
https://issues.apache.org/jira/browse/SPARK-3600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3600:
-
Issue Type: Improvement (was: Bug)
RDD[Double] doesn't use primitive arrays for caching
[
https://issues.apache.org/jira/browse/SPARK-3600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3600:
-
Component/s: (was: MLlib)
RDD[Double] doesn't use primitive arrays for caching
[
https://issues.apache.org/jira/browse/SPARK-3600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3600:
-
Target Version/s: (was: 1.1.1, 1.2.0)
RDD[Double] doesn't use primitive arrays for caching
[
https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141271#comment-14141271
]
Xiangrui Meng commented on SPARK-3573:
--
[~sandyr] SQL/Streaming/GraphX provide
[
https://issues.apache.org/jira/browse/SPARK-3541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng reassigned SPARK-3541:
Assignee: Xiangrui Meng
Improve ALS internal storage
[
https://issues.apache.org/jira/browse/SPARK-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-1484.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2347
[
https://issues.apache.org/jira/browse/SPARK-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1484:
-
Assignee: Aaron Staple
MLlib should warn if you are using an iterative algorithm on non-cached
[
https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14148529#comment-14148529
]
Xiangrui Meng commented on SPARK-1405:
--
[~Guoqiang Li] and [~pedrorodriguez], since
[
https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1405:
-
Assignee: Guoqiang Li (was: Xusen Yin)
parallel Latent Dirichlet Allocation (LDA) atop of spark
[
https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1405:
-
Shepherd: Xiangrui Meng
parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib
[
https://issues.apache.org/jira/browse/SPARK-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14148549#comment-14148549
]
Xiangrui Meng commented on SPARK-1241:
--
This is implemented MLlib:
[
https://issues.apache.org/jira/browse/SPARK-3588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14148900#comment-14148900
]
Xiangrui Meng commented on SPARK-3588:
--
Please follow the instructions at
[
https://issues.apache.org/jira/browse/SPARK-3614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3614.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2494
[
https://issues.apache.org/jira/browse/SPARK-2516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14149721#comment-14149721
]
Xiangrui Meng commented on SPARK-2516:
--
The plan was to implement Bag of Little
[
https://issues.apache.org/jira/browse/SPARK-2516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-2516:
-
Assignee: Yu Ishikawa
Bootstrapping
-
Key: SPARK-2516
[
https://issues.apache.org/jira/browse/SPARK-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1547:
-
Shepherd: Joseph K. Bradley
Add gradient boosting algorithm to MLlib
[
https://issues.apache.org/jira/browse/SPARK-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1547:
-
Target Version/s: 1.2.0
Add gradient boosting algorithm to MLlib
[
https://issues.apache.org/jira/browse/SPARK-3700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3700:
-
Assignee: (was: Yin Huai)
Improve the performance of JSON parser
[
https://issues.apache.org/jira/browse/SPARK-3701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3701:
-
Priority: Minor (was: Major)
Some clean-up work after the refactoring of MLlib's SerDe for
Xiangrui Meng created SPARK-3701:
Summary: Some clean-up work after the refactoring of MLlib's SerDe
for PySpark
Key: SPARK-3701
URL: https://issues.apache.org/jira/browse/SPARK-3701
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3702:
-
Assignee: Joseph K. Bradley
Standardize MLlib classes for learners, models
[
https://issues.apache.org/jira/browse/SPARK-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1545:
-
Assignee: Joseph K. Bradley (was: Manish Amde)
Add Random Forest algorithm to MLlib
[
https://issues.apache.org/jira/browse/SPARK-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-1545.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2435
[
https://issues.apache.org/jira/browse/SPARK-3366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3366:
-
Assignee: Qiping Li
Compute best splits distributively in decision tree
[
https://issues.apache.org/jira/browse/SPARK-2885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-2885.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 1778
Xiangrui Meng created SPARK-3735:
Summary: Sending the factor directly or AtA based on the cost in
ALS
Key: SPARK-3735
URL: https://issues.apache.org/jira/browse/SPARK-3735
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-3434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14152636#comment-14152636
]
Xiangrui Meng commented on SPARK-3434:
--
[~shivaram] Could you post the design of the
[
https://issues.apache.org/jira/browse/SPARK-3366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3366:
-
Target Version/s: 1.2.0
Compute best splits distributively in decision tree
[
https://issues.apache.org/jira/browse/SPARK-3436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3436:
-
Summary: Streaming SVM (was: [MLlib]Streaming SVM )
Streaming SVM
--
[
https://issues.apache.org/jira/browse/SPARK-3486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3486:
-
Summary: Add PySpark support for Word2Vec (was: [MLlib]Add PySpark support
for Word2Vec)
Add
[
https://issues.apache.org/jira/browse/SPARK-3158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3158:
-
Target Version/s: 1.2.0
Avoid 1 extra aggregation for DecisionTree training
[
https://issues.apache.org/jira/browse/SPARK-3158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3158:
-
Priority: Major (was: Minor)
Avoid 1 extra aggregation for DecisionTree training
[
https://issues.apache.org/jira/browse/SPARK-3161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3161:
-
Priority: Major (was: Minor)
Target Version/s: 1.2.0
Cache example-node map for
[
https://issues.apache.org/jira/browse/SPARK-3541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14153911#comment-14153911
]
Xiangrui Meng commented on SPARK-3541:
--
I put the implementation at
[
https://issues.apache.org/jira/browse/SPARK-3701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3701.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2548
[
https://issues.apache.org/jira/browse/SPARK-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3751:
-
Assignee: Joseph K. Bradley
DecisionTreeRunner functionality improvement
[
https://issues.apache.org/jira/browse/SPARK-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3751.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2604
[
https://issues.apache.org/jira/browse/SPARK-3572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3572:
-
Assignee: Joseph K. Bradley
Support register UserType in SQL
[
https://issues.apache.org/jira/browse/SPARK-3366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3366.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2595
[
https://issues.apache.org/jira/browse/SPARK-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1655:
-
Assignee: Aaron Staple
In naive Bayes, store conditional probabilities distributively.
Xiangrui Meng created SPARK-3820:
Summary: Specialize columnSimilarity() without any threshold
Key: SPARK-3820
URL: https://issues.apache.org/jira/browse/SPARK-3820
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-3803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161281#comment-14161281
]
Xiangrui Meng commented on SPARK-3803:
--
In `computeCovariance`, we generate a warning
[
https://issues.apache.org/jira/browse/SPARK-3370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng closed SPARK-3370.
Resolution: Duplicate
This is a known issue. We can fix it by checkpointing intermediate RDDs. For
[
https://issues.apache.org/jira/browse/SPARK-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3424:
-
Assignee: Derrick Burns
KMeans Plus Plus is too slow
[
https://issues.apache.org/jira/browse/SPARK-3261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-3261:
-
Assignee: Derrick Burns
KMeans clusterer can return duplicate cluster centers
[
https://issues.apache.org/jira/browse/SPARK-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161540#comment-14161540
]
Xiangrui Meng commented on SPARK-3828:
--
`text8` doesn't contain any line feed
[
https://issues.apache.org/jira/browse/SPARK-3434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162156#comment-14162156
]
Xiangrui Meng commented on SPARK-3434:
--
[~shivaram] and [~ConcreteVitamin] Any
[
https://issues.apache.org/jira/browse/SPARK-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng reopened SPARK-3828:
--
Spark returns inconsistent results when building with different Hadoop
version
[
https://issues.apache.org/jira/browse/SPARK-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162439#comment-14162439
]
Xiangrui Meng commented on SPARK-3828:
--
I re-opened this because it may be a serious
Xiangrui Meng created SPARK-3838:
Summary: Python code example for Word2Vec in user guide
Key: SPARK-3838
URL: https://issues.apache.org/jira/browse/SPARK-3838
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-3790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3790.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2622
[
https://issues.apache.org/jira/browse/SPARK-3486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-3486.
--
Resolution: Fixed
Fix Version/s: 1.2.0
Issue resolved by pull request 2356
501 - 600 of 5214 matches
Mail list logo