[jira] [Created] (SPARK-6525) Add new feature transformers in ML package

2015-03-25 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-6525: Summary: Add new feature transformers in ML package Key: SPARK-6525 URL: https://issues.apache.org/jira/browse/SPARK-6525 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-6527) sc.binaryFiles can not access files on s3

2015-03-25 Thread Zhao Zhang (JIRA)
Zhao Zhang created SPARK-6527: - Summary: sc.binaryFiles can not access files on s3 Key: SPARK-6527 URL: https://issues.apache.org/jira/browse/SPARK-6527 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-6528) IDF transformer

2015-03-25 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-6528: Summary: IDF transformer Key: SPARK-6528 URL: https://issues.apache.org/jira/browse/SPARK-6528 Project: Spark Issue Type: Sub-task Reporter: Xusen Yin

[jira] [Comment Edited] (SPARK-6495) DataFrame#insertInto method should support insert rows with sub-columns

2015-03-25 Thread Chaozhong Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379243#comment-14379243 ] Chaozhong Yang edited comment on SPARK-6495 at 3/25/15 6:31 AM:

[jira] [Closed] (SPARK-6495) DataFrame#insertInto method should support insert rows with sub-columns

2015-03-25 Thread Chaozhong Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chaozhong Yang closed SPARK-6495. - Resolution: Not a Problem DataFrame#insertInto method should support insert rows with

[jira] [Commented] (SPARK-6526) Add Normalizer transformer

2015-03-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379371#comment-14379371 ] Apache Spark commented on SPARK-6526: - User 'yinxusen' has created a pull request for

[jira] [Created] (SPARK-6526) Add Normalizer transformer

2015-03-25 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-6526: Summary: Add Normalizer transformer Key: SPARK-6526 URL: https://issues.apache.org/jira/browse/SPARK-6526 Project: Spark Issue Type: Sub-task Reporter:

[jira] [Updated] (SPARK-6526) Add Normalizer transformer

2015-03-25 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-6526: - Description: https://github.com/apache/spark/pull/5181 Add Normalizer transformer

[jira] [Updated] (SPARK-6499) pyspark: printSchema command on a dataframe hangs

2015-03-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6499: --- Component/s: PySpark pyspark: printSchema command on a dataframe hangs

[jira] [Created] (SPARK-6530) ChiSqSelector transformer

2015-03-25 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-6530: Summary: ChiSqSelector transformer Key: SPARK-6530 URL: https://issues.apache.org/jira/browse/SPARK-6530 Project: Spark Issue Type: Sub-task Reporter:

[jira] [Created] (SPARK-6529) Word2Vec transformer

2015-03-25 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-6529: Summary: Word2Vec transformer Key: SPARK-6529 URL: https://issues.apache.org/jira/browse/SPARK-6529 Project: Spark Issue Type: Sub-task Reporter: Xusen

[jira] [Commented] (SPARK-6525) Add new feature transformers in ML package

2015-03-25 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379358#comment-14379358 ] Xusen Yin commented on SPARK-6525: -- [~mengxr] Let's add new feature transformers. I will

[jira] [Updated] (SPARK-6520) Kyro serialization broken in the shell

2015-03-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6520: --- Component/s: Spark Shell Kyro serialization broken in the shell

[jira] [Commented] (SPARK-6341) Upgrade breeze from 0.11.1 to 0.11.2 or later

2015-03-25 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379432#comment-14379432 ] Yu Ishikawa commented on SPARK-6341: I found another bug at breeze-0.11.1 under Spark

[jira] [Updated] (SPARK-6450) Metastore Parquet table conversion fails when a single metastore Parquet table appears multiple times in the query

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6450: -- Summary: Metastore Parquet table conversion fails when a single metastore Parquet table appears

[jira] [Commented] (SPARK-2926) Add MR-style (merge-sort) SortShuffleReader for sort-based shuffle

2015-03-25 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379532#comment-14379532 ] Saisai Shao commented on SPARK-2926: Hi [~DoingDone9], would you please give some

[jira] [Updated] (SPARK-6450) f

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6450: -- Summary: f (was: Native Parquet reader does not assign table name as qualifier) f -

[jira] [Updated] (SPARK-6450) MetastoreRelation.equals doesn't compare output attributes

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6450: -- Summary: MetastoreRelation.equals doesn't compare output attributes (was: f)

[jira] [Commented] (SPARK-6450) Metastore Parquet table conversion fails when a single metastore Parquet table appears multiple times in the query

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379499#comment-14379499 ] Cheng Lian commented on SPARK-6450: --- Here is a simpler Spark shell snippet for

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2015-03-25 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379513#comment-14379513 ] Yu Ishikawa commented on SPARK-2429: My implementation depends on a bug of breeze,

[jira] [Commented] (SPARK-6450) Metastore Parquet table conversion fails when a single metastore Parquet table appears multiple times in the query

2015-03-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379539#comment-14379539 ] Apache Spark commented on SPARK-6450: - User 'liancheng' has created a pull request for

[jira] [Created] (SPARK-6531) Information Theoretic Feature Selection Framework

2015-03-25 Thread JIRA
Sergio Ramírez-Gallego created SPARK-6531: - Summary: Information Theoretic Feature Selection Framework Key: SPARK-6531 URL: https://issues.apache.org/jira/browse/SPARK-6531 Project: Spark

[jira] [Commented] (SPARK-6509) MDLP discretizer

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379661#comment-14379661 ] Sean Owen commented on SPARK-6509: -- Same, isn't this just a realization of

[jira] [Updated] (SPARK-6531) An Information Theoretic Feature Selection Framework

2015-03-25 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergio Ramírez-Gallego updated SPARK-6531: -- Summary: An Information Theoretic Feature Selection Framework (was:

[jira] [Commented] (SPARK-6531) An Information Theoretic Feature Selection Framework

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379660#comment-14379660 ] Sean Owen commented on SPARK-6531: -- Why was this opened when

[jira] [Updated] (SPARK-6482) Remove synchronization of Hive Native commands

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6482: -- Target Version/s: 1.4.0 Remove synchronization of Hive Native commands

[jira] [Updated] (SPARK-6482) Remove synchronization of Hive Native commands

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6482: -- Shepherd: Cheng Lian Remove synchronization of Hive Native commands

[jira] [Issue Comment Deleted] (SPARK-6425) Add parallel Q-learning algorithm to MLLib

2015-03-25 Thread zhangyouhua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangyouhua updated SPARK-6425: --- Comment: was deleted (was: Q-Learning is a typical machine learning algorithm for solving tasks

[jira] [Updated] (SPARK-6482) Remove synchronization of Hive Native commands

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6482: -- Affects Version/s: 1.3.0 Remove synchronization of Hive Native commands

[jira] [Resolved] (SPARK-6507) Create separate Hive Driver instance for each SQL query in HiveThriftServer2

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6507. --- Resolution: Duplicate Create separate Hive Driver instance for each SQL query in HiveThriftServer2

[jira] [Commented] (SPARK-6411) PySpark DataFrames can't be created if any datetimes have timezones

2015-03-25 Thread Harry Brundage (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379947#comment-14379947 ] Harry Brundage commented on SPARK-6411: --- I've opened and issue on the upstream

[jira] [Commented] (SPARK-6425) Add parallel Q-learning algorithm to MLLib

2015-03-25 Thread zhangyouhua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379796#comment-14379796 ] zhangyouhua commented on SPARK-6425: The main problem of Q-learning algorithm is that

[jira] [Commented] (SPARK-6425) Add parallel Q-learning algorithm to MLLib

2015-03-25 Thread zhangyouhua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379789#comment-14379789 ] zhangyouhua commented on SPARK-6425: Q-Learning is a typical machine learning

[jira] [Commented] (SPARK-6425) Add parallel Q-learning algorithm to MLLib

2015-03-25 Thread zhangyouhua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379790#comment-14379790 ] zhangyouhua commented on SPARK-6425: Q-Learning is a typical machine learning

[jira] [Issue Comment Deleted] (SPARK-6425) Add parallel Q-learning algorithm to MLLib

2015-03-25 Thread zhangyouhua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangyouhua updated SPARK-6425: --- Comment: was deleted (was: Q-Learning is a typical machine learning algorithm for solving tasks

[jira] [Commented] (SPARK-6465) GenericRowWithSchema: KryoException: Class cannot be created (missing no-arg constructor):

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379812#comment-14379812 ] Cheng Lian commented on SPARK-6465: --- I believe for now this is only used for testing

[jira] [Commented] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-03-25 Thread min cheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379777#comment-14379777 ] min cheng commented on SPARK-6192: -- Hello,all, I am a candidate for Professional master

[jira] [Commented] (SPARK-6425) Add parallel Q-learning algorithm to MLLib

2015-03-25 Thread zhangyouhua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379791#comment-14379791 ] zhangyouhua commented on SPARK-6425: Q-Learning is a typical machine learning

[jira] [Commented] (SPARK-6480) histogram() bucket function is wrong in some simple edge cases

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379913#comment-14379913 ] Sean Owen commented on SPARK-6480: -- [~frosner] can you have a peek at the PR and see if

[jira] [Commented] (SPARK-3849) Automate remaining Spark Code Style Guide rules

2015-03-25 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380051#comment-14380051 ] Nicholas Chammas commented on SPARK-3849: - [~boyork] - You may be interested in

[jira] [Updated] (SPARK-6533) Allow using wildcard and other file pattern in Parquet DataSource

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6533: -- Priority: Critical (was: Major) Target Version/s: 1.4.0 Allow using wildcard and other

[jira] [Created] (SPARK-6533) Cannot use wildcard and other file pattern in sqlContext.parquetFile if spark.sql.parquet.useDataSourceApi is not set to false

2015-03-25 Thread Jianshi Huang (JIRA)
Jianshi Huang created SPARK-6533: Summary: Cannot use wildcard and other file pattern in sqlContext.parquetFile if spark.sql.parquet.useDataSourceApi is not set to false Key: SPARK-6533 URL:

[jira] [Updated] (SPARK-6524) Problem connecting JAVA API to Spark Yarn Cluster or yarn Client

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6524: - Component/s: YARN Java API Please set Component. If it's really a question it should not

[jira] [Updated] (SPARK-6533) Allow using wildcard and other file pattern in Parquet DataSource

2015-03-25 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jianshi Huang updated SPARK-6533: - Description: By default, spark.sql.parquet.useDataSourceApi is set to true. And loading parquet

[jira] [Assigned] (SPARK-6533) Allow using wildcard and other file pattern in Parquet DataSource

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reassigned SPARK-6533: - Assignee: Cheng Lian Allow using wildcard and other file pattern in Parquet DataSource

[jira] [Created] (SPARK-6532) spark-mllib_2.10 fails to compile LDAModel.java

2015-03-25 Thread Brian O'Keefe (JIRA)
Brian O'Keefe created SPARK-6532: Summary: spark-mllib_2.10 fails to compile LDAModel.java Key: SPARK-6532 URL: https://issues.apache.org/jira/browse/SPARK-6532 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6532) spark-mllib_2.10 fails to compile LDAModel.java

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380060#comment-14380060 ] Sean Owen commented on SPARK-6532: -- I can't reproduce this. It's not a compile error, but

[jira] [Commented] (SPARK-6532) spark-mllib_2.10 fails to compile LDAModel.java

2015-03-25 Thread Brian O'Keefe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380069#comment-14380069 ] Brian O'Keefe commented on SPARK-6532: -- Not that I am aware of, but my Scala

[jira] [Comment Edited] (SPARK-6532) spark-mllib_2.10 fails to compile LDAModel.java

2015-03-25 Thread Brian O'Keefe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380069#comment-14380069 ] Brian O'Keefe edited comment on SPARK-6532 at 3/25/15 3:42 PM:

[jira] [Updated] (SPARK-6532) LDAModel.scala fails scalastyle on Windows

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6532: - Summary: LDAModel.scala fails scalastyle on Windows (was: LDAModel.java fails scalastyle on Windows)

[jira] [Updated] (SPARK-6532) spark-mllib_2.10 fails to compile LDAModel.java on Windows

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6532: - Component/s: Windows Priority: Minor (was: Blocker) Summary: spark-mllib_2.10 fails to

[jira] [Updated] (SPARK-6510) Add Graph#minus method to act as Set#difference

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6510: - Component/s: GraphX Let's make sure to set Component Add Graph#minus method to act as Set#difference

[jira] [Resolved] (SPARK-5747) Review all Bash scripts for word splitting bugs

2015-03-25 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-5747. - Resolution: Incomplete Resolving since this issue's scope is too large / loosely defined.

[jira] [Assigned] (SPARK-6450) Metastore Parquet table conversion fails when a single metastore Parquet table appears multiple times in the query

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reassigned SPARK-6450: - Assignee: Cheng Lian (was: Michael Armbrust) Metastore Parquet table conversion fails when a

[jira] [Updated] (SPARK-6533) Allow using wildcard and other file pattern in Parquet DataSource

2015-03-25 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jianshi Huang updated SPARK-6533: - Description: If spark.sql.parquet.useDataSourceApi is not set to false, which is the default.

[jira] [Commented] (SPARK-6533) Allow using wildcard and other file pattern in Parquet DataSource

2015-03-25 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380131#comment-14380131 ] Cheng Lian commented on SPARK-6533: --- Bumped to critical because this should be

[jira] [Resolved] (SPARK-3632) ConnectionManager can run out of receive threads with authentication on

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3632. -- Resolution: Fixed Target Version/s: 1.2.0 (was: 1.1.2, 1.2.0) Same, resolving this as there

[jira] [Updated] (SPARK-6509) MDLP discretizer

2015-03-25 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergio Ramírez updated SPARK-6509: -- Description: Minimum Description Lenght Discretizer This method implements Fayyad's

[jira] [Updated] (SPARK-6526) Add Normalizer transformer

2015-03-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6526: - Assignee: Xusen Yin Add Normalizer transformer --

[jira] [Commented] (SPARK-6532) LDAModel.scala fails scalastyle on Windows

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380291#comment-14380291 ] Sean Owen commented on SPARK-6532: -- Aha, I knew this all rang a bell. It was fixed

[jira] [Updated] (SPARK-6063) MLlib doesn't pass mvn scalastyle check due to UTF chars in LDAModel.scala

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6063: - Fix Version/s: 1.3.1 Back ported for 1.3.1 MLlib doesn't pass mvn scalastyle check due to UTF chars in

[jira] [Resolved] (SPARK-3628) Don't apply accumulator updates multiple times for tasks in result stages

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3628. -- Resolution: Fixed Target Version/s: (was: 1.1.2) At this stage, calling it Fixed on the

[jira] [Updated] (SPARK-3628) Don't apply accumulator updates multiple times for tasks in result stages

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3628: - Labels: (was: backport-needed) Don't apply accumulator updates multiple times for tasks in result

[jira] [Updated] (SPARK-3632) ConnectionManager can run out of receive threads with authentication on

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3632: - Labels: (was: backport-needed) ConnectionManager can run out of receive threads with authentication on

[jira] [Commented] (SPARK-6510) Add Graph#minus method to act as Set#difference

2015-03-25 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380262#comment-14380262 ] Brennon York commented on SPARK-6510: - Whoops, thanks for catching that! Add

[jira] [Updated] (SPARK-4989) wrong application configuration cause cluster down in standalone mode

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4989: - Target Version/s: 1.0.3 (was: 1.1.2, 1.2.2, 1.3.0) Fix Version/s: 1.2.1 Just keeping score: this

[jira] [Commented] (SPARK-6532) LDAModel.scala fails scalastyle on Windows

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380223#comment-14380223 ] Sean Owen commented on SPARK-6532: -- OK, I think it would be pretty safe to specify

[jira] [Commented] (SPARK-6496) Multinomial Logistic Regression failed when initialWeights is not null

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380249#comment-14380249 ] Sean Owen commented on SPARK-6496: -- Resolved by https://github.com/apache/spark/pull/5167

[jira] [Resolved] (SPARK-6496) Multinomial Logistic Regression failed when initialWeights is not null

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6496. -- Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Assignee: Yanbo Liang

[jira] [Resolved] (SPARK-5566) Tokenizer for mllib package

2015-03-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5566. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4504

[jira] [Commented] (SPARK-6532) LDAModel.scala fails scalastyle on Windows

2015-03-25 Thread Brian O'Keefe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380313#comment-14380313 ] Brian O'Keefe commented on SPARK-6532: -- Thanks (PR defaulted to Problem Report in my

[jira] [Commented] (SPARK-6532) LDAModel.scala fails scalastyle on Windows

2015-03-25 Thread Brian O'Keefe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380173#comment-14380173 ] Brian O'Keefe commented on SPARK-6532: -- If I add the --inputEncoding UTF-8 parameter

[jira] [Created] (SPARK-6534) Task ID and Index columns appear to be reversed on AM web UI

2015-03-25 Thread Alex Shafer (JIRA)
Alex Shafer created SPARK-6534: -- Summary: Task ID and Index columns appear to be reversed on AM web UI Key: SPARK-6534 URL: https://issues.apache.org/jira/browse/SPARK-6534 Project: Spark

[jira] [Updated] (SPARK-6531) An Information Theoretic Feature Selection Framework

2015-03-25 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergio Ramírez updated SPARK-6531: -- Description: **Information Theoretic Feature Selection Framework** The present framework

[jira] [Commented] (SPARK-6532) LDAModel.scala fails scalastyle on Windows

2015-03-25 Thread Brian O'Keefe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380256#comment-14380256 ] Brian O'Keefe commented on SPARK-6532: -- I think I found the issue.

[jira] [Updated] (SPARK-5566) Tokenizer for mllib package

2015-03-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5566: - Assignee: Augustin Borsu Tokenizer for mllib package ---

[jira] [Updated] (SPARK-6534) Task ID and Index columns appear to be reversed on AM web UI

2015-03-25 Thread Alex Shafer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Shafer updated SPARK-6534: --- Attachment: task_id_index.png Attached screenshot. Task ID and Index columns appear to be reversed

[jira] [Resolved] (SPARK-6409) It is not necessary that avoid old inteface of hive, because this will make some UDAF can not work.

2015-03-25 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6409. - Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Issue resolved by

[jira] [Created] (SPARK-6536) Add IN to python Column

2015-03-25 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-6536: --- Summary: Add IN to python Column Key: SPARK-6536 URL: https://issues.apache.org/jira/browse/SPARK-6536 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6481) Set In Progress when a PR is opened for an issue

2015-03-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380432#comment-14380432 ] Patrick Wendell commented on SPARK-6481: Hey All, One issue here, (I think?)

[jira] [Updated] (SPARK-6521) executors in the same node read local shuffle file

2015-03-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6521: - Affects Version/s: 1.2.0 executors in the same node read local shuffle file

[jira] [Commented] (SPARK-3849) Automate remaining Spark Code Style Guide rules

2015-03-25 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380573#comment-14380573 ] Nicholas Chammas commented on SPARK-3849: - Yeah, I suggest reading through this

[jira] [Commented] (SPARK-3849) Automate remaining Spark Code Style Guide rules

2015-03-25 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380603#comment-14380603 ] Nicholas Chammas commented on SPARK-3849: - Sounds good. My quick summary (which

[jira] [Commented] (SPARK-6465) GenericRowWithSchema: KryoException: Class cannot be created (missing no-arg constructor):

2015-03-25 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380471#comment-14380471 ] Michael Armbrust commented on SPARK-6465: - We don't keep the schema around

[jira] [Created] (SPARK-6535) new RDD function that returns intermediate Future

2015-03-25 Thread Eric Johnston (JIRA)
Eric Johnston created SPARK-6535: Summary: new RDD function that returns intermediate Future Key: SPARK-6535 URL: https://issues.apache.org/jira/browse/SPARK-6535 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6481) Set In Progress when a PR is opened for an issue

2015-03-25 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380464#comment-14380464 ] Nicholas Chammas commented on SPARK-6481: - The Spark user can initiate state

[jira] [Commented] (SPARK-6537) UIWorkloadGenerator: The main thread should not stop SparkContext until all jobs finish

2015-03-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380483#comment-14380483 ] Apache Spark commented on SPARK-6537: - User 'sarutak' has created a pull request for

[jira] [Commented] (SPARK-3849) Automate remaining Spark Code Style Guide rules

2015-03-25 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380552#comment-14380552 ] Brennon York commented on SPARK-3849: - More than happy to take this on. I've been

[jira] [Commented] (SPARK-6481) Set In Progress when a PR is opened for an issue

2015-03-25 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380585#comment-14380585 ] Nicholas Chammas commented on SPARK-6481: - PR for this:

[jira] [Commented] (SPARK-6535) new RDD function that returns intermediate Future

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380631#comment-14380631 ] Sean Owen commented on SPARK-6535: -- I don't follow. If you map T = U and U = V then

[jira] [Commented] (SPARK-4660) JavaSerializer uses wrong classloader

2015-03-25 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380633#comment-14380633 ] sam commented on SPARK-4660: I'm getting this exception in executor logs, behaviour seems

[jira] [Updated] (SPARK-6536) Add IN to python Column

2015-03-25 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6536: Issue Type: Improvement (was: Bug) Add IN to python Column ---

[jira] [Assigned] (SPARK-6481) Set In Progress when a PR is opened for an issue

2015-03-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6481: --- Assignee: (was: Nicholas Chammas) Set In Progress when a PR is opened for an issue

[jira] [Commented] (SPARK-3849) Automate remaining Spark Code Style Guide rules

2015-03-25 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380590#comment-14380590 ] Brennon York commented on SPARK-3849: - Roger that, good advice. When (or if, haha) I

[jira] [Created] (SPARK-6537) UIWorkloadGenerator: The main thread should not stop SparkContext until all jobs finish

2015-03-25 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-6537: - Summary: UIWorkloadGenerator: The main thread should not stop SparkContext until all jobs finish Key: SPARK-6537 URL: https://issues.apache.org/jira/browse/SPARK-6537

[jira] [Commented] (SPARK-6425) Add parallel Q-learning algorithm to MLLib

2015-03-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380611#comment-14380611 ] Joseph K. Bradley commented on SPARK-6425: -- How long does each iteration take in

[jira] [Commented] (SPARK-6534) Task ID and Index columns appear to be reversed on AM web UI

2015-03-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380646#comment-14380646 ] Sean Owen commented on SPARK-6534: -- Headers are Index, ID, attempt:

[jira] [Updated] (SPARK-6537) UIWorkloadGenerator: The main thread should not stop SparkContext until all jobs finish

2015-03-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6537: - Priority: Minor (was: Major) UIWorkloadGenerator: The main thread should not stop SparkContext until

[jira] [Commented] (SPARK-6534) Task ID and Index columns appear to be reversed on AM web UI

2015-03-25 Thread Alex Shafer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14380741#comment-14380741 ] Alex Shafer commented on SPARK-6534: Its just that the column labelled ID is

[jira] [Reopened] (SPARK-6168) Expose some of the collection classes as DeveloperApi

2015-03-25 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan reopened SPARK-6168: Assignee: Mridul Muralidharan Expose some of the collection classes as

  1   2   3   >