[jira] [Updated] (SPARK-6727) Model export/import for spark.ml: HashingTF

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6727: - Target Version/s: (was: 1.4.0) Model export/import for spark.ml: HashingTF

[jira] [Updated] (SPARK-6788) Model export/import for spark.ml: Tokenizer

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6788: - Target Version/s: (was: 1.4.0) Model export/import for spark.ml: Tokenizer

[jira] [Updated] (SPARK-6790) Model export/import for spark.ml: LinearRegression

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6790: - Target Version/s: (was: 1.4.0) Model export/import for spark.ml: LinearRegression

[jira] [Updated] (SPARK-6789) Model export/import for spark.ml: ALS

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6789: - Target Version/s: (was: 1.4.0) Model export/import for spark.ml: ALS

[jira] [Updated] (SPARK-6791) Model export/import for spark.ml: meta-algorithms

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6791: - Target Version/s: (was: 1.4.0) Model export/import for spark.ml: meta-algorithms

[jira] [Updated] (SPARK-6725) Model export/import for Pipeline API

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6725: - Description: This is an umbrella JIRA for adding model export/import to the spark.ml API.

[jira] [Updated] (SPARK-6725) Model export/import for Pipeline API

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6725: - Target Version/s: (was: 1.4.0) Model export/import for Pipeline API

[jira] [Comment Edited] (SPARK-7002) Persist on RDD fails the second time if the action is called on a child RDD without showing a FAILED message

2015-04-20 Thread Tom Hubregtsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503730#comment-14503730 ] Tom Hubregtsen edited comment on SPARK-7002 at 4/20/15 9:46 PM:

[jira] [Commented] (SPARK-7002) Persist on RDD fails the second time if the action is called on a child RDD without showing a FAILED message

2015-04-20 Thread Tom Hubregtsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503730#comment-14503730 ] Tom Hubregtsen commented on SPARK-7002: --- Your speculation was correct: After the

[jira] [Commented] (SPARK-6921) Spark SQL API saveAsParquetFile will output tachyon file with different block size

2015-04-20 Thread Sebastian YEPES FERNANDEZ (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503921#comment-14503921 ] Sebastian YEPES FERNANDEZ commented on SPARK-6921: -- I can also validate

[jira] [Commented] (SPARK-7022) PySpark is missing ParamGridBuilder

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503957#comment-14503957 ] Apache Spark commented on SPARK-7022: - User 'oefirouz' has created a pull request for

[jira] [Assigned] (SPARK-7022) PySpark is missing ParamGridBuilder

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7022: --- Assignee: Apache Spark PySpark is missing ParamGridBuilder

[jira] [Assigned] (SPARK-7022) PySpark is missing ParamGridBuilder

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7022: --- Assignee: (was: Apache Spark) PySpark is missing ParamGridBuilder

[jira] [Commented] (SPARK-6635) DataFrame.withColumn can create columns with identical names

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504084#comment-14504084 ] Joseph K. Bradley commented on SPARK-6635: -- Just to clarify, does that mean

[jira] [Commented] (SPARK-7002) Persist on RDD fails the second time if the action is called on a child RDD without showing a FAILED message

2015-04-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503761#comment-14503761 ] Sean Owen commented on SPARK-7002: -- The shuffle data is a sort of hidden, second type of

[jira] [Commented] (SPARK-7002) Persist on RDD fails the second time if the action is called on a child RDD without showing a FAILED message

2015-04-20 Thread Tom Hubregtsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503783#comment-14503783 ] Tom Hubregtsen commented on SPARK-7002: --- Great, thanks for your help :) I will be

[jira] [Created] (SPARK-7019) Build docs on doc changes

2015-04-20 Thread Brennon York (JIRA)
Brennon York created SPARK-7019: --- Summary: Build docs on doc changes Key: SPARK-7019 URL: https://issues.apache.org/jira/browse/SPARK-7019 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-6635) DataFrame.withColumn can create columns with identical names

2015-04-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504092#comment-14504092 ] Michael Armbrust commented on SPARK-6635: - Sorry, updated. I meant

[jira] [Commented] (SPARK-7009) Build assembly JAR via ant to avoid zip64 problems

2015-04-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503754#comment-14503754 ] Sean Owen commented on SPARK-7009: -- Or warnings, yes. These add to the case that updating

[jira] [Updated] (SPARK-6726) Model export/import for spark.ml: LogisticRegression

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6726: - Target Version/s: (was: 1.4.0) Model export/import for spark.ml: LogisticRegression

[jira] [Updated] (SPARK-6786) Model export/import for spark.ml: Normalizer

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6786: - Target Version/s: (was: 1.4.0) Model export/import for spark.ml: Normalizer

[jira] [Updated] (SPARK-6787) Model export/import for spark.ml: StandardScaler

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6787: - Target Version/s: (was: 1.4.0) Model export/import for spark.ml: StandardScaler

[jira] [Commented] (SPARK-6635) DataFrame.withColumn can create columns with identical names

2015-04-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504030#comment-14504030 ] Michael Armbrust commented on SPARK-6635: - +1 to {{withName}} overwriting existing

[jira] [Commented] (SPARK-7008) An Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504114#comment-14504114 ] zhengruifeng commented on SPARK-7008: - thanks for this information! An Implement of

[jira] [Commented] (SPARK-7009) Build assembly JAR via ant to avoid zip64 problems

2015-04-20 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503752#comment-14503752 ] Steve Loughran commented on SPARK-7009: --- most of the others seemed fix by

[jira] [Updated] (SPARK-7016) Refactor dev/run-tests(-jenkins) from Bash to Python

2015-04-20 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brennon York updated SPARK-7016: Summary: Refactor dev/run-tests(-jenkins) from Bash to Python (was: Refactor

[jira] [Updated] (SPARK-7016) Refactor {{dev/run-tests(-jenkins)}} from Bash to Python

2015-04-20 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brennon York updated SPARK-7016: Summary: Refactor {{dev/run-tests(-jenkins)}} from Bash to Python (was: Refactor

[jira] [Updated] (SPARK-7020) Restrict module testing based on commit contents

2015-04-20 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brennon York updated SPARK-7020: Description: Currently all builds trigger all tests. This does not need to happen and, to minimize

[jira] [Commented] (SPARK-6917) Broken data returned to PySpark dataframe if any large numbers used in Scala land

2015-04-20 Thread Harry Brundage (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503915#comment-14503915 ] Harry Brundage commented on SPARK-6917: --- [~davies] or [~joshrosen] any idea why this

[jira] [Updated] (SPARK-5995) Make ML Prediction Developer APIs public

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5995: - Description: Previously, some Developer APIs were added to spark.ml for classification

[jira] [Issue Comment Deleted] (SPARK-3530) Pipeline and Parameters

2015-04-20 Thread Fan Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fan Jiang updated SPARK-3530: - Comment: was deleted (was: Hi Xiangrui, Which part of this pipeline project would you like us to work

[jira] [Created] (SPARK-7017) Refactor dev/run-tests into Python

2015-04-20 Thread Brennon York (JIRA)
Brennon York created SPARK-7017: --- Summary: Refactor dev/run-tests into Python Key: SPARK-7017 URL: https://issues.apache.org/jira/browse/SPARK-7017 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-7018) Refactor dev/run-tests-jenkins into Python

2015-04-20 Thread Brennon York (JIRA)
Brennon York created SPARK-7018: --- Summary: Refactor dev/run-tests-jenkins into Python Key: SPARK-7018 URL: https://issues.apache.org/jira/browse/SPARK-7018 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-7022) PySpark is missing ParamGridBuilder

2015-04-20 Thread Omede Firouz (JIRA)
Omede Firouz created SPARK-7022: --- Summary: PySpark is missing ParamGridBuilder Key: SPARK-7022 URL: https://issues.apache.org/jira/browse/SPARK-7022 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-7022) PySpark is missing ParamGridBuilder

2015-04-20 Thread Omede Firouz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Omede Firouz updated SPARK-7022: Description: PySpark is missing the entirety of ML.Tuning (see:

[jira] [Updated] (SPARK-6954) Dynamic allocation: numExecutorsPending in ExecutorAllocationManager should never become negative

2015-04-20 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6954: - Priority: Major (was: Minor) Dynamic allocation: numExecutorsPending in ExecutorAllocationManager

[jira] [Created] (SPARK-7021) JUnit output for Python tests

2015-04-20 Thread Brennon York (JIRA)
Brennon York created SPARK-7021: --- Summary: JUnit output for Python tests Key: SPARK-7021 URL: https://issues.apache.org/jira/browse/SPARK-7021 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-7016) Refactor dev/run-tests(-jenkins) from Bash to Python

2015-04-20 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brennon York updated SPARK-7016: Description: Currently the {{dev/run-tests}} and {{dev/run-tests-jenkins}} scripts are written in

[jira] [Created] (SPARK-7016) Refactor {dev/run-tests(-jenkins)} from Bash to Python

2015-04-20 Thread Brennon York (JIRA)
Brennon York created SPARK-7016: --- Summary: Refactor {dev/run-tests(-jenkins)} from Bash to Python Key: SPARK-7016 URL: https://issues.apache.org/jira/browse/SPARK-7016 Project: Spark Issue

[jira] [Comment Edited] (SPARK-6635) DataFrame.withColumn can create columns with identical names

2015-04-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504030#comment-14504030 ] Michael Armbrust edited comment on SPARK-6635 at 4/21/15 1:07 AM:

[jira] [Commented] (SPARK-5995) Make ML Prediction Developer APIs public

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504090#comment-14504090 ] Joseph K. Bradley commented on SPARK-5995: -- I just updated the design doc linked

[jira] [Created] (SPARK-7025) Create a Java-friendly input source API

2015-04-20 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7025: -- Summary: Create a Java-friendly input source API Key: SPARK-7025 URL: https://issues.apache.org/jira/browse/SPARK-7025 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6529) Word2Vec transformer

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504180#comment-14504180 ] Joseph K. Bradley commented on SPARK-6529: -- [~yinxusen] brings up a good point

[jira] [Updated] (SPARK-7022) PySpark is missing ParamGridBuilder

2015-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7022: - Assignee: Omede Firouz PySpark is missing ParamGridBuilder ---

[jira] [Updated] (SPARK-7022) PySpark is missing ParamGridBuilder

2015-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7022: - Target Version/s: 1.4.0 PySpark is missing ParamGridBuilder ---

[jira] [Resolved] (SPARK-4521) Parquet fails to read columns with spaces in the name

2015-04-20 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-4521. --- Resolution: Done This ticket is covered by SPARK-6607. Parquet fails to read columns with spaces in

[jira] [Resolved] (SPARK-6635) DataFrame.withColumn can create columns with identical names

2015-04-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6635. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5541

[jira] [Updated] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-20 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6738: - Description: ExternalAppendOnlyMap spill 2.2 GB data to disk: {code} 15/04/07 20:27:37 INFO

[jira] [Reopened] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-20 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen reopened SPARK-6738: -- There is a in SizeEstimator EstimateSize is difference with spill file size

[jira] [Updated] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-20 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6738: - Description: ExternalAppendOnlyMap spill 2.2 GB data to disk: {code} 15/04/07 20:27:37 INFO

[jira] [Comment Edited] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-20 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504202#comment-14504202 ] Hong Shen edited comment on SPARK-6738 at 4/21/15 2:54 AM: ---

[jira] [Assigned] (SPARK-4131) Support Writing data into the filesystem from queries

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4131: --- Assignee: Fei Wang (was: Apache Spark) Support Writing data into the filesystem from

[jira] [Assigned] (SPARK-4131) Support Writing data into the filesystem from queries

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4131: --- Assignee: Apache Spark (was: Fei Wang) Support Writing data into the filesystem from

[jira] [Updated] (SPARK-7025) Create a Java-friendly input source API

2015-04-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7025: --- Description: The goal of this ticket is to create a simple input source API that we can maintain and

[jira] [Commented] (SPARK-7015) Multiclass to Binary Reduction

2015-04-20 Thread Ram Sriharsha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504197#comment-14504197 ] Ram Sriharsha commented on SPARK-7015: -- sounds good. Let me know what reference you

[jira] [Updated] (SPARK-6954) Dynamic allocation: numExecutorsPending in ExecutorAllocationManager should never become negative

2015-04-20 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated SPARK-6954: - Attachment: without_fix.png with_fix.png I am uploading two diagrams that shows

[jira] [Updated] (SPARK-7025) Create a Java-friendly input source API

2015-04-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7025: --- Description: The goal of this ticket is to create a simple input source API that we can maintain and

[jira] [Comment Edited] (SPARK-7015) Multiclass to Binary Reduction

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504189#comment-14504189 ] Joseph K. Bradley edited comment on SPARK-7015 at 4/21/15 2:43 AM:

[jira] [Commented] (SPARK-7015) Multiclass to Binary Reduction

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504189#comment-14504189 ] Joseph K. Bradley commented on SPARK-7015: -- +1 I'd strongly vote for supporting

[jira] [Created] (SPARK-7023) [Spark SQL] Can't populate table size inforamtion into Hive metastore when create table or insert into table

2015-04-20 Thread Yi Zhou (JIRA)
Yi Zhou created SPARK-7023: -- Summary: [Spark SQL] Can't populate table size inforamtion into Hive metastore when create table or insert into table Key: SPARK-7023 URL: https://issues.apache.org/jira/browse/SPARK-7023

[jira] [Commented] (SPARK-5100) Spark Thrift server monitor page

2015-04-20 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504132#comment-14504132 ] Cheng Lian commented on SPARK-5100: --- Had offline discussion with [~tianyi], he's

[jira] [Resolved] (SPARK-6368) Build a specialized serializer for Exchange operator.

2015-04-20 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6368. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5497

[jira] [Updated] (SPARK-7015) Multiclass to Binary Reduction

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7015: - Component/s: (was: MLlib) ML Multiclass to Binary Reduction

[jira] [Commented] (SPARK-4521) Parquet fails to read columns with spaces in the name

2015-04-20 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504127#comment-14504127 ] Cheng Lian commented on SPARK-4521: --- Yes, I'm resolving this one. Parquet fails to

[jira] [Updated] (SPARK-4766) ML Estimator Params should be distinct from Transformer Params

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4766: - Description: Currently, in spark.ml, both Transformers and Estimators extend the same

[jira] [Commented] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504181#comment-14504181 ] Joseph K. Bradley commented on SPARK-4766: -- *Update*: A new issue was brought up

[jira] [Updated] (SPARK-4766) ML Estimator Params should be distinct from Transformer Params

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4766: - Summary: ML Estimator Params should be distinct from Transformer Params (was: ML

[jira] [Commented] (SPARK-6900) spark ec2 script enters infinite loop when run-instance fails

2015-04-20 Thread Guodong Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504236#comment-14504236 ] Guodong Wang commented on SPARK-6900: - Hi Nick, sorry for my late reply. I mark this

[jira] [Created] (SPARK-7024) Improve performance of function containsStar

2015-04-20 Thread Yadong Qi (JIRA)
Yadong Qi created SPARK-7024: Summary: Improve performance of function containsStar Key: SPARK-7024 URL: https://issues.apache.org/jira/browse/SPARK-7024 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-7024) Improve performance of function containsStar

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7024: --- Assignee: (was: Apache Spark) Improve performance of function containsStar

[jira] [Commented] (SPARK-7024) Improve performance of function containsStar

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504255#comment-14504255 ] Apache Spark commented on SPARK-7024: - User 'watermen' has created a pull request for

[jira] [Commented] (SPARK-6900) spark ec2 script enters infinite loop when run-instance fails

2015-04-20 Thread Guodong Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504253#comment-14504253 ] Guodong Wang commented on SPARK-6900: - In my opinion, it does not cost us much to fix

[jira] [Assigned] (SPARK-7024) Improve performance of function containsStar

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7024: --- Assignee: Apache Spark Improve performance of function containsStar

[jira] [Comment Edited] (SPARK-7015) Multiclass to Binary Reduction

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504335#comment-14504335 ] Joseph K. Bradley edited comment on SPARK-7015 at 4/21/15 5:21 AM:

[jira] [Commented] (SPARK-7015) Multiclass to Binary Reduction

2015-04-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504335#comment-14504335 ] Joseph K. Bradley commented on SPARK-7015: -- Your reference looks newer than ones

[jira] [Commented] (SPARK-7025) Create a Java-friendly input source API

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504368#comment-14504368 ] Apache Spark commented on SPARK-7025: - User 'rxin' has created a pull request for this

[jira] [Assigned] (SPARK-7025) Create a Java-friendly input source API

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7025: --- Assignee: Reynold Xin (was: Apache Spark) Create a Java-friendly input source API

[jira] [Commented] (SPARK-7008) An Implement of Factorization Machine (LibFM)

2015-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504369#comment-14504369 ] Xiangrui Meng commented on SPARK-7008: -- [~podongfeng] You implementation assumes that

[jira] [Assigned] (SPARK-7025) Create a Java-friendly input source API

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7025: --- Assignee: Apache Spark (was: Reynold Xin) Create a Java-friendly input source API

[jira] [Commented] (SPARK-7009) Build assembly JAR via ant to avoid zip64 problems

2015-04-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502695#comment-14502695 ] Sean Owen commented on SPARK-7009: -- Let's see if I remember this correctly: Java 7

[jira] [Assigned] (SPARK-7011) Build fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.

2015-04-20 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Sharma reassigned SPARK-7011: -- Assignee: Prashant Sharma Build fails with scala 2.11 option, because a

[jira] [Updated] (SPARK-3276) Provide a API to specify MIN_REMEMBER_DURATION for files to consider as input in streaming

2015-04-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3276: - Assignee: Emre Sevinç Provide a API to specify MIN_REMEMBER_DURATION for files to consider as input in

[jira] [Commented] (SPARK-7009) Build assembly JAR via ant to avoid zip64 problems

2015-04-20 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502657#comment-14502657 ] Steve Loughran commented on SPARK-7009: --- It's only 30 lines of diff including the

[jira] [Assigned] (SPARK-7011) Build fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7011: --- Assignee: Apache Spark (was: Prashant Sharma) Build fails with scala 2.11 option, because

[jira] [Commented] (SPARK-3276) Provide a API to specify MIN_REMEMBER_DURATION for files to consider as input in streaming

2015-04-20 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502702#comment-14502702 ] Emre Sevinç commented on SPARK-3276: Can someone with enough access rights assign this

[jira] [Assigned] (SPARK-7011) Build fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7011: --- Assignee: Prashant Sharma (was: Apache Spark) Build fails with scala 2.11 option, because

[jira] [Commented] (SPARK-7011) Build fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502703#comment-14502703 ] Apache Spark commented on SPARK-7011: - User 'ScrapCodes' has created a pull request

[jira] [Commented] (SPARK-7009) Build assembly JAR via ant to avoid zip64 problems

2015-04-20 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502675#comment-14502675 ] Steve Loughran commented on SPARK-7009: --- Looking at the [openJDK

[jira] [Created] (SPARK-7007) Add metrics source for ExecutorAllocationManager to expose internal status

2015-04-20 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-7007: -- Summary: Add metrics source for ExecutorAllocationManager to expose internal status Key: SPARK-7007 URL: https://issues.apache.org/jira/browse/SPARK-7007 Project: Spark

[jira] [Resolved] (SPARK-7010) How can i custom the external initialize when start the spark cluster

2015-04-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7010. -- Resolution: Invalid (Ask questions at u...@spark.apache.org) How can i custom the external initialize

[jira] [Commented] (SPARK-7006) Inconsistent behavior for ctrl-c in Spark shells

2015-04-20 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502612#comment-14502612 ] Cheolsoo Park commented on SPARK-7006: -- Thanks for asking about Ctrl-D. During a job

[jira] [Comment Edited] (SPARK-7008) Implement of Factorization Machine (LibFM)

2015-04-20 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502616#comment-14502616 ] Guoqiang Li edited comment on SPARK-7008 at 4/20/15 10:34 AM: --

[jira] [Created] (SPARK-7011) Build fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.

2015-04-20 Thread Prashant Sharma (JIRA)
Prashant Sharma created SPARK-7011: -- Summary: Build fails with scala 2.11 option, because a protected[sql] type is accessed in ml package. Key: SPARK-7011 URL: https://issues.apache.org/jira/browse/SPARK-7011

[jira] [Issue Comment Deleted] (SPARK-7005) resetProb error in pagerank

2015-04-20 Thread lisendong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lisendong updated SPARK-7005: - Comment: was deleted (was: oh...you are right... I'm so sorry, the result is exactly being scaled by N...

[jira] [Commented] (SPARK-7007) Add metrics source for ExecutorAllocationManager to expose internal status

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502542#comment-14502542 ] Apache Spark commented on SPARK-7007: - User 'jerryshao' has created a pull request for

[jira] [Assigned] (SPARK-7007) Add metrics source for ExecutorAllocationManager to expose internal status

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7007: --- Assignee: (was: Apache Spark) Add metrics source for ExecutorAllocationManager to

[jira] [Commented] (SPARK-1911) Warn users if their assembly jars are not built with Java 6

2015-04-20 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502589#comment-14502589 ] Steve Loughran commented on SPARK-1911: --- This doesn't fix the problem, merely

[jira] [Updated] (SPARK-7008) An Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Description: An implement of Factorization Machines based on Scala and Spark MLlib. Factorization

[jira] [Assigned] (SPARK-7008) Implement of Factorization Machine (LibFM)

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7008: --- Assignee: Apache Spark Implement of Factorization Machine (LibFM)

[jira] [Commented] (SPARK-7008) Implement of Factorization Machine (LibFM)

2015-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14502644#comment-14502644 ] Apache Spark commented on SPARK-7008: - User 'zhengruifeng' has created a pull request

  1   2   >