[jira] [Created] (SPARK-9114) The returned value is not converted into internal type in Python UDF

2015-07-16 Thread Davies Liu (JIRA)
Davies Liu created SPARK-9114: - Summary: The returned value is not converted into internal type in Python UDF Key: SPARK-9114 URL: https://issues.apache.org/jira/browse/SPARK-9114 Project: Spark

[jira] [Updated] (SPARK-6941) Provide a better error message to explain that tables created from RDDs are immutable

2015-07-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6941: Shepherd: Yin Huai Provide a better error message to explain that tables created from RDDs are immutable

[jira] [Updated] (SPARK-9082) Filter using non-deterministic expressions should not be pushed down

2015-07-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-9082: Shepherd: Yin Huai Filter using non-deterministic expressions should not be pushed down

[jira] [Commented] (SPARK-9112) Implement LogisticRegressionSummary similar to LinearRegressionSummary

2015-07-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630053#comment-14630053 ] Manoj Kumar commented on SPARK-9112: Yes, that is the idea. Also we need not port it

[jira] [Updated] (SPARK-9102) Improve project collapse with nondeterministic expressions

2015-07-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-9102: Shepherd: Yin Huai Improve project collapse with nondeterministic expressions

[jira] [Updated] (SPARK-9102) Improve project collapse with nondeterministic expressions

2015-07-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-9102: Assignee: Wenchen Fan Improve project collapse with nondeterministic expressions

[jira] [Resolved] (SPARK-9015) Maven cleanup / Clean Project Import in scala-ide

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-9015. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7375

[jira] [Updated] (SPARK-9015) Maven cleanup / Clean Project Import in scala-ide

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9015: - Assignee: Jan Prach Maven cleanup / Clean Project Import in scala-ide

[jira] [Closed] (SPARK-6217) insertInto doesn't work in PySpark

2015-07-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-6217. -- Assignee: Wenchen Fan insertInto doesn't work in PySpark --

[jira] [Resolved] (SPARK-6941) Provide a better error message to explain that tables created from RDDs are immutable

2015-07-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-6941. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7342

[jira] [Comment Edited] (SPARK-8682) Range Join for Spark SQL

2015-07-16 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630449#comment-14630449 ] Herman van Hovell edited comment on SPARK-8682 at 7/16/15 10:31 PM:

[jira] [Commented] (SPARK-9119) In some cases, we may save wrong decimal values to parquet

2015-07-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630556#comment-14630556 ] Yin Huai commented on SPARK-9119: - Actually, the impact of this issue is that whenever we

[jira] [Updated] (SPARK-4134) Dynamic allocation: tone down scary executor lost messages when killing on purpose

2015-07-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4134: - Summary: Dynamic allocation: tone down scary executor lost messages when killing on purpose (was: Tone

[jira] [Created] (SPARK-9116) Class in __main__ cannot be serialized by PySpark

2015-07-16 Thread Davies Liu (JIRA)
Davies Liu created SPARK-9116: - Summary: Class in __main__ cannot be serialized by PySpark Key: SPARK-9116 URL: https://issues.apache.org/jira/browse/SPARK-9116 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-9113) remove unnecessary analysis check code for self join

2015-07-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9113: Shepherd: Michael Armbrust Assignee: Wenchen Fan remove unnecessary analysis check

[jira] [Commented] (SPARK-9073) spark.ml Models copy() should call setParent when there is a parent

2015-07-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630307#comment-14630307 ] Joseph K. Bradley commented on SPARK-9073: -- Yes, thanks! I'll look at the PR as

[jira] [Updated] (SPARK-9073) spark.ml Models copy() should call setParent when there is a parent

2015-07-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9073: - Shepherd: Joseph K. Bradley spark.ml Models copy() should call setParent when there is a

[jira] [Resolved] (SPARK-8807) Add between operator in SparkR

2015-07-16 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-8807. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request

[jira] [Updated] (SPARK-8807) Add between operator in SparkR

2015-07-16 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-8807: - Assignee: Liang-Chi Hsieh Add between operator in SparkR

[jira] [Updated] (SPARK-8972) Incorrect result for rollup

2015-07-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-8972: Assignee: Cheng Hao Incorrect result for rollup --- Key:

[jira] [Resolved] (SPARK-8972) Incorrect result for rollup

2015-07-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-8972. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7343

[jira] [Commented] (SPARK-8646) PySpark does not run on YARN if master not provided in command line

2015-07-16 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629409#comment-14629409 ] Lianhui Wang commented on SPARK-8646: - yes, when i use this command:

[jira] [Resolved] (SPARK-9091) Add the codec interface to DStream.

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-9091. -- Resolution: Invalid [~carlmartin] I think you're familiar with

[jira] [Commented] (SPARK-9093) Fix single-quotes strings in SparkR

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629477#comment-14629477 ] Apache Spark commented on SPARK-9093: - User 'yu-iskw' has created a pull request for

[jira] [Commented] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629483#comment-14629483 ] SaintBacchus commented on SPARK-9091: - [~srowen] Sorry for forgetting to change the

[jira] [Updated] (SPARK-9096) Unevenly distributed task loads after using JavaRDD.subtract()

2015-07-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gisle Ytrestøl updated SPARK-9096: -- Description: When using JavaRDD.subtract(), it seems that the tasks are unevenly distributed

[jira] [Created] (SPARK-9097) Tasks are not completed but the number of executor is zero

2015-07-16 Thread KaiXinXIaoLei (JIRA)
KaiXinXIaoLei created SPARK-9097: Summary: Tasks are not completed but the number of executor is zero Key: SPARK-9097 URL: https://issues.apache.org/jira/browse/SPARK-9097 Project: Spark

[jira] [Updated] (SPARK-9096) Unevenly distributed task loads after using JavaRDD.subtract()

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9096: - Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) I am not sure it is a bug, yet.

[jira] [Commented] (SPARK-9093) Fix single-quotes strings in SparkR

2015-07-16 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629474#comment-14629474 ] Yu Ishikawa commented on SPARK-9093: I'm working on this issue. Fix single-quotes

[jira] [Updated] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9091: - Affects Version/s: (was: 1.5.0) Much better, though it can't affect 1.5.0 as this version does not

[jira] [Commented] (SPARK-9073) spark.ml Models copy() should call setParent when there is a parent

2015-07-16 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629527#comment-14629527 ] Kai Sasaki commented on SPARK-9073: --- [~josephkb] Hi, if possible, can I work on this

[jira] [Created] (SPARK-9095) Removes old Parquet support code

2015-07-16 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-9095: - Summary: Removes old Parquet support code Key: SPARK-9095 URL: https://issues.apache.org/jira/browse/SPARK-9095 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-9096) Unevenly distributed task loads after using JavaRDD.subtract()

2015-07-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gisle Ytrestøl updated SPARK-9096: -- Description: When using JavaRDD.subtract(), it seems that the tasks are unevenly distributed

[jira] [Updated] (SPARK-9096) Unevenly distributed task loads after using JavaRDD.subtract()

2015-07-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gisle Ytrestøl updated SPARK-9096: -- Attachment: reproduce.1.4.1.log.gz reproduce.1.3.1.log.gz

[jira] [Created] (SPARK-9096) Unevenly distributed task loads after using JavaRDD.subtract()

2015-07-16 Thread JIRA
Gisle Ytrestøl created SPARK-9096: - Summary: Unevenly distributed task loads after using JavaRDD.subtract() Key: SPARK-9096 URL: https://issues.apache.org/jira/browse/SPARK-9096 Project: Spark

[jira] [Assigned] (SPARK-9095) Removes old Parquet support code

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9095: --- Assignee: Cheng Lian (was: Apache Spark) Removes old Parquet support code

[jira] [Assigned] (SPARK-9093) Fix single-quotes strings in SparkR

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9093: --- Assignee: Apache Spark Fix single-quotes strings in SparkR

[jira] [Updated] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SaintBacchus updated SPARK-9091: Description: Since the RDD has the function *saveAsTextFile* which can use *CompressionCodec* to

[jira] [Assigned] (SPARK-9093) Fix single-quotes strings in SparkR

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9093: --- Assignee: (was: Apache Spark) Fix single-quotes strings in SparkR

[jira] [Commented] (SPARK-9067) Memory overflow and open file limit exhaustion for NewParquetRDD+CoalescedRDD

2015-07-16 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629482#comment-14629482 ] Liang-Chi Hsieh commented on SPARK-9067: Thanks for reporting that. I updated the

[jira] [Updated] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SaintBacchus updated SPARK-9091: Description: Since the RDD has the function *saveAsTextFile* which can use *CompressionCodec* to

[jira] [Commented] (SPARK-9095) Removes old Parquet support code

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629546#comment-14629546 ] Apache Spark commented on SPARK-9095: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-9095) Removes old Parquet support code

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9095: --- Assignee: Apache Spark (was: Cheng Lian) Removes old Parquet support code

[jira] [Created] (SPARK-9093) Fix single-quotes strings in SparkR

2015-07-16 Thread Yu Ishikawa (JIRA)
Yu Ishikawa created SPARK-9093: -- Summary: Fix single-quotes strings in SparkR Key: SPARK-9093 URL: https://issues.apache.org/jira/browse/SPARK-9093 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SaintBacchus updated SPARK-9091: Summary: Add the codec interface to Text DStream. (was: Add the codec interface to DStream.) Add

[jira] [Updated] (SPARK-9091) Add the codec interface to DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SaintBacchus updated SPARK-9091: Priority: Minor (was: Major) Add the codec interface to DStream.

[jira] [Updated] (SPARK-9091) Add the codec interface to DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SaintBacchus updated SPARK-9091: Issue Type: Improvement (was: Bug) Add the codec interface to DStream.

[jira] [Updated] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SaintBacchus updated SPARK-9091: Description: Since the RDD has the function *saveAsTextFile* which can use *CompressionCodec* to

[jira] [Updated] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SaintBacchus updated SPARK-9091: Description: Since the RDD has the function *saveAsTextFile* which can use *CompressionCodec* to

[jira] [Created] (SPARK-9094) Increase io.dropwizard.metrics dependency to 3.1.2

2015-07-16 Thread JIRA
Carl Anders Düvel created SPARK-9094: Summary: Increase io.dropwizard.metrics dependency to 3.1.2 Key: SPARK-9094 URL: https://issues.apache.org/jira/browse/SPARK-9094 Project: Spark

[jira] [Commented] (SPARK-9052) Fix comments after curly braces

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629510#comment-14629510 ] Apache Spark commented on SPARK-9052: - User 'yu-iskw' has created a pull request for

[jira] [Assigned] (SPARK-9052) Fix comments after curly braces

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9052: --- Assignee: (was: Apache Spark) Fix comments after curly braces

[jira] [Assigned] (SPARK-9052) Fix comments after curly braces

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9052: --- Assignee: Apache Spark Fix comments after curly braces ---

[jira] [Updated] (SPARK-9097) Tasks are not completed but the number of executor is zero

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9097: - Fix Version/s: (was: 1.5.0) [~KaiXinXIaoLei] Again please read

[jira] [Commented] (SPARK-9094) Increase io.dropwizard.metrics dependency to 3.1.2

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629558#comment-14629558 ] Sean Owen commented on SPARK-9094: -- Yes, you also need to update your PR title. The

[jira] [Commented] (SPARK-8807) Add between operator in SparkR

2015-07-16 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629391#comment-14629391 ] Yu Ishikawa commented on SPARK-8807: [~yalamart] Sorry for the delay of my reply. And

[jira] [Updated] (SPARK-9092) Make --num-executors compatible with dynamic allocation

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9092: - Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) Make --num-executors compatible

[jira] [Commented] (SPARK-6442) MLlib Local Linear Algebra Package

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629320#comment-14629320 ] Sean Owen commented on SPARK-6442: -- [~mengxr] for Commons Math, and point #2: actually

[jira] [Assigned] (SPARK-8646) PySpark does not run on YARN if master not provided in command line

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8646: --- Assignee: Apache Spark PySpark does not run on YARN if master not provided in command line

[jira] [Assigned] (SPARK-8646) PySpark does not run on YARN if master not provided in command line

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8646: --- Assignee: (was: Apache Spark) PySpark does not run on YARN if master not provided in

[jira] [Commented] (SPARK-8646) PySpark does not run on YARN if master not provided in command line

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629407#comment-14629407 ] Apache Spark commented on SPARK-8646: - User 'lianhuiwang' has created a pull request

[jira] [Comment Edited] (SPARK-9019) spark-submit fails on yarn with kerberos enabled

2015-07-16 Thread Bolke de Bruin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629432#comment-14629432 ] Bolke de Bruin edited comment on SPARK-9019 at 7/16/15 8:39 AM:

[jira] [Updated] (SPARK-8893) Require positive partition counts in RDD.repartition

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8893: - Assignee: Daniel Darabos Require positive partition counts in RDD.repartition

[jira] [Resolved] (SPARK-8893) Require positive partition counts in RDD.repartition

2015-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8893. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7285

[jira] [Commented] (SPARK-9067) Memory overflow and open file limit exhaustion for NewParquetRDD+CoalescedRDD

2015-07-16 Thread konstantin knizhnik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629399#comment-14629399 ] konstantin knizhnik commented on SPARK-9067: I have found workaround for the

[jira] [Commented] (SPARK-9067) Memory overflow and open file limit exhaustion for NewParquetRDD+CoalescedRDD

2015-07-16 Thread konstantin knizhnik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629358#comment-14629358 ] konstantin knizhnik commented on SPARK-9067: Sorry, but this patch doesn't

[jira] [Comment Edited] (SPARK-8646) PySpark does not run on YARN if master not provided in command line

2015-07-16 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629409#comment-14629409 ] Lianhui Wang edited comment on SPARK-8646 at 7/16/15 8:25 AM: --

[jira] [Comment Edited] (SPARK-8646) PySpark does not run on YARN if master not provided in command line

2015-07-16 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629409#comment-14629409 ] Lianhui Wang edited comment on SPARK-8646 at 7/16/15 8:26 AM: --

[jira] [Commented] (SPARK-9019) spark-submit fails on yarn with kerberos enabled

2015-07-16 Thread Bolke de Bruin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629432#comment-14629432 ] Bolke de Bruin commented on SPARK-9019: --- I tried running this on an update

[jira] [Created] (SPARK-9098) Inconsistent Dense Vectors hashing between PySpark and Scala

2015-07-16 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-9098: - Summary: Inconsistent Dense Vectors hashing between PySpark and Scala Key: SPARK-9098 URL: https://issues.apache.org/jira/browse/SPARK-9098 Project: Spark

[jira] [Created] (SPARK-9099) spark-ec2 does not add important ports to security group

2015-07-16 Thread Brian Sung-jin Hong (JIRA)
Brian Sung-jin Hong created SPARK-9099: -- Summary: spark-ec2 does not add important ports to security group Key: SPARK-9099 URL: https://issues.apache.org/jira/browse/SPARK-9099 Project: Spark

[jira] [Issue Comment Deleted] (SPARK-9044) Updated RDD name does not reflect under Storage tab

2015-07-16 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhang, Liye updated SPARK-9044: --- Comment: was deleted (was: Well, I think the component is correct, still it's business of Web UI)

[jira] [Commented] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629632#comment-14629632 ] Apache Spark commented on SPARK-9091: - User 'SaintBacchus' has created a pull request

[jira] [Created] (SPARK-9100) DataFrame reader/writer shortcut methods for ORC

2015-07-16 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-9100: - Summary: DataFrame reader/writer shortcut methods for ORC Key: SPARK-9100 URL: https://issues.apache.org/jira/browse/SPARK-9100 Project: Spark Issue Type:

[jira] [Commented] (SPARK-9098) Inconsistent Dense Vectors hashing between PySpark and Scala

2015-07-16 Thread Abou Haydar Elias (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629685#comment-14629685 ] Abou Haydar Elias commented on SPARK-9098: -- This issue creates an inconsistency

[jira] [Created] (SPARK-9101) Can't use null in selectExpr

2015-07-16 Thread JIRA
Mateusz Buśkiewicz created SPARK-9101: - Summary: Can't use null in selectExpr Key: SPARK-9101 URL: https://issues.apache.org/jira/browse/SPARK-9101 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-9100) DataFrame reader/writer shortcut methods for ORC

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9100: --- Assignee: Cheng Lian (was: Apache Spark) DataFrame reader/writer shortcut methods for ORC

[jira] [Assigned] (SPARK-9100) DataFrame reader/writer shortcut methods for ORC

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9100: --- Assignee: Apache Spark (was: Cheng Lian) DataFrame reader/writer shortcut methods for ORC

[jira] [Commented] (SPARK-9091) Add the codec interface to Text DStream.

2015-07-16 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629639#comment-14629639 ] SaintBacchus commented on SPARK-9091: - [~sowen] I agree user can design the output by

[jira] [Updated] (SPARK-9097) Tasks are not completed but the number of executor is zero

2015-07-16 Thread KaiXinXIaoLei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] KaiXinXIaoLei updated SPARK-9097: - Attachment: number of executor is zero.png Tasks are not completed but the number of executor is

[jira] [Commented] (SPARK-9099) spark-ec2 does not add important ports to security group

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629649#comment-14629649 ] Apache Spark commented on SPARK-9099: - User 'serialx' has created a pull request for

[jira] [Commented] (SPARK-9097) Tasks are not completed but the number of executor is zero

2015-07-16 Thread KaiXinXIaoLei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629673#comment-14629673 ] KaiXinXIaoLei commented on SPARK-9097: -- I run a big job. During running tasks, five

[jira] [Comment Edited] (SPARK-9097) Tasks are not completed but the number of executor is zero

2015-07-16 Thread KaiXinXIaoLei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629673#comment-14629673 ] KaiXinXIaoLei edited comment on SPARK-9097 at 7/16/15 12:57 PM:

[jira] [Updated] (SPARK-9096) Unevenly distributed task loads after using JavaRDD.subtract()

2015-07-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gisle Ytrestøl updated SPARK-9096: -- Attachment: hanging-one-task.jpg Unevenly distributed task loads after using

[jira] [Updated] (SPARK-9097) Tasks are not completed but the number of executor is zero

2015-07-16 Thread KaiXinXIaoLei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] KaiXinXIaoLei updated SPARK-9097: - Attachment: tasks are not completed.png Tasks are not completed but the number of executor is

[jira] [Updated] (SPARK-9097) Tasks are not completed but the number of executor is zero

2015-07-16 Thread KaiXinXIaoLei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] KaiXinXIaoLei updated SPARK-9097: - Target Version/s: 1.5.0 Tasks are not completed but the number of executor is zero

[jira] [Commented] (SPARK-9044) Updated RDD name does not reflect under Storage tab

2015-07-16 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629704#comment-14629704 ] Zhang, Liye commented on SPARK-9044: Well, I think the component is correct, still

[jira] [Commented] (SPARK-9044) Updated RDD name does not reflect under Storage tab

2015-07-16 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629703#comment-14629703 ] Zhang, Liye commented on SPARK-9044: Well, I think the component is correct, still

[jira] [Commented] (SPARK-9096) Unevenly distributed task loads after using JavaRDD.subtract()

2015-07-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629584#comment-14629584 ] Gisle Ytrestøl commented on SPARK-9096: --- Hi, thanks for responding. I've added a

[jira] [Commented] (SPARK-6001) K-Means clusterer should return the assignments of input points to clusters

2015-07-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629808#comment-14629808 ] Manoj Kumar commented on SPARK-6001: I just started to work on this. K-Means

[jira] [Created] (SPARK-9104) expose network layer memory usage in shuffle part

2015-07-16 Thread Zhang, Liye (JIRA)
Zhang, Liye created SPARK-9104: -- Summary: expose network layer memory usage in shuffle part Key: SPARK-9104 URL: https://issues.apache.org/jira/browse/SPARK-9104 Project: Spark Issue Type:

[jira] [Created] (SPARK-9105) Add an additional WebUi Tab for Memory Usage

2015-07-16 Thread Zhang, Liye (JIRA)
Zhang, Liye created SPARK-9105: -- Summary: Add an additional WebUi Tab for Memory Usage Key: SPARK-9105 URL: https://issues.apache.org/jira/browse/SPARK-9105 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-9073) spark.ml Models copy() should call setParent when there is a parent

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9073: --- Assignee: (was: Apache Spark) spark.ml Models copy() should call setParent when there

[jira] [Assigned] (SPARK-9073) spark.ml Models copy() should call setParent when there is a parent

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9073: --- Assignee: Apache Spark spark.ml Models copy() should call setParent when there is a parent

[jira] [Created] (SPARK-9102) Improve project collapse with nondeterministic expressions

2015-07-16 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-9102: -- Summary: Improve project collapse with nondeterministic expressions Key: SPARK-9102 URL: https://issues.apache.org/jira/browse/SPARK-9102 Project: Spark Issue

[jira] [Created] (SPARK-9103) Tracking spark's memory usage

2015-07-16 Thread Zhang, Liye (JIRA)
Zhang, Liye created SPARK-9103: -- Summary: Tracking spark's memory usage Key: SPARK-9103 URL: https://issues.apache.org/jira/browse/SPARK-9103 Project: Spark Issue Type: Umbrella

[jira] [Assigned] (SPARK-9082) Filter using non-deterministic expressions should not be pushed down

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9082: --- Assignee: Apache Spark (was: Wenchen Fan) Filter using non-deterministic expressions

[jira] [Commented] (SPARK-9082) Filter using non-deterministic expressions should not be pushed down

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629816#comment-14629816 ] Apache Spark commented on SPARK-9082: - User 'cloud-fan' has created a pull request for

[jira] [Commented] (SPARK-9100) DataFrame reader/writer shortcut methods for ORC

2015-07-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629752#comment-14629752 ] Apache Spark commented on SPARK-9100: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-9059) Update Direct Kafka Word count examples to show the use of HasOffsetRanges

2015-07-16 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629770#comment-14629770 ] Benjamin Fradet commented on SPARK-9059: I've started working on this. Update
