[jira] [Created] (SPARK-10461) make sure `input.primitive` is always variable name not code at GenerateUnsafeProjection

2015-09-05 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-10461: Summary: make sure `input.primitive` is always variable name not code at GenerateUnsafeProjection Key: SPARK-10461 URL: https://issues.apache.org/jira/browse/SPARK-10461

[jira] [Assigned] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10447: Assignee: Apache Spark > Upgrade pyspark to use py4j 0.9 >

[jira] [Assigned] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10447: Assignee: (was: Apache Spark) > Upgrade pyspark to use py4j 0.9 >

[jira] [Commented] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731896#comment-14731896 ] Apache Spark commented on SPARK-10447: User 'holdenk' has created a pull request for this issue:

[jira] [Commented] (SPARK-10199) Avoid using reflections for parquet model save

2015-09-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731834#comment-14731834 ] Joseph K. Bradley commented on SPARK-10199: I agree that the work required for these changes

[jira] [Commented] (SPARK-8632) Poor Python UDF performance because of RDD caching

2015-09-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731883#comment-14731883 ] Reynold Xin commented on SPARK-8632: We don't technically need to cache the RDD at all, do we? Can't

[jira] [Comment Edited] (SPARK-8632) Poor Python UDF performance because of RDD caching

2015-09-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731883#comment-14731883 ] Reynold Xin edited comment on SPARK-8632 at 9/5/15 8:47 AM: We don't

[jira] [Commented] (SPARK-5337) respect spark.task.cpus when launch executors

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731826#comment-14731826 ] Apache Spark commented on SPARK-5337: User 'CodingCat' has created a pull request for this issue:

[jira] [Updated] (SPARK-10460) fieldIndex method missing on spark.sql.Row

2015-09-05 Thread FELIPE Q B ALMEIDA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] FELIPE Q B ALMEIDA updated SPARK-10460: Environment: I'm running on an Ubuntu 14.04 32-bit machine, Java 7, spark 1.4.1. Jar

[jira] [Commented] (SPARK-10309) Some tasks failed with Unable to acquire memory

2015-09-05 Thread Naden Franciscus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731862#comment-14731862 ] Naden Franciscus commented on SPARK-10309: Why is this targeted for 1.6? We are finding this

[jira] [Updated] (SPARK-10460) fieldIndex method missing on spark.sql.Row

2015-09-05 Thread FELIPE Q B ALMEIDA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] FELIPE Q B ALMEIDA updated SPARK-10460: Description: {code:title=foo.scala|borderStyle=solid} val sc = new

[jira] [Resolved] (SPARK-10434) Parquet compatibility with 1.4 is broken when writing arrays that may contain nulls

2015-09-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-10434. Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull

[jira] [Created] (SPARK-10460) fieldIndex method on spark.sql.Row

2015-09-05 Thread FELIPE Q B ALMEIDA (JIRA)
FELIPE Q B ALMEIDA created SPARK-10460: Summary: fieldIndex method on spark.sql.Row Key: SPARK-10460 URL: https://issues.apache.org/jira/browse/SPARK-10460 Project: Spark Issue Type:

[jira] [Updated] (SPARK-10460) fieldIndex method missing on spark.sql.Row

2015-09-05 Thread FELIPE Q B ALMEIDA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] FELIPE Q B ALMEIDA updated SPARK-10460: Summary: fieldIndex method missing on spark.sql.Row (was: fieldIndex method on

[jira] [Updated] (SPARK-10460) fieldIndex method missing on spark.sql.Row

2015-09-05 Thread FELIPE Q B ALMEIDA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] FELIPE Q B ALMEIDA updated SPARK-10460: --- Description: ``` val sc = new SparkContext(cnf) val

[jira] [Updated] (SPARK-10460) fieldIndex method missing on spark.sql.Row

2015-09-05 Thread FELIPE Q B ALMEIDA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] FELIPE Q B ALMEIDA updated SPARK-10460: Description: {code:title=foo.scala|borderStyle=solid} val sc = new

[jira] [Assigned] (SPARK-10461) make sure `input.primitive` is always variable name not code at GenerateUnsafeProjection

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10461: Assignee: Apache Spark > make sure `input.primitive` is always variable name not code at

[jira] [Assigned] (SPARK-10461) make sure `input.primitive` is always variable name not code at GenerateUnsafeProjection

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10461: Assignee: (was: Apache Spark) > make sure `input.primitive` is always variable name

[jira] [Commented] (SPARK-10461) make sure `input.primitive` is always variable name not code at GenerateUnsafeProjection

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731881#comment-14731881 ] Apache Spark commented on SPARK-10461: User 'cloud-fan' has created a pull request for this issue:

[jira] [Resolved] (SPARK-10440) Update Spark Streaming Documentation for 1.5

2015-09-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-10440. Resolution: Fixed Fix Version/s: 1.5.0 > Update Spark Streaming Documentation for 1.5 >

[jira] [Commented] (SPARK-10433) Gradient boosted trees

2015-09-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731942#comment-14731942 ] Sean Owen commented on SPARK-10433: Oh that's great, do you know what change might have fixed it, or

[jira] [Assigned] (SPARK-8518) Log-linear models for survival analysis

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8518: Assignee: Apache Spark (was: Yanbo Liang) > Log-linear models for survival analysis >

[jira] [Commented] (SPARK-8518) Log-linear models for survival analysis

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731866#comment-14731866 ] Apache Spark commented on SPARK-8518: User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-8518) Log-linear models for survival analysis

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8518: Assignee: Yanbo Liang (was: Apache Spark) > Log-linear models for survival analysis >

[jira] [Commented] (SPARK-8630) Prevent from checkpointing QueueInputDStream

2015-09-05 Thread Andrei Dragomir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731867#comment-14731867 ] Andrei Dragomir commented on SPARK-8630: [~tdas] for me personally it's 1.4.1 > Prevent from

[jira] [Resolved] (SPARK-10426) Cannot call methods on a stopped SparkContext

2015-09-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10426. Resolution: Not A Problem > Cannot call methods on a stopped SparkContext >

[jira] [Updated] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10458: Component/s: Spark Core Is this basically "add SparkContext.isStopped"? Seems OK, though it will

[jira] [Commented] (SPARK-9961) ML prediction abstractions should have defaultEvaluator fields

2015-09-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731838#comment-14731838 ] Joseph K. Bradley commented on SPARK-9961: By "evaluator," I mean the Evaluator types in

[jira] [Comment Edited] (SPARK-10309) Some tasks failed with Unable to acquire memory

2015-09-05 Thread Naden Franciscus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731862#comment-14731862 ] Naden Franciscus edited comment on SPARK-10309 at 9/5/15 7:37 AM: Why

[jira] [Resolved] (SPARK-10013) Remove Java assert from Java unit tests

2015-09-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-10013. Resolution: Fixed Assignee: holdenk Fix Version/s: 1.6.0 > Remove Java assert

[jira] [Updated] (SPARK-10434) Parquet compatibility with 1.4 is broken when writing arrays that may contain nulls

2015-09-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10434: Target Version/s: 1.6.0, 1.5.1 (was: 1.5.0, 1.5.1) > Parquet compatibility with 1.4 is broken when

[jira] [Commented] (SPARK-10459) PythonUDF could process UnsafeRow

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731930#comment-14731930 ] Apache Spark commented on SPARK-10459: User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10459) PythonUDF could process UnsafeRow

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10459: Assignee: (was: Apache Spark) > PythonUDF could process UnsafeRow >

[jira] [Assigned] (SPARK-10459) PythonUDF could process UnsafeRow

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10459: Assignee: Apache Spark > PythonUDF could process UnsafeRow >

[jira] [Comment Edited] (SPARK-3231) select on a table in parquet format containing smallint as a field type does not work

2015-09-05 Thread Alex Rovner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732021#comment-14732021 ] Alex Rovner edited comment on SPARK-3231 at 9/5/15 4:03 PM: This is no longer

[jira] [Commented] (SPARK-3231) select on a table in parquet format containing smallint as a field type does not work

2015-09-05 Thread Alex Rovner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732021#comment-14732021 ] Alex Rovner commented on SPARK-3231: This is no longer an issue in master. I just verified that it

[jira] [Commented] (SPARK-9488) pyspark.sql.types.Row very slow when using named arguments

2015-09-05 Thread Alexey Grishchenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732112#comment-14732112 ] Alexey Grishchenko commented on SPARK-9488: OrderedDict implementation in Python is very simple

[jira] [Updated] (SPARK-9488) pyspark.sql.types.Row very slow when using named arguments

2015-09-05 Thread Alexey Grishchenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Grishchenko updated SPARK-9488: Attachment: Indexed Row Benchmark.png > pyspark.sql.types.Row very slow when using

[jira] [Updated] (SPARK-10451) Prevent unnecessary serializations in InMemoryColumnarTableScan

2015-09-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10451: Priority: Minor (was: Major) Component/s: SQL > Prevent unnecessary serializations in

[jira] [Commented] (SPARK-6740) SQL operator and condition precedence is not honoured

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731959#comment-14731959 ] Apache Spark commented on SPARK-6740: User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-3978) Schema change on Spark-Hive (Parquet file format) table not working

2015-09-05 Thread Alex Rovner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732033#comment-14732033 ] Alex Rovner commented on SPARK-3978: I have verified that altering a table stored as parquet files and

[jira] [Commented] (SPARK-10433) Gradient boosted trees

2015-09-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732137#comment-14732137 ] Joseph K. Bradley commented on SPARK-10433: I'm pretty sure it was this one:

[jira] [Assigned] (SPARK-7128) Add generic bagging algorithm to spark.ml

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7128: Assignee: Apache Spark > Add generic bagging algorithm to spark.ml >

[jira] [Commented] (SPARK-7128) Add generic bagging algorithm to spark.ml

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732157#comment-14732157 ] Apache Spark commented on SPARK-7128: User 'feynmanliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-7128) Add generic bagging algorithm to spark.ml

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7128: Assignee: (was: Apache Spark) > Add generic bagging algorithm to spark.ml >

[jira] [Created] (SPARK-10462) spark-ec2 not creating ephemeral volumes

2015-09-05 Thread Joseph E. Gonzalez (JIRA)
Joseph E. Gonzalez created SPARK-10462: Summary: spark-ec2 not creating ephemeral volumes Key: SPARK-10462 URL: https://issues.apache.org/jira/browse/SPARK-10462 Project: Spark Issue

[jira] [Commented] (SPARK-10132) daemon crash caused by memory leak

2015-09-05 Thread Hailong Wen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732163#comment-14732163 ] Hailong Wen commented on SPARK-10132: Latest updates: No matter how big the JVM is, it will finally

[jira] [Assigned] (SPARK-9841) Params.clear needs to be public

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9841: Assignee: (was: Apache Spark) > Params.clear needs to be public >

[jira] [Commented] (SPARK-9716) BinaryClassificationEvaluator should accept Double prediction column

2015-09-05 Thread Luvsandondov Lkhamsuren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732119#comment-14732119 ] Luvsandondov Lkhamsuren commented on SPARK-9716: I was thinking on having method that

[jira] [Commented] (SPARK-9841) Params.clear needs to be public

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732179#comment-14732179 ] Apache Spark commented on SPARK-9841: User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-9841) Params.clear needs to be public

2015-09-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9841: Assignee: Apache Spark > Params.clear needs to be public >