[jira] [Assigned] (SPARK-12872) Support to specify the option for compression codec for JSON datasource.

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12872: Assignee: Apache Spark > Support to specify the option for compression codec for JSON

[jira] [Assigned] (SPARK-12872) Support to specify the option for compression codec for JSON datasource.

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12872: Assignee: (was: Apache Spark) > Support to specify the option for compression codec

[jira] [Commented] (SPARK-12872) Support to specify the option for compression codec for JSON datasource.

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109871#comment-15109871 ] Apache Spark commented on SPARK-12872: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Created] (SPARK-12946) The SQL page is empty

2016-01-20 Thread KaiXinXIaoLei (JIRA)
KaiXinXIaoLei created SPARK-12946: - Summary: The SQL page is empty Key: SPARK-12946 URL: https://issues.apache.org/jira/browse/SPARK-12946 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-12946) The SQL page is empty

2016-01-20 Thread KaiXinXIaoLei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] KaiXinXIaoLei updated SPARK-12946: -- Attachment: SQLpage.png > The SQL page is empty > - > >

[jira] [Assigned] (SPARK-12790) Remove HistoryServer old multiple files format

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12790: Assignee: Felix Cheung (was: Apache Spark) > Remove HistoryServer old multiple files

[jira] [Created] (SPARK-12947) Spark with Swift throws EOFException when reading parquet file

2016-01-20 Thread Sam Stoelinga (JIRA)
Sam Stoelinga created SPARK-12947: - Summary: Spark with Swift throws EOFException when reading parquet file Key: SPARK-12947 URL: https://issues.apache.org/jira/browse/SPARK-12947 Project: Spark

[jira] [Commented] (SPARK-12863) missing api for renaming and mapping result of operations on GroupedDataset to case classes

2016-01-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110045#comment-15110045 ] Hyukjin Kwon commented on SPARK-12863: -- I understand this might possibly be an issue although I have

[jira] [Commented] (SPARK-12790) Remove HistoryServer old multiple files format

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110048#comment-15110048 ] Apache Spark commented on SPARK-12790: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-12790) Remove HistoryServer old multiple files format

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12790: Assignee: Apache Spark (was: Felix Cheung) > Remove HistoryServer old multiple files

[jira] [Updated] (SPARK-12947) Spark with Swift throws EOFException when reading parquet file

2016-01-20 Thread Sam Stoelinga (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sam Stoelinga updated SPARK-12947: -- Component/s: SQL > Spark with Swift throws EOFException when reading parquet file >

[jira] [Created] (SPARK-12949) Support common expression elimination

2016-01-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12949: -- Summary: Support common expression elimination Key: SPARK-12949 URL: https://issues.apache.org/jira/browse/SPARK-12949 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-12945) ERROR LiveListenerBus: Listener JobProgressListener threw an exception

2016-01-20 Thread Tristan (JIRA)
Tristan created SPARK-12945: --- Summary: ERROR LiveListenerBus: Listener JobProgressListener threw an exception Key: SPARK-12945 URL: https://issues.apache.org/jira/browse/SPARK-12945 Project: Spark

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-01-20 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110082#comment-15110082 ] Mark Grover commented on SPARK-12177: - Hi Mario, I may have misunderstood some parts of your previous

[jira] [Created] (SPARK-12950) Improve performance of BytesToBytesMap

2016-01-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12950: -- Summary: Improve performance of BytesToBytesMap Key: SPARK-12950 URL: https://issues.apache.org/jira/browse/SPARK-12950 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-12941) Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype

2016-01-20 Thread Jayadevan M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110112#comment-15110112 ] Jayadevan M commented on SPARK-12941: - [~jpoblete] Would like to work on this. > Spark-SQL JDBC

[jira] [Commented] (SPARK-12941) Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype

2016-01-20 Thread Thomas Sebastian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110124#comment-15110124 ] Thomas Sebastian commented on SPARK-12941: -- [~jayadevan.m] and [~jpoblete] As per the details,

[jira] [Resolved] (SPARK-8968) dynamic partitioning in spark sql performance issue due to the high GC overhead

2016-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8968. Resolution: Fixed Assignee: Fei Wang Fix Version/s: 2.0.0 > dynamic partitioning in

[jira] [Commented] (SPARK-6847) Stack overflow on updateStateByKey which followed by a dstream with checkpoint set

2016-01-20 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109955#comment-15109955 ] Shixiong Zhu commented on SPARK-6847: - I think this one has been fixed by

[jira] [Commented] (SPARK-7848) Update SparkStreaming docs to incorporate FAQ and/or bullets w/ "knobs" information.

2016-01-20 Thread Nirman Narang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110058#comment-15110058 ] Nirman Narang commented on SPARK-7848: -- Even I think it's better to put these comments into places

[jira] [Commented] (SPARK-10870) Criteo Display Advertising Challenge

2016-01-20 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110060#comment-15110060 ] Yu Ishikawa commented on SPARK-10870: - [~prudenko] Should we test the Kaggle data with the winning

[jira] [Commented] (SPARK-12948) Consider reducing size of broadcasts in OrcRelation

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110072#comment-15110072 ] Apache Spark commented on SPARK-12948: -- User 'rajeshbalamohan' has created a pull request for this

[jira] [Assigned] (SPARK-12948) Consider reducing size of broadcasts in OrcRelation

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12948: Assignee: Apache Spark > Consider reducing size of broadcasts in OrcRelation >

[jira] [Resolved] (SPARK-12204) Implement drop method for DataFrame in SparkR

2016-01-20 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-12204. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request

[jira] [Created] (SPARK-12948) Consider reducing size of broadcasts in OrcRelation

2016-01-20 Thread Rajesh Balamohan (JIRA)
Rajesh Balamohan created SPARK-12948: Summary: Consider reducing size of broadcasts in OrcRelation Key: SPARK-12948 URL: https://issues.apache.org/jira/browse/SPARK-12948 Project: Spark

[jira] [Updated] (SPARK-12948) Consider reducing size of broadcasts in OrcRelation

2016-01-20 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated SPARK-12948: - Attachment: SPARK-12948_cpuProf.png > Consider reducing size of broadcasts in

[jira] [Updated] (SPARK-12204) Implement drop method for DataFrame in SparkR

2016-01-20 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-12204: -- Assignee: Sun Rui > Implement drop method for DataFrame in SparkR >

[jira] [Created] (SPARK-12951) Support spilling in generate aggregate

2016-01-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12951: -- Summary: Support spilling in generate aggregate Key: SPARK-12951 URL: https://issues.apache.org/jira/browse/SPARK-12951 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-12910) Support for specifying version of R to use while creating sparkR libraries

2016-01-20 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-12910: -- Assignee: Shubhanshu Mishra > Support for specifying version of R to use while

[jira] [Updated] (SPARK-12910) Support for specifying version of R to use while creating sparkR libraries

2016-01-20 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-12910: -- Shepherd: (was: Shivram Mani) > Support for specifying version of R to use

[jira] [Resolved] (SPARK-12910) Support for specifying version of R to use while creating sparkR libraries

2016-01-20 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-12910. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request

[jira] [Assigned] (SPARK-12948) Consider reducing size of broadcasts in OrcRelation

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12948: Assignee: (was: Apache Spark) > Consider reducing size of broadcasts in OrcRelation >

[jira] [Assigned] (SPARK-12798) Broadcast hash join

2016-01-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-12798: -- Assignee: Davies Liu > Broadcast hash join > --- > > Key:

[jira] [Assigned] (SPARK-12896) Send only accumulator updates, not TaskMetrics, to the driver

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12896: Assignee: Andrew Or (was: Apache Spark) > Send only accumulator updates, not

[jira] [Commented] (SPARK-12896) Send only accumulator updates, not TaskMetrics, to the driver

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109858#comment-15109858 ] Apache Spark commented on SPARK-12896: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12896) Send only accumulator updates, not TaskMetrics, to the driver

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12896: Assignee: Apache Spark (was: Andrew Or) > Send only accumulator updates, not

[jira] [Commented] (SPARK-3374) Spark on Yarn remove deprecated configs for 2.0

2016-01-20 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110053#comment-15110053 ] Felix Cheung commented on SPARK-3374: - +1 esp on config handling. SPARK-12343 gets messy with places

[jira] [Commented] (SPARK-12952) EMLDAOptimizer initialize should return EMLDAOptimizer other than its parent class

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110214#comment-15110214 ] Apache Spark commented on SPARK-12952: -- User 'yinxusen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12952) EMLDAOptimizer initialize should return EMLDAOptimizer other than its parent class

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12952: Assignee: (was: Apache Spark) > EMLDAOptimizer initialize should return

[jira] [Assigned] (SPARK-12952) EMLDAOptimizer initialize should return EMLDAOptimizer other than its parent class

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12952: Assignee: Apache Spark > EMLDAOptimizer initialize should return EMLDAOptimizer other

[jira] [Assigned] (SPARK-12908) Add tests to make sure that ml.classification.LogisticRegression returns meaningful result when labels are the same without intercept

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12908: Assignee: Apache Spark > Add tests to make sure that ml.classification.LogisticRegression

[jira] [Commented] (SPARK-12908) Add tests to make sure that ml.classification.LogisticRegression returns meaningful result when labels are the same without intercept

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110185#comment-15110185 ] Apache Spark commented on SPARK-12908: -- User 'dbtsai' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12908) Add tests to make sure that ml.classification.LogisticRegression returns meaningful result when labels are the same without intercept

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12908: Assignee: (was: Apache Spark) > Add tests to make sure that

[jira] [Assigned] (SPARK-12908) Add tests to make sure that ml.classification.LogisticRegression returns meaningful result when labels are the same without intercept

2016-01-20 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-12908: --- Assignee: DB Tsai > Add tests to make sure that ml.classification.LogisticRegression returns >

[jira] [Created] (SPARK-12952) EMLDAOptimizer initialize should return EMLDAOptimizer other than its parent class

2016-01-20 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-12952: - Summary: EMLDAOptimizer initialize should return EMLDAOptimizer other than its parent class Key: SPARK-12952 URL: https://issues.apache.org/jira/browse/SPARK-12952

[jira] [Commented] (SPARK-7799) Move "StreamingContext.actorStream" to a separate project and deprecate it in StreamingContext

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109836#comment-15109836 ] Apache Spark commented on SPARK-7799: - User 'zsxwing' has created a pull request for this issue:

[jira] [Commented] (SPARK-12929) Adding IBM Platform Conductor for Spark as new Supplemental Project

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109000#comment-15109000 ] Sean Owen commented on SPARK-12929: --- I believe you need to call it "Apache Spark" in general since that

[jira] [Closed] (SPARK-12924) Make Token pattern matching in the parser case insensitive

2016-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-12924. --- Resolution: Invalid > Make Token pattern matching in the parser case insensitive >

[jira] [Closed] (SPARK-11857) Remove Mesos fine-grained mode subject to discussions

2016-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-11857. --- Resolution: Later Closing this as later until we can figure out what's going on with resource

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-01-20 Thread Mario Briggs (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109056#comment-15109056 ] Mario Briggs commented on SPARK-12177: -- bq. I totally understand what you mean. However, kafka has

[jira] [Commented] (SPARK-11295) Add packages to JUnit output for Python tests

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109070#comment-15109070 ] Apache Spark commented on SPARK-11295: -- User 'mengxr' has created a pull request for this issue:

[jira] [Commented] (SPARK-12823) Cannot create UDF with StructType input

2016-01-20 Thread Frank Rosner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108689#comment-15108689 ] Frank Rosner commented on SPARK-12823: -- Ok :( :D > Cannot create UDF with StructType input >

[jira] [Resolved] (SPARK-12927) Dataframe doesn't display the current status on Aerospike.

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12927. --- Resolution: Not A Problem You have copied the contents of a DataFrame to the driver with collect().

[jira] [Resolved] (SPARK-3431) Parallelize Scala/Java test execution

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3431. -- Resolution: Won't Fix I think this is a "cantfix" at this point; just too many tests that collide with

[jira] [Updated] (SPARK-3374) Spark on Yarn remove deprecated configs for 2.0

2016-01-20 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-3374: - Parent Issue: SPARK-11806 (was: SPARK-3492) > Spark on Yarn remove deprecated configs for 2.0 >

[jira] [Updated] (SPARK-3374) Spark on Yarn remove deprecated configs for 2.0

2016-01-20 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-3374: - Summary: Spark on Yarn remove deprecated configs for 2.0 (was: Spark on Yarn config cleanup) >

[jira] [Updated] (SPARK-11327) spark-dispatcher doesn't pass along some spark properties

2016-01-20 Thread Alan Braithwaite (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Braithwaite updated SPARK-11327: - Description: I haven't figured out exactly what's going on yet, but there's something in

[jira] [Comment Edited] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-01-20 Thread Mario Briggs (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109056#comment-15109056 ] Mario Briggs edited comment on SPARK-12177 at 1/20/16 6:16 PM: --- bq. I

[jira] [Updated] (SPARK-12816) Schema generation for type aliases does not work

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12816: -- Assignee: Jakob Odersky > Schema generation for type aliases does not work >

[jira] [Updated] (SPARK-12906) LongSQLMetricValue cause memory leak on Spark 1.5.1

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12906: -- Component/s: SQL > LongSQLMetricValue cause memory leak on Spark 1.5.1 >

[jira] [Updated] (SPARK-12911) Cacheing a dataframe causes array comparisons to fail (in filter / where) after 1.6

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12911: -- Component/s: SQL > Cacheing a dataframe causes array comparisons to fail (in filter / where) > after

[jira] [Updated] (SPARK-12921) Use SparkHadoopUtil reflection to access TaskAttemptContext in SpecificParquetRecordReaderBase

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12921: -- Component/s: SQL Spark Core > Use SparkHadoopUtil reflection to access

[jira] [Updated] (SPARK-12892) Support plugging in Spark scheduler

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12892: -- Component/s: Spark Core > Support plugging in Spark scheduler >

[jira] [Updated] (SPARK-12915) SQL metrics for generated operators

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12915: -- Component/s: SQL > SQL metrics for generated operators > --- > >

[jira] [Resolved] (SPARK-4232) Truncate table not works when specific the table from non-current database session

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4232. -- Resolution: Not A Problem Target Version/s: (was: 1.1.2, 1.2.1) I'm not clear, but it

[jira] [Commented] (SPARK-12911) Cacheing a dataframe causes array comparisons to fail (in filter / where) after 1.6

2016-01-20 Thread Jayadevan M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108795#comment-15108795 ] Jayadevan M commented on SPARK-12911: - [~jesse.english] I just try following statments in

[jira] [Reopened] (SPARK-12929) Adding IBM Platform Conductor for Spark as new Supplemental Project

2016-01-20 Thread Kevin Doyle (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Doyle reopened SPARK-12929: - Thanks Sean for the super quick turnaround on this. One small adjustment is needed. The product

[jira] [Resolved] (SPARK-6519) Add spark.ml API for bisecting k-means

2016-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6519. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 9604

[jira] [Commented] (SPARK-12906) LongSQLMetricValue cause memory leak on Spark 1.5.1

2016-01-20 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109131#comment-15109131 ] Shixiong Zhu commented on SPARK-12906: -- Did you run some no sql jobs? If so, I think it may be same

[jira] [Commented] (SPARK-11806) Spark 2.0 deprecations and removals

2016-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109031#comment-15109031 ] Reynold Xin commented on SPARK-11806: - No not yet. I don't know how many we want to deprecate though.

[jira] [Commented] (SPARK-12929) Adding IBM Platform Conductor for Spark as new Supplemental Project

2016-01-20 Thread Kevin Doyle (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109090#comment-15109090 ] Kevin Doyle commented on SPARK-12929: - I don't see Apache Spark used for other projects listed on

[jira] [Commented] (SPARK-12929) Adding IBM Platform Conductor for Spark as new Supplemental Project

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109142#comment-15109142 ] Sean Owen commented on SPARK-12929: --- While "SparkR" is actually part of Spark, I agree about things

[jira] [Commented] (SPARK-12911) Cacheing a dataframe causes array comparisons to fail (in filter / where) after 1.6

2016-01-20 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108715#comment-15108715 ] kevin yu commented on SPARK-12911: -- I will look into this . Thanks. Kevin > Cacheing a dataframe causes

[jira] [Created] (SPARK-12928) Oracle FLOAT datatype is not properly handled when reading via JDBC

2016-01-20 Thread Greg Michalopoulos (JIRA)
Greg Michalopoulos created SPARK-12928: -- Summary: Oracle FLOAT datatype is not properly handled when reading via JDBC Key: SPARK-12928 URL: https://issues.apache.org/jira/browse/SPARK-12928

[jira] [Resolved] (SPARK-4727) Add "dimensional" RDDs (time series, spatial)

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4727. -- Resolution: Won't Fix I suggest the timeseries ideas be implemented in the spark-timeseries project

[jira] [Commented] (SPARK-12930) NullPointerException running hive query with array dereference in select and where clause

2016-01-20 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108871#comment-15108871 ] Thomas Graves commented on SPARK-12930: --- Note that change the query to remove the ['pos'] from

[jira] [Comment Edited] (SPARK-12930) NullPointerException running hive query with array dereference in select and where clause

2016-01-20 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108871#comment-15108871 ] Thomas Graves edited comment on SPARK-12930 at 1/20/16 4:41 PM: Note that

[jira] [Commented] (SPARK-12926) SQLContext to disallow users passing non-sql configs

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108762#comment-15108762 ] Apache Spark commented on SPARK-12926: -- User 'tejasapatil' has created a pull request for this

[jira] [Assigned] (SPARK-12926) SQLContext to disallow users passing non-sql configs

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12926: Assignee: (was: Apache Spark) > SQLContext to disallow users passing non-sql configs

[jira] [Resolved] (SPARK-3051) Support looking-up named accumulators in a registry

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3051. -- Resolution: Won't Fix > Support looking-up named accumulators in a registry >

[jira] [Resolved] (SPARK-4251) Add Restricted Boltzmann machine(RBM) algorithm to MLlib

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4251. -- Resolution: Won't Fix > Add Restricted Boltzmann machine(RBM) algorithm to MLlib >

[jira] [Resolved] (SPARK-3849) Automate remaining Spark Code Style Guide rules

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3849. -- Resolution: Done I broke out the one remaining task as a stand-alone and am closing this. > Automate

[jira] [Updated] (SPARK-3854) Scala style: require spaces before `{`

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3854: - Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-3849) > Scala style: require

[jira] [Resolved] (SPARK-4099) env var HOME not set correctly

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4099. -- Resolution: Not A Problem > env var HOME not set correctly > -- > >

[jira] [Closed] (SPARK-10911) Executors should System.exit on clean shutdown

2016-01-20 Thread Zhuo Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhuo Liu closed SPARK-10911. Resolution: Later This is caused mainly by user issue. We may reopen it if this issue repeats more for

[jira] [Commented] (SPARK-11806) Spark 2.0 deprecations and removals

2016-01-20 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108961#comment-15108961 ] Thomas Graves commented on SPARK-11806: --- I added a task to remove the deprecated yarn configs,

[jira] [Resolved] (SPARK-12691) Multiple unionAll on Dataframe goes growingly slow.

2016-01-20 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-12691. --- Resolution: Duplicate > Multiple unionAll on Dataframe goes growingly slow. >

[jira] [Commented] (SPARK-12691) Multiple unionAll on Dataframe goes growingly slow.

2016-01-20 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108730#comment-15108730 ] Herman van Hovell commented on SPARK-12691: --- SPARK-12616 fixes this problem. > Multiple

[jira] [Assigned] (SPARK-12926) SQLContext to disallow users passing non-sql configs

2016-01-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12926: Assignee: Apache Spark > SQLContext to disallow users passing non-sql configs >

[jira] [Resolved] (SPARK-3533) Add saveAsTextFileByKey() method to RDDs

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3533. -- Resolution: Won't Fix > Add saveAsTextFileByKey() method to RDDs >

[jira] [Commented] (SPARK-12911) Cacheing a dataframe causes array comparisons to fail (in filter / where) after 1.6

2016-01-20 Thread Stephen DiCocco (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108803#comment-15108803 ] Stephen DiCocco commented on SPARK-12911: - I've been working on Jesse with this. What is

[jira] [Resolved] (SPARK-12929) Adding IBM Platform Conductor for Spark as new Supplemental Project

2016-01-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12929. --- Resolution: Done OK, done. > Adding IBM Platform Conductor for Spark as new Supplemental Project >

[jira] [Commented] (SPARK-12022) spark-shell cannot run on master created with spark-ec2

2016-01-20 Thread Denton Cockburn (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15108862#comment-15108862 ] Denton Cockburn commented on SPARK-12022: - I encountered the same problem with Spark 1.6.0

[jira] [Created] (SPARK-12930) NullPointerException running hive query with array dereference in select and where clause

2016-01-20 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-12930: - Summary: NullPointerException running hive query with array dereference in select and where clause Key: SPARK-12930 URL: https://issues.apache.org/jira/browse/SPARK-12930

[jira] [Updated] (SPARK-9835) Iteratively reweighted least squares solver for GLMs

2016-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9835: - Target Version/s: 2.0.0 > Iteratively reweighted least squares solver for GLMs >

[jira] [Updated] (SPARK-12230) WeightedLeastSquares.fit() should handle division by zero properly if standard deviation of target variable is zero.

2016-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12230: -- Assignee: Imran Younus (was: Xiangrui Meng) > WeightedLeastSquares.fit() should handle

[jira] [Resolved] (SPARK-12898) Consider having dummyCallSite for HiveTableScan

2016-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12898. - Resolution: Fixed Assignee: Rajesh Balamohan Fix Version/s: 2.0.0 > Consider

[jira] [Commented] (SPARK-12911) Cacheing a dataframe causes array comparisons to fail (in filter / where) after 1.6

2016-01-20 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109269#comment-15109269 ] Andrew Ray commented on SPARK-12911: In the current master this happens even without caching. The

[jira] [Resolved] (SPARK-12925) Improve HiveInspectors.unwrap for StringObjectInspector.getPrimitiveWritableObject

2016-01-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12925. - Resolution: Fixed Assignee: Rajesh Balamohan Fix Version/s: 2.0.0 > Improve

[jira] [Updated] (SPARK-12732) Fix LinearRegression.train for the case when label is constant and fitIntercept=false

2016-01-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12732: -- Shepherd: DB Tsai Target Version/s: 2.0.0 > Fix LinearRegression.train for the

  1   2   3   >