[jira] [Commented] (SPARK-19747) Consolidate code in ML aggregators

2017-02-27 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887421#comment-15887421 ] yuhao yang commented on SPARK-19747: I did notice the code duplication during implementing LinearSVC.

[jira] [Updated] (SPARK-19764) Executors hang with supposedly running task that are really finished.

2017-02-27 Thread Ari Gesher (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ari Gesher updated SPARK-19764: --- Description: We've come across a job that won't finish. Running on a six-node cluster, each of the

[jira] [Updated] (SPARK-19764) Executors hang with supposedly running task that are really finished.

2017-02-27 Thread Ari Gesher (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ari Gesher updated SPARK-19764: --- Attachment: driver-log-stderr.log Full driver log > Executors hang with supposedly running task

[jira] [Commented] (SPARK-17147) Spark Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2017-02-27 Thread Dean Wampler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887342#comment-15887342 ] Dean Wampler commented on SPARK-17147: -- Cody, thanks for the suggestion. We'll try to test it and

[jira] [Created] (SPARK-19764) Executors hang with supposedly running task that are really finished.

2017-02-27 Thread Ari Gesher (JIRA)
Ari Gesher created SPARK-19764: -- Summary: Executors hang with supposedly running task that are really finished. Key: SPARK-19764 URL: https://issues.apache.org/jira/browse/SPARK-19764 Project: Spark

[jira] [Updated] (SPARK-19764) Executors hang with supposedly running task that are really finished.

2017-02-27 Thread Ari Gesher (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ari Gesher updated SPARK-19764: --- Attachment: executor-2.log Full stdout log of one of the seven executors > Executors hang with

[jira] [Commented] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887315#comment-15887315 ] Joseph K. Bradley commented on SPARK-19636: --- I'm going to work on this. > Feature parity for

[jira] [Assigned] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19636: - Assignee: Joseph K. Bradley > Feature parity for correlation statistics in

[jira] [Commented] (SPARK-19738) Consider adding error handler to DataStreamWriter

2017-02-27 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887218#comment-15887218 ] Genmao Yu commented on SPARK-19738: --- [~jlalwani] I tested it on latest master branch, and return NULL

[jira] [Commented] (SPARK-19763) qualified external datasource table location stored in catalog

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887209#comment-15887209 ] Apache Spark commented on SPARK-19763: -- User 'windpiger' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19763) qualified external datasource table location stored in catalog

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19763: Assignee: Apache Spark > qualified external datasource table location stored in catalog >

[jira] [Assigned] (SPARK-19763) qualified external datasource table location stored in catalog

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19763: Assignee: (was: Apache Spark) > qualified external datasource table location stored

[jira] [Commented] (SPARK-19750) Spark UI http -> https redirect error

2017-02-27 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887204#comment-15887204 ] Saisai Shao commented on SPARK-19750: - This issue was found by [~yeshavora], credits to her. > Spark

[jira] [Created] (SPARK-19763) qualified external datasource table location stored in catalog

2017-02-27 Thread Song Jun (JIRA)
Song Jun created SPARK-19763: Summary: qualified external datasource table location stored in catalog Key: SPARK-19763 URL: https://issues.apache.org/jira/browse/SPARK-19763 Project: Spark

[jira] [Commented] (SPARK-19762) Implement aggregator/loss function hierarchy and apply to linear regression

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887187#comment-15887187 ] Apache Spark commented on SPARK-19762: -- User 'sethah' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19762) Implement aggregator/loss function hierarchy and apply to linear regression

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19762: Assignee: Apache Spark > Implement aggregator/loss function hierarchy and apply to linear

[jira] [Assigned] (SPARK-19762) Implement aggregator/loss function hierarchy and apply to linear regression

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19762: Assignee: (was: Apache Spark) > Implement aggregator/loss function hierarchy and

[jira] [Commented] (SPARK-19738) Consider adding error handler to DataStreamWriter

2017-02-27 Thread Jayesh lalwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887183#comment-15887183 ] Jayesh lalwani commented on SPARK-19738: Thanks [~zsxwing] This will do. Do you know if

[jira] [Created] (SPARK-19762) Implement aggregator/loss function hierarchy and apply to linear regression

2017-02-27 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-19762: Summary: Implement aggregator/loss function hierarchy and apply to linear regression Key: SPARK-19762 URL: https://issues.apache.org/jira/browse/SPARK-19762

[jira] [Commented] (SPARK-19761) create InMemoryFileIndex with empty rootPaths when set PARALLEL_PARTITION_DISCOVERY_THRESHOLD to zero

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887095#comment-15887095 ] Apache Spark commented on SPARK-19761: -- User 'windpiger' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19761) create InMemoryFileIndex with empty rootPaths when set PARALLEL_PARTITION_DISCOVERY_THRESHOLD to zero

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19761: Assignee: Apache Spark > create InMemoryFileIndex with empty rootPaths when set >

[jira] [Assigned] (SPARK-19761) create InMemoryFileIndex with empty rootPaths when set PARALLEL_PARTITION_DISCOVERY_THRESHOLD to zero

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19761: Assignee: (was: Apache Spark) > create InMemoryFileIndex with empty rootPaths when

[jira] [Commented] (SPARK-17147) Spark Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2017-02-27 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887091#comment-15887091 ] Cody Koeninger commented on SPARK-17147: Dean if you guys have any bandwith to help test out

[jira] [Commented] (SPARK-19738) Consider adding error handler to DataStreamWriter

2017-02-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887092#comment-15887092 ] Shixiong Zhu commented on SPARK-19738: -- [~gaaldornick] could you check if SPARK-18699 is enough? It

[jira] [Created] (SPARK-19761) create InMemoryFileIndex with empty rootPaths when set PARALLEL_PARTITION_DISCOVERY_THRESHOLD to zero

2017-02-27 Thread Song Jun (JIRA)
Song Jun created SPARK-19761: Summary: create InMemoryFileIndex with empty rootPaths when set PARALLEL_PARTITION_DISCOVERY_THRESHOLD to zero Key: SPARK-19761 URL: https://issues.apache.org/jira/browse/SPARK-19761

[jira] [Updated] (SPARK-19738) Consider adding error handler to DataStreamWriter

2017-02-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19738: - Component/s: SQL > Consider adding error handler to DataStreamWriter >

[jira] [Updated] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19751: - Component/s: (was: Spark Core) SQL > Create Data frame API fails with a

[jira] [Commented] (SPARK-18450) Add AND-amplification to Locality Sensitive Hashing

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887068#comment-15887068 ] Apache Spark commented on SPARK-18450: -- User 'Yunni' has created a pull request for this issue:

[jira] [Resolved] (SPARK-19749) Name socket source with a meaningful name

2017-02-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-19749. -- Resolution: Fixed Fix Version/s: 2.2.0 > Name socket source with a meaningful name >

[jira] [Commented] (SPARK-19760) Documentation does not list dependency on jniloader

2017-02-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887054#comment-15887054 ] Sean Owen commented on SPARK-19760: --- No, you need to build Spark with the netlib-lgpl profile, and that

[jira] [Created] (SPARK-19760) Documentation does not list dependency on jniloader

2017-02-27 Thread Demi Marie Obenour (JIRA)
Demi Marie Obenour created SPARK-19760: -- Summary: Documentation does not list dependency on jniloader Key: SPARK-19760 URL: https://issues.apache.org/jira/browse/SPARK-19760 Project: Spark

[jira] [Closed] (SPARK-19744) Find a better alternative to netlib-java

2017-02-27 Thread Demi Marie Obenour (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Demi Marie Obenour closed SPARK-19744. -- This report is bogus. I reported it out of frustration. Adding

[jira] [Commented] (SPARK-18080) Locality Sensitive Hashing (LSH) Python API

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887000#comment-15887000 ] Joseph K. Bradley commented on SPARK-18080: --- No, no problem. Thanks for committing it! I saw

[jira] [Commented] (SPARK-19280) Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler

2017-02-27 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886999#comment-15886999 ] Nan Zhu commented on SPARK-19280: - [~zsxwing] please let me know if we agree on that 2 is something we

[jira] [Created] (SPARK-19759) ALSModel.predict on Dataframes : potential optimization by not using blas

2017-02-27 Thread Sue Ann Hong (JIRA)
Sue Ann Hong created SPARK-19759: Summary: ALSModel.predict on Dataframes : potential optimization by not using blas Key: SPARK-19759 URL: https://issues.apache.org/jira/browse/SPARK-19759 Project:

[jira] [Created] (SPARK-19758) Casting string to timestamp in inline table definition fails with AnalysisException

2017-02-27 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-19758: -- Summary: Casting string to timestamp in inline table definition fails with AnalysisException Key: SPARK-19758 URL: https://issues.apache.org/jira/browse/SPARK-19758

[jira] [Assigned] (SPARK-19757) Executor with task scheduled could be killed due to idleness

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19757: Assignee: Apache Spark > Executor with task scheduled could be killed due to idleness >

[jira] [Commented] (SPARK-19757) Executor with task scheduled could be killed due to idleness

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886925#comment-15886925 ] Apache Spark commented on SPARK-19757: -- User 'jxiang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19757) Executor with task scheduled could be killed due to idleness

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19757: Assignee: (was: Apache Spark) > Executor with task scheduled could be killed due to

[jira] [Assigned] (SPARK-19756) drop the table cache after inserting into a data source table

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19756: Assignee: Wenchen Fan (was: Apache Spark) > drop the table cache after inserting into a

[jira] [Resolved] (SPARK-19746) LogisticAggregator is inefficient in indexing

2017-02-27 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-19746. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17078

[jira] [Assigned] (SPARK-19746) LogisticAggregator is inefficient in indexing

2017-02-27 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-19746: --- Assignee: Seth Hendrickson > LogisticAggregator is inefficient in indexing >

[jira] [Assigned] (SPARK-19756) drop the table cache after inserting into a data source table

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19756: Assignee: Apache Spark (was: Wenchen Fan) > drop the table cache after inserting into a

[jira] [Assigned] (SPARK-19535) ALSModel recommendAll analogs

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19535: Assignee: Apache Spark (was: Sue Ann Hong) > ALSModel recommendAll analogs >

[jira] [Commented] (SPARK-19756) drop the table cache after inserting into a data source table

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886898#comment-15886898 ] Apache Spark commented on SPARK-19756: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19756) drop the table cache after inserting into a data source table

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19756: Assignee: Apache Spark (was: Wenchen Fan) > drop the table cache after inserting into a

[jira] [Commented] (SPARK-19535) ALSModel recommendAll analogs

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886897#comment-15886897 ] Apache Spark commented on SPARK-19535: -- User 'sueann' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19535) ALSModel recommendAll analogs

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19535: Assignee: Sue Ann Hong (was: Apache Spark) > ALSModel recommendAll analogs >

[jira] [Commented] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-02-27 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886900#comment-15886900 ] Timothy Hunter commented on SPARK-19634: [~wm624] were you able to start to work on this task? I

[jira] [Assigned] (SPARK-19756) drop the table cache after inserting into a data source table

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19756: Assignee: Wenchen Fan (was: Apache Spark) > drop the table cache after inserting into a

[jira] [Commented] (SPARK-19757) Executor with task scheduled could be killed due to idleness

2017-02-27 Thread Jimmy Xiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886838#comment-15886838 ] Jimmy Xiang commented on SPARK-19757: - In class CoarseGrainedSchedulerBackend, killExecutors makes

[jira] [Created] (SPARK-19757) Executor with task scheduled could be killed due to idleness

2017-02-27 Thread Jimmy Xiang (JIRA)
Jimmy Xiang created SPARK-19757: --- Summary: Executor with task scheduled could be killed due to idleness Key: SPARK-19757 URL: https://issues.apache.org/jira/browse/SPARK-19757 Project: Spark

[jira] [Created] (SPARK-19756) drop the table cache after inserting into a data source table

2017-02-27 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-19756: --- Summary: drop the table cache after inserting into a data source table Key: SPARK-19756 URL: https://issues.apache.org/jira/browse/SPARK-19756 Project: Spark

[jira] [Updated] (SPARK-9140) Replace TimeTracker by Stopwatch

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9140: - Shepherd: Joseph K. Bradley Target Version/s: (was: 2.2.0) > Replace

[jira] [Commented] (SPARK-9140) Replace TimeTracker by Stopwatch

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886749#comment-15886749 ] Joseph K. Bradley commented on SPARK-9140: -- I'll unset the target version but assign myself as

[jira] [Updated] (SPARK-18903) uiWebUrl is not accessible to SparkR

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18903: -- Fix Version/s: 2.2.0 2.1.1 > uiWebUrl is not accessible to SparkR >

[jira] [Assigned] (SPARK-17498) StringIndexer.setHandleInvalid should have another option 'new'

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-17498: - Assignee: Vincent > StringIndexer.setHandleInvalid should have another option

[jira] [Updated] (SPARK-17498) StringIndexer.setHandleInvalid should have another option 'new'

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17498: -- Priority: Minor (was: Major) > StringIndexer.setHandleInvalid should have another

[jira] [Updated] (SPARK-17498) StringIndexer.setHandleInvalid should have another option 'new'

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17498: -- Shepherd: Joseph K. Bradley > StringIndexer.setHandleInvalid should have another

[jira] [Updated] (SPARK-19702) Add Suppress/Revive support to the Mesos Spark Dispatcher

2017-02-27 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Gummelt updated SPARK-19702: Description: Due to the problem described here:

[jira] [Updated] (SPARK-19702) Increasse refuse_seconds timeout in the Mesos Spark Dispatcher

2017-02-27 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Gummelt updated SPARK-19702: Summary: Increasse refuse_seconds timeout in the Mesos Spark Dispatcher (was: Add

[jira] [Created] (SPARK-19755) Blacklist is always active for MesosCoarseGrainedSchedulerBackend. As result - scheduler cannot create an executor after some time.

2017-02-27 Thread Timur Abakumov (JIRA)
Timur Abakumov created SPARK-19755: -- Summary: Blacklist is always active for MesosCoarseGrainedSchedulerBackend. As result - scheduler cannot create an executor after some time. Key: SPARK-19755 URL:

[jira] [Assigned] (SPARK-19753) Remove all shuffle files on a host in case of slave lost of fetch failure

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19753: Assignee: Apache Spark > Remove all shuffle files on a host in case of slave lost of

[jira] [Commented] (SPARK-19753) Remove all shuffle files on a host in case of slave lost of fetch failure

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886500#comment-15886500 ] Apache Spark commented on SPARK-19753: -- User 'sitalkedia' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19753) Remove all shuffle files on a host in case of slave lost of fetch failure

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19753: Assignee: (was: Apache Spark) > Remove all shuffle files on a host in case of slave

[jira] [Created] (SPARK-19754) Casting to int from a JSON-parsed float rounds instead of truncating

2017-02-27 Thread Juan Pumarino (JIRA)
Juan Pumarino created SPARK-19754: - Summary: Casting to int from a JSON-parsed float rounds instead of truncating Key: SPARK-19754 URL: https://issues.apache.org/jira/browse/SPARK-19754 Project:

[jira] [Created] (SPARK-19753) Remove all shuffle files on a host in case of slave lost of fetch failure

2017-02-27 Thread Sital Kedia (JIRA)
Sital Kedia created SPARK-19753: --- Summary: Remove all shuffle files on a host in case of slave lost of fetch failure Key: SPARK-19753 URL: https://issues.apache.org/jira/browse/SPARK-19753 Project:

[jira] [Comment Edited] (SPARK-12945) ERROR LiveListenerBus: Listener JobProgressListener threw an exception

2017-02-27 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886397#comment-15886397 ] Josh Rosen edited comment on SPARK-12945 at 2/27/17 7:46 PM: - I'm still

[jira] [Updated] (SPARK-12945) ERROR LiveListenerBus: Listener JobProgressListener threw an exception

2017-02-27 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-12945: --- Affects Version/s: 2.0.2 > ERROR LiveListenerBus: Listener JobProgressListener threw an exception >

[jira] [Commented] (SPARK-12945) ERROR LiveListenerBus: Listener JobProgressListener threw an exception

2017-02-27 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886397#comment-15886397 ] Josh Rosen commented on SPARK-12945: I'm still seeing this error intermittently on Spark 2.0.3

[jira] [Assigned] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19372: Assignee: (was: Apache Spark) > Code generation for Filter predicate including many

[jira] [Commented] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886383#comment-15886383 ] Apache Spark commented on SPARK-19372: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19372: Assignee: Apache Spark > Code generation for Filter predicate including many OR

[jira] [Commented] (SPARK-18693) BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886278#comment-15886278 ] Apache Spark commented on SPARK-18693: -- User 'imatiach-msft' has created a pull request for this

[jira] [Commented] (SPARK-18693) BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886268#comment-15886268 ] Apache Spark commented on SPARK-18693: -- User 'imatiach-msft' has created a pull request for this

[jira] [Commented] (SPARK-18693) BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886258#comment-15886258 ] Apache Spark commented on SPARK-18693: -- User 'imatiach-msft' has created a pull request for this

[jira] [Comment Edited] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886097#comment-15886097 ] Sean Owen edited comment on SPARK-19751 at 2/27/17 6:03 PM: Yes, Instances of

[jira] [Commented] (SPARK-19392) Throw an exception "NoSuchElementException: key not found: scale" in OracleDialect

2017-02-27 Thread Reza Safi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886108#comment-15886108 ] Reza Safi commented on SPARK-19392: --- As [~srowen] mentioned the problem occurred in a 1.6.x maintenance

[jira] [Commented] (SPARK-19714) Bucketizer Bug Regarding Handling Unbucketed Inputs

2017-02-27 Thread Bill Chambers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886109#comment-15886109 ] Bill Chambers commented on SPARK-19714: --- Agree with your first and second paragraphs. Regarding

[jira] [Commented] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Avinash Venkateshaiah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886101#comment-15886101 ] Avinash Venkateshaiah commented on SPARK-19751: --- Does it mean that we cant have beans those

[jira] [Commented] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Avinash Venkateshaiah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886097#comment-15886097 ] Avinash Venkateshaiah commented on SPARK-19751: --- Yes, Instances of the same class.

[jira] [Updated] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19751: -- Priority: Minor (was: Major) > Create Data frame API fails with a self referencing bean >

[jira] [Commented] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886067#comment-15886067 ] Sean Owen commented on SPARK-19751: --- Can you show the stack trace, at least the part before it just

[jira] [Comment Edited] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886067#comment-15886067 ] Sean Owen edited comment on SPARK-19751 at 2/27/17 4:26 PM: Can you show the

[jira] [Created] (SPARK-19752) OrcGetSplits fails with 0 size files

2017-02-27 Thread Nick Orka (JIRA)
Nick Orka created SPARK-19752: - Summary: OrcGetSplits fails with 0 size files Key: SPARK-19752 URL: https://issues.apache.org/jira/browse/SPARK-19752 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7481) Add spark-hadoop-cloud module to pull in object store support

2017-02-27 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-7481: -- Summary: Add spark-hadoop-cloud module to pull in object store support (was: Add spark-cloud

[jira] [Updated] (SPARK-7481) Add spark-cloud module to pull in object store support

2017-02-27 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-7481: -- Affects Version/s: (was: 1.3.1) 2.1.0 > Add spark-cloud module to

[jira] [Updated] (SPARK-7481) Add spark-cloud module to pull in object store support

2017-02-27 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-7481: -- Description: To keep the s3n classpath right, to add s3a, swift & azure, the dependencies of

[jira] [Updated] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Avinash Venkateshaiah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Avinash Venkateshaiah updated SPARK-19751: -- Description: createDataset API throws a stack overflow exception when we try

[jira] [Created] (SPARK-19751) Create Data frame API fails with a self referencing bean

2017-02-27 Thread Avinash Venkateshaiah (JIRA)
Avinash Venkateshaiah created SPARK-19751: - Summary: Create Data frame API fails with a self referencing bean Key: SPARK-19751 URL: https://issues.apache.org/jira/browse/SPARK-19751 Project:

[jira] [Commented] (SPARK-19738) Consider adding error handler to DataStreamWriter

2017-02-27 Thread Jayesh lalwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15885918#comment-15885918 ] Jayesh lalwani commented on SPARK-19738: [~uncleGen] Spark 2.1.0. What do you get when you run

[jira] [Updated] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-02-27 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jin xing updated SPARK-19659: - Attachment: SPARK-19659-design-v1.pdf > Fetch big blocks to disk when shuffle-read >

[jira] [Updated] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-02-27 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jin xing updated SPARK-19659: - Attachment: (was: SPARK-19659-design-v1.pdf) > Fetch big blocks to disk when shuffle-read >

[jira] [Commented] (SPARK-19478) JDBC Sink

2017-02-27 Thread Jayesh lalwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15885803#comment-15885803 ] Jayesh lalwani commented on SPARK-19478: I would like to take this on. One of our projects needs

[jira] [Commented] (SPARK-19729) Strange behaviour with reading csv with schema into dataframe

2017-02-27 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15885758#comment-15885758 ] Hyukjin Kwon commented on SPARK-19729: -- I am sorry that I am a bit confused. {code} scala>

[jira] [Commented] (SPARK-19713) saveAsTable

2017-02-27 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15885717#comment-15885717 ] Hyukjin Kwon commented on SPARK-19713: -- Could you update the JIRA title to be more meaningful and

[jira] [Commented] (SPARK-19700) Design an API for pluggable scheduler implementations

2017-02-27 Thread Hamel Ajay Kothari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15885715#comment-15885715 ] Hamel Ajay Kothari commented on SPARK-19700: Throwing this here for consistency. It looks

[jira] [Commented] (SPARK-11968) ALS recommend all methods spend most of time in GC

2017-02-27 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15885636#comment-15885636 ] Nick Pentreath commented on SPARK-11968: While working on performance testing for ALS parity I've

[jira] [Commented] (SPARK-18726) Filesystem unnecessarily scanned twice during creation of non-catalog table

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15885635#comment-15885635 ] Apache Spark commented on SPARK-18726: -- User 'windpiger' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18726) Filesystem unnecessarily scanned twice during creation of non-catalog table

2017-02-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18726: Assignee: (was: Apache Spark) > Filesystem unnecessarily scanned twice during

  1   2   >