[jira] [Assigned] (SPARK-11991) spark_ec2.py does not perform sanity checks on hostnames

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11991: Assignee: Apache Spark > spark_ec2.py does not perform sanity checks on hostnames >

[jira] [Assigned] (SPARK-11991) spark_ec2.py does not perform sanity checks on hostnames

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11991: Assignee: (was: Apache Spark) > spark_ec2.py does not perform sanity checks on

[jira] [Commented] (SPARK-11991) spark_ec2.py does not perform sanity checks on hostnames

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027157#comment-15027157 ] Apache Spark commented on SPARK-11991: -- User 'jcderr' has created a pull request for this issue:

[jira] [Resolved] (SPARK-10666) Use properties from ActiveJob associated with a Stage

2015-11-25 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid resolved SPARK-10666. -- Resolution: Fixed Fix Version/s: 1.6.0 1.5.3 > Use properties from

[jira] [Commented] (SPARK-10574) HashingTF should use MurmurHash3

2015-11-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027235#comment-15027235 ] Joseph K. Bradley commented on SPARK-10574: --- No problem; it's been busy here too. My last

[jira] [Commented] (SPARK-11202) Unsupported dataType

2015-11-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027237#comment-15027237 ] Yin Huai commented on SPARK-11202: -- Seems the problem is caused by the NUMBER type defined in ORACLE.

[jira] [Commented] (SPARK-11980) Add unit tests for the Python functions added in SPARK-10621

2015-11-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027043#comment-15027043 ] Xiao Li commented on SPARK-11980: - Will submit a PR today. Thanks! > Add unit tests for the Python

[jira] [Created] (SPARK-11991) spark_ec2.py does not perform sanity checks on hostnames

2015-11-25 Thread Jeremy Derr (JIRA)
Jeremy Derr created SPARK-11991: --- Summary: spark_ec2.py does not perform sanity checks on hostnames Key: SPARK-11991 URL: https://issues.apache.org/jira/browse/SPARK-11991 Project: Spark Issue

[jira] [Commented] (SPARK-11605) ML 1.6 QA: API: Java compatibility, docs

2015-11-25 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027184#comment-15027184 ] yuhao yang commented on SPARK-11605: I plan to finish it this week. I think Xiangrui will make

[jira] [Resolved] (SPARK-11206) Support SQL UI on the history server

2015-11-25 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-11206. Resolution: Fixed Assignee: Carson Wang Fix Version/s: 1.7.0 > Support SQL

[jira] [Updated] (SPARK-12002) offsetRanges attribute missing in Kafka RDD when resuming from checkpoint

2015-11-25 Thread Amit Ramesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amit Ramesh updated SPARK-12002: Description: SPARK-8389 added offsetRanges to Kafka direct streams. And SPARK-10122 fixed the

[jira] [Created] (SPARK-12004) RDD checkpointing does not preserve partitioner

2015-11-25 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-12004: - Summary: RDD checkpointing does not preserve partitioner Key: SPARK-12004 URL: https://issues.apache.org/jira/browse/SPARK-12004 Project: Spark Issue

[jira] [Created] (SPARK-12000) `sbt publishLocal` hits a Scala compiler bug caused by `Since` annotation

2015-11-25 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-12000: - Summary: `sbt publishLocal` hits a Scala compiler bug caused by `Since` annotation Key: SPARK-12000 URL: https://issues.apache.org/jira/browse/SPARK-12000 Project:

[jira] [Commented] (SPARK-11950) Exception throws when executing “exit;” in spark-sql

2015-11-25 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027773#comment-15027773 ] Rekha Joshi commented on SPARK-11950: - Hi [~meiyoula] This seems duplicate of SPARK-11624. [~srowen]

[jira] [Updated] (SPARK-11997) NPE when save a DataFrame as parquet and partitioned by long column

2015-11-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-11997: - Assignee: (was: Cheng Lian) Target Version/s: 1.6.0 > NPE when save a DataFrame as

[jira] [Created] (SPARK-11993) https://github.com/streamatica/TrafficAnalytics/issues/393#issuecomment-159685855

2015-11-25 Thread Antonio Murgia (JIRA)
Antonio Murgia created SPARK-11993: -- Summary: https://github.com/streamatica/TrafficAnalytics/issues/393#issuecomment-159685855 Key: SPARK-11993 URL: https://issues.apache.org/jira/browse/SPARK-11993

[jira] [Commented] (SPARK-8517) Improve the organization and style of MLlib's user guide

2015-11-25 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027375#comment-15027375 ] Timothy Hunter commented on SPARK-8517: --- Here is a few comments I have at a high level: - branding

[jira] [Assigned] (SPARK-11999) ThreadUtils.newDaemonCachedThreadPool(prefix, maxThreadNumber) has unexpected behavior

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11999: Assignee: Apache Spark > ThreadUtils.newDaemonCachedThreadPool(prefix, maxThreadNumber)

[jira] [Commented] (SPARK-11999) ThreadUtils.newDaemonCachedThreadPool(prefix, maxThreadNumber) has unexpected behavior

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027716#comment-15027716 ] Apache Spark commented on SPARK-11999: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11619) cannot use UDTF in DataFrame.selectExpr

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11619: Assignee: (was: Apache Spark) > cannot use UDTF in DataFrame.selectExpr >

[jira] [Commented] (SPARK-11619) cannot use UDTF in DataFrame.selectExpr

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027738#comment-15027738 ] Apache Spark commented on SPARK-11619: -- User 'dilipbiswal' has created a pull request for this

[jira] [Created] (SPARK-12001) StreamingContext cannot be completely stopped if the stop() call is interrupted

2015-11-25 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-12001: -- Summary: StreamingContext cannot be completely stopped if the stop() call is interrupted Key: SPARK-12001 URL: https://issues.apache.org/jira/browse/SPARK-12001 Project:

[jira] [Assigned] (SPARK-11999) ThreadUtils.newDaemonCachedThreadPool(prefix, maxThreadNumber) has unexpected behavior

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11999: Assignee: (was: Apache Spark) > ThreadUtils.newDaemonCachedThreadPool(prefix,

[jira] [Commented] (SPARK-11998) Branch 1.6's hadoop 2.2 tests always fail the entire VersionsSuite

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027718#comment-15027718 ] Apache Spark commented on SPARK-11998: -- User 'yhuai' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11998) Branch 1.6's hadoop 2.2 tests always fail the entire VersionsSuite

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11998: Assignee: (was: Apache Spark) > Branch 1.6's hadoop 2.2 tests always fail the entire

[jira] [Assigned] (SPARK-11998) Branch 1.6's hadoop 2.2 tests always fail the entire VersionsSuite

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11998: Assignee: Apache Spark > Branch 1.6's hadoop 2.2 tests always fail the entire

[jira] [Commented] (SPARK-11968) ALS recommend all methods spend most of time in GC

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027734#comment-15027734 ] Apache Spark commented on SPARK-11968: -- User 'rekhajoshm' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11968) ALS recommend all methods spend most of time in GC

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11968: Assignee: Apache Spark > ALS recommend all methods spend most of time in GC >

[jira] [Assigned] (SPARK-11968) ALS recommend all methods spend most of time in GC

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11968: Assignee: (was: Apache Spark) > ALS recommend all methods spend most of time in GC >

[jira] [Created] (SPARK-12003) Expanded star should use field name as column name

2015-11-25 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12003: -- Summary: Expanded star should use field name as column name Key: SPARK-12003 URL: https://issues.apache.org/jira/browse/SPARK-12003 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-11619) cannot use UDTF in DataFrame.selectExpr

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11619: Assignee: Apache Spark > cannot use UDTF in DataFrame.selectExpr >

[jira] [Updated] (SPARK-12000) `sbt publishLocal` hits a Scala compiler bug caused by `Since` annotation

2015-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12000: -- Description: Reported by [~josephkb]. Not sure what is the root cause, but this is the error

[jira] [Created] (SPARK-12002) offsetRanges attribute missing in Kafka RDD when resuming from checkpoint

2015-11-25 Thread Amit Ramesh (JIRA)
Amit Ramesh created SPARK-12002: --- Summary: offsetRanges attribute missing in Kafka RDD when resuming from checkpoint Key: SPARK-12002 URL: https://issues.apache.org/jira/browse/SPARK-12002 Project:

[jira] [Assigned] (SPARK-12001) StreamingContext cannot be completely stopped if the stop() call is interrupted

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12001: Assignee: Apache Spark (was: Josh Rosen) > StreamingContext cannot be completely stopped

[jira] [Assigned] (SPARK-12001) StreamingContext cannot be completely stopped if the stop() call is interrupted

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12001: Assignee: Josh Rosen (was: Apache Spark) > StreamingContext cannot be completely stopped

[jira] [Commented] (SPARK-12001) StreamingContext cannot be completely stopped if the stop() call is interrupted

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027871#comment-15027871 ] Apache Spark commented on SPARK-12001: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027937#comment-15027937 ] Tathagata Das commented on SPARK-11932: --- The reason why trackStateByKey throws this exception is

[jira] [Assigned] (SPARK-12004) RDD checkpointing does not preserve partitioner

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12004: Assignee: Apache Spark (was: Tathagata Das) > RDD checkpointing does not preserve

[jira] [Commented] (SPARK-12004) RDD checkpointing does not preserve partitioner

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027948#comment-15027948 ] Apache Spark commented on SPARK-12004: -- User 'tdas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12005) VerifyError in HyperLogLogPlusPlus with newer JDKs

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12005: Assignee: Apache Spark > VerifyError in HyperLogLogPlusPlus with newer JDKs >

[jira] [Commented] (SPARK-12005) VerifyError in HyperLogLogPlusPlus with newer JDKs

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027990#comment-15027990 ] Apache Spark commented on SPARK-12005: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12005) VerifyError in HyperLogLogPlusPlus with newer JDKs

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12005: Assignee: (was: Apache Spark) > VerifyError in HyperLogLogPlusPlus with newer JDKs >

[jira] [Updated] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-11932: -- Description: The problem is that Code {code} StreamingContext.getOrCreate(".", () =>

[jira] [Created] (SPARK-12007) Network library's RPC layer requires a lot of copying

2015-11-25 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-12007: -- Summary: Network library's RPC layer requires a lot of copying Key: SPARK-12007 URL: https://issues.apache.org/jira/browse/SPARK-12007 Project: Spark

[jira] [Commented] (SPARK-11606) ML 1.6 QA: Update user guide for new APIs

2015-11-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15028058#comment-15028058 ] Joseph K. Bradley commented on SPARK-11606: --- I think the higher priority is to create a single

[jira] [Comment Edited] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027937#comment-15027937 ] Tathagata Das edited comment on SPARK-11932 at 11/26/15 1:07 AM: - The

[jira] [Updated] (SPARK-12006) GaussianMixture.train crashes if an itnital model is not None

2015-11-25 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-12006: --- Description: Steps to reproduce : {code} from pyspark.mllib.clustering import

[jira] [Comment Edited] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027937#comment-15027937 ] Tathagata Das edited comment on SPARK-11932 at 11/26/15 2:06 AM: - The

[jira] [Comment Edited] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027937#comment-15027937 ] Tathagata Das edited comment on SPARK-11932 at 11/26/15 2:04 AM: - The

[jira] [Created] (SPARK-12006) GaussianMixture.train crashes if an itnital model is not None

2015-11-25 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-12006: -- Summary: GaussianMixture.train crashes if an itnital model is not None Key: SPARK-12006 URL: https://issues.apache.org/jira/browse/SPARK-12006 Project:

[jira] [Assigned] (SPARK-12006) GaussianMixture.train crashes if an itnital model is not None

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12006: Assignee: Apache Spark > GaussianMixture.train crashes if an itnital model is not None >

[jira] [Commented] (SPARK-12006) GaussianMixture.train crashes if an itnital model is not None

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15028002#comment-15028002 ] Apache Spark commented on SPARK-12006: -- User 'zero323' has created a pull request for this issue:

[jira] [Commented] (SPARK-12007) Network library's RPC layer requires a lot of copying

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15028012#comment-15028012 ] Apache Spark commented on SPARK-12007: -- User 'vanzin' has created a pull request for this issue:

[jira] [Updated] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-11932: -- Description: The problem is that when recovering a streaming application using

[jira] [Assigned] (SPARK-12007) Network library's RPC layer requires a lot of copying

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12007: Assignee: (was: Apache Spark) > Network library's RPC layer requires a lot of copying

[jira] [Assigned] (SPARK-12007) Network library's RPC layer requires a lot of copying

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12007: Assignee: Apache Spark > Network library's RPC layer requires a lot of copying >

[jira] [Assigned] (SPARK-12004) RDD checkpointing does not preserve partitioner

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12004: Assignee: Tathagata Das (was: Apache Spark) > RDD checkpointing does not preserve

[jira] [Created] (SPARK-12005) VerifyError in HyperLogLogPlusPlus with newer JDKs

2015-11-25 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-12005: -- Summary: VerifyError in HyperLogLogPlusPlus with newer JDKs Key: SPARK-12005 URL: https://issues.apache.org/jira/browse/SPARK-12005 Project: Spark Issue

[jira] [Updated] (SPARK-11999) ThreadUtils.newDaemonCachedThreadPool(prefix, maxThreadNumber) has unexpected behavior

2015-11-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-11999: -- Assignee: Shixiong Zhu > ThreadUtils.newDaemonCachedThreadPool(prefix, maxThreadNumber) has >

[jira] [Commented] (SPARK-11606) ML 1.6 QA: Update user guide for new APIs

2015-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15028019#comment-15028019 ] Yanbo Liang commented on SPARK-11606: - [~josephkb] Since we have add support for most of feature

[jira] [Commented] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15028044#comment-15028044 ] Apache Spark commented on SPARK-11932: -- User 'tmnd1991' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027937#comment-15027937 ] Tathagata Das edited comment on SPARK-11932 at 11/26/15 1:07 AM: - The

[jira] [Assigned] (SPARK-12003) Expanded star should use field name as column name

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12003: Assignee: Apache Spark > Expanded star should use field name as column name >

[jira] [Assigned] (SPARK-12003) Expanded star should use field name as column name

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12003: Assignee: (was: Apache Spark) > Expanded star should use field name as column name >

[jira] [Commented] (SPARK-12003) Expanded star should use field name as column name

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027968#comment-15027968 ] Apache Spark commented on SPARK-12003: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12006) GaussianMixture.train crashes if an itnital model is not None

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12006: Assignee: (was: Apache Spark) > GaussianMixture.train crashes if an itnital model is

[jira] [Assigned] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11932: Assignee: Apache Spark (was: Tathagata Das) > trackStateByKey throws

[jira] [Assigned] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11932: Assignee: Tathagata Das (was: Apache Spark) > trackStateByKey throws

[jira] [Commented] (SPARK-11932) trackStateByKey throws java.lang.IllegalArgumentException: requirement failed on restarting from checkpoint

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15028015#comment-15028015 ] Apache Spark commented on SPARK-11932: -- User 'tdas' has created a pull request for this issue:

[jira] [Commented] (SPARK-11994) Word2VecModel load and save cause SparkException when model is bigger than spark.kryoserializer.buffer.max

2015-11-25 Thread Antonio Murgia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15028016#comment-15028016 ] Antonio Murgia commented on SPARK-11994: Since `spark.kryoserializer.buffer.max` defaults to

[jira] [Updated] (SPARK-11956) Test failures potentially related to SPARK-11140

2015-11-25 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-11956: --- Fix Version/s: 1.6.0 Also merged to 1.6 since SPARK-11140 was backported (even if in a

[jira] [Updated] (SPARK-11762) TransportResponseHandler should consider open streams when counting outstanding requests

2015-11-25 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-11762: --- Fix Version/s: 1.6.0 Also merged to 1.6 since SPARK-11140 (even if in a semi-disabled state

[jira] [Updated] (SPARK-10266) Add @Since annotation to ml.tuning

2015-11-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10266: -- Shepherd: Joseph K. Bradley (was: Yu Ishikawa) Assignee: Yu Ishikawa (was: Ehsan

[jira] [Comment Edited] (SPARK-11992) Severl numbers in my spark shell (pyspark)

2015-11-25 Thread Alberto Bonsanto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027304#comment-15027304 ] Alberto Bonsanto edited comment on SPARK-11992 at 11/25/15 7:25 PM:

[jira] [Commented] (SPARK-11995) Partitioning Parquet by DateType

2015-11-25 Thread Jack Arenas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027423#comment-15027423 ] Jack Arenas commented on SPARK-11995: - Seems like the issue comes from CatalystSchemaConverter.scala

[jira] [Comment Edited] (SPARK-11995) Partitioning Parquet by DateType

2015-11-25 Thread Jack Arenas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027423#comment-15027423 ] Jack Arenas edited comment on SPARK-11995 at 11/25/15 7:25 PM: --- Seems like

[jira] [Resolved] (SPARK-10864) SparkUI: app name is hidden if window is resized

2015-11-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10864. --- Resolution: Fixed Fix Version/s: 1.6.0 Target Version/s: 1.6.0 > SparkUI: app name

[jira] [Updated] (SPARK-11935) Send the Python exceptions in TransformFunction and TransformFunctionSerializer to Java

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-11935: -- Assignee: Shixiong Zhu > Send the Python exceptions in TransformFunction and >

[jira] [Resolved] (SPARK-11935) Send the Python exceptions in TransformFunction and TransformFunctionSerializer to Java

2015-11-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-11935. --- Resolution: Fixed Fix Version/s: 1.6.0 > Send the Python exceptions in

[jira] [Created] (SPARK-11992) Severl numbers in my spark shell (pyspark)

2015-11-25 Thread Alberto Bonsanto (JIRA)
Alberto Bonsanto created SPARK-11992: Summary: Severl numbers in my spark shell (pyspark) Key: SPARK-11992 URL: https://issues.apache.org/jira/browse/SPARK-11992 Project: Spark Issue

[jira] [Commented] (SPARK-11329) Expand Star when creating a struct

2015-11-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027246#comment-15027246 ] Yin Huai commented on SPARK-11329: -- oh, sorry. Yes, you are right. Right now, when a struct type column

[jira] [Updated] (SPARK-11992) Severl numbers in my spark shell (pyspark)

2015-11-25 Thread Alberto Bonsanto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alberto Bonsanto updated SPARK-11992: - Priority: Critical (was: Blocker) Issue Type: Bug (was: Question) > Severl

[jira] [Resolved] (SPARK-10558) Wrong executor state in standalone master because of wrong state transition

2015-11-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10558. --- Resolution: Fixed Assignee: Saisai Shao Fix Version/s: 1.6.0 Target

[jira] [Created] (SPARK-11994) Word2VecModel load and save cause SparkException when model is bigger than spark.kryoserializer.buffer.max

2015-11-25 Thread Antonio Murgia (JIRA)
Antonio Murgia created SPARK-11994: -- Summary: Word2VecModel load and save cause SparkException when model is bigger than spark.kryoserializer.buffer.max Key: SPARK-11994 URL:

[jira] [Resolved] (SPARK-11974) Not all the temp dirs had been deleted when the JVM exits

2015-11-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-11974. - Resolution: Fixed Assignee: Zhongshuai Pei Fix Version/s: 1.6.0

[jira] [Resolved] (SPARK-11984) Typos in GroupedData Pivot doc in Scala and Python

2015-11-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-11984. - Resolution: Fixed Assignee: Felix Cheung Fix Version/s: 1.6.0 > Typos in

[jira] [Assigned] (SPARK-11996) Executor thread dump is broken

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11996: Assignee: (was: Apache Spark) > Executor thread dump is broken >

[jira] [Commented] (SPARK-11996) Executor thread dump is broken

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027440#comment-15027440 ] Apache Spark commented on SPARK-11996: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Updated] (SPARK-11996) Executor thread dump is broken

2015-11-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-11996: - Description: The driver needs to know the executor listening address to send the thread dump

[jira] [Assigned] (SPARK-11996) Executor thread dump is broken

2015-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11996: Assignee: Apache Spark > Executor thread dump is broken > --

[jira] [Updated] (SPARK-11880) On Windows spark-env.cmd is not loaded.

2015-11-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-11880: -- Assignee: tawan > On Windows spark-env.cmd is not loaded. > --- >

[jira] [Commented] (SPARK-6518) Add example code and user guide for bisecting k-means

2015-11-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027298#comment-15027298 ] Joseph K. Bradley commented on SPARK-6518: -- Sure, sounds good. However, it'd be nice if the docs

[jira] [Created] (SPARK-11995) Partitioning Parquet by DateType

2015-11-25 Thread Jack Arenas (JIRA)
Jack Arenas created SPARK-11995: --- Summary: Partitioning Parquet by DateType Key: SPARK-11995 URL: https://issues.apache.org/jira/browse/SPARK-11995 Project: Spark Issue Type: Improvement

[jira] [Comment Edited] (SPARK-11995) Partitioning Parquet by DateType

2015-11-25 Thread Jack Arenas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027423#comment-15027423 ] Jack Arenas edited comment on SPARK-11995 at 11/25/15 7:24 PM: --- Seems like

[jira] [Comment Edited] (SPARK-11995) Partitioning Parquet by DateType

2015-11-25 Thread Jack Arenas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027423#comment-15027423 ] Jack Arenas edited comment on SPARK-11995 at 11/25/15 7:24 PM: --- Seems like

[jira] [Updated] (SPARK-11961) User guide section for ChiSqSelector transformer

2015-11-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11961: -- Shepherd: Joseph K. Bradley > User guide section for ChiSqSelector transformer >

[jira] [Resolved] (SPARK-11956) Test failures potentially related to SPARK-11140

2015-11-25 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-11956. Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 1.7.0 > Test

[jira] [Closed] (SPARK-11993) https://github.com/streamatica/TrafficAnalytics/issues/393#issuecomment-159685855

2015-11-25 Thread Antonio Murgia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antonio Murgia closed SPARK-11993. -- Resolution: Invalid >

[jira] [Commented] (SPARK-11992) Severl numbers in my spark shell (pyspark)

2015-11-25 Thread Alberto Bonsanto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027304#comment-15027304 ] Alberto Bonsanto commented on SPARK-11992: -- [~srowen] Hello, I appreciate your time commenting

[jira] [Updated] (SPARK-11996) Executor thread dump is broken

2015-11-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-11996: - Summary: Executor thread dump is broken (was: Executor thread dump of is broken) > Executor

  1   2   3   >