[jira] [Assigned] (SPARK-7112) Add a tracker to track the direct streams

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7112: --- Assignee: Apache Spark Add a tracker to track the direct streams

[jira] [Commented] (SPARK-7112) Add a tracker to track the direct streams

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510540#comment-14510540 ] Apache Spark commented on SPARK-7112: - User 'jerryshao' has created a pull request for

[jira] [Assigned] (SPARK-7112) Add a tracker to track the direct streams

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7112: --- Assignee: (was: Apache Spark) Add a tracker to track the direct streams

[jira] [Resolved] (SPARK-6891) ExecutorAllocationManager will request negative number executors

2015-04-24 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza resolved SPARK-6891. --- Resolution: Duplicate ExecutorAllocationManager will request negative number executors

[jira] [Commented] (SPARK-6891) ExecutorAllocationManager will request negative number executors

2015-04-24 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510550#comment-14510550 ] Sandy Ryza commented on SPARK-6891: --- This looks like a duplicate of SPARK-6954. While

[jira] [Created] (SPARK-7118) Add coalesce Spark SQL function to PySpark API

2015-04-24 Thread Olivier Girardot (JIRA)
Olivier Girardot created SPARK-7118: --- Summary: Add coalesce Spark SQL function to PySpark API Key: SPARK-7118 URL: https://issues.apache.org/jira/browse/SPARK-7118 Project: Spark Issue

[jira] [Created] (SPARK-7119) ScriptTransform doesn't consider the output data type

2015-04-24 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-7119: Summary: ScriptTransform doesn't consider the output data type Key: SPARK-7119 URL: https://issues.apache.org/jira/browse/SPARK-7119 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4705) Driver retries in cluster mode always fail if event logging is enabled

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510813#comment-14510813 ] Sean Owen commented on SPARK-4705: -- Looks like it's still active and in progress:

[jira] [Updated] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-04-24 Thread Platon Potapov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Platon Potapov updated SPARK-7122: -- Description: attached is the complete source code of a test spark job. no external data

[jira] [Commented] (SPARK-7053) KafkaUtils.createStream leaks resources

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510932#comment-14510932 ] Sean Owen commented on SPARK-7053: -- I'm out of ideas myself, but you can often find the

[jira] [Commented] (SPARK-7103) SparkContext.union crashed when some RDDs have no partitioner

2015-04-24 Thread Vinod KC (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510621#comment-14510621 ] Vinod KC commented on SPARK-7103: - I closed PR #5678.. thanks SparkContext.union crashed

[jira] [Commented] (SPARK-7098) Inconsistent Timestamp behavior when used in WHERE clause

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510626#comment-14510626 ] Apache Spark commented on SPARK-7098: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-7098) Inconsistent Timestamp behavior when used in WHERE clause

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7098: --- Assignee: Apache Spark Inconsistent Timestamp behavior when used in WHERE clause

[jira] [Assigned] (SPARK-7098) Inconsistent Timestamp behavior when used in WHERE clause

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7098: --- Assignee: (was: Apache Spark) Inconsistent Timestamp behavior when used in WHERE clause

[jira] [Comment Edited] (SPARK-7053) KafkaUtils.createStream leaks resources

2015-04-24 Thread Platon Potapov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510751#comment-14510751 ] Platon Potapov edited comment on SPARK-7053 at 4/24/15 10:02 AM:

[jira] [Created] (SPARK-7121) ClosureCleaner does not handle nesting properly

2015-04-24 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7121: Summary: ClosureCleaner does not handle nesting properly Key: SPARK-7121 URL: https://issues.apache.org/jira/browse/SPARK-7121 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7121) ClosureCleaner does not handle nesting properly

2015-04-24 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7121: - Component/s: Spark Core ClosureCleaner does not handle nesting properly

[jira] [Commented] (SPARK-7049) File does not exist in checkpoint directory

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510912#comment-14510912 ] Sean Owen commented on SPARK-7049: -- Try increasing your TTL to at least twice your window

[jira] [Created] (SPARK-7120) ClosureCleaner lacks documentation

2015-04-24 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7120: Summary: ClosureCleaner lacks documentation Key: SPARK-7120 URL: https://issues.apache.org/jira/browse/SPARK-7120 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7120) ClosureCleaner lacks documentation

2015-04-24 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7120: - Target Version/s: 1.4.0 ClosureCleaner lacks documentation --

[jira] [Commented] (SPARK-5895) Add VectorSlicer

2015-04-24 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510642#comment-14510642 ] Xusen Yin commented on SPARK-5895: -- [~mengxr] [~josephkb] I want to take it. Thanks!

[jira] [Commented] (SPARK-7115) Do not output 1 in PolynomialExpansion

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510620#comment-14510620 ] Apache Spark commented on SPARK-7115: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-7120) ClosureCleaner lacks documentation

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510772#comment-14510772 ] Apache Spark commented on SPARK-7120: - User 'andrewor14' has created a pull request

[jira] [Assigned] (SPARK-7120) ClosureCleaner lacks documentation

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7120: --- Assignee: Andrew Or (was: Apache Spark) ClosureCleaner lacks documentation

[jira] [Commented] (SPARK-6681) JAVA_HOME error with upgrade to Spark 1.3.0

2015-04-24 Thread Sourabh Chaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510774#comment-14510774 ] Sourabh Chaki commented on SPARK-6681: -- I am also facing the same issue. I have

[jira] [Assigned] (SPARK-7121) ClosureCleaner does not handle nesting properly

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7121: --- Assignee: Apache Spark (was: Andrew Or) ClosureCleaner does not handle nesting properly

[jira] [Commented] (SPARK-7121) ClosureCleaner does not handle nesting properly

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510773#comment-14510773 ] Apache Spark commented on SPARK-7121: - User 'andrewor14' has created a pull request

[jira] [Assigned] (SPARK-7121) ClosureCleaner does not handle nesting properly

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7121: --- Assignee: Andrew Or (was: Apache Spark) ClosureCleaner does not handle nesting properly

[jira] [Assigned] (SPARK-7120) ClosureCleaner lacks documentation

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7120: --- Assignee: Apache Spark (was: Andrew Or) ClosureCleaner lacks documentation

[jira] [Updated] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-04-24 Thread Platon Potapov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Platon Potapov updated SPARK-7122: -- Attachment: SparkStreamingJob.scala KafkaUtils.createDirectStream - unreasonable processing

[jira] [Commented] (SPARK-7110) when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510900#comment-14510900 ] Sean Owen commented on SPARK-7110: -- I think you may need to apply the same fix to

[jira] [Commented] (SPARK-4705) Driver retries in cluster mode always fail if event logging is enabled

2015-04-24 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510630#comment-14510630 ] Emre Sevinç commented on SPARK-4705: Hello, Any plans on resolving this issue?

[jira] [Commented] (SPARK-6900) spark ec2 script enters infinite loop when run-instance fails

2015-04-24 Thread Guodong Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510724#comment-14510724 ] Guodong Wang commented on SPARK-6900: - Since spark-ec2 is not designed to

[jira] [Updated] (SPARK-7053) KafkaUtils.createStream leaks resources

2015-04-24 Thread Platon Potapov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Platon Potapov updated SPARK-7053: -- Attachment: round2.scala round2.5000msg.simple.jpg

[jira] [Created] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-04-24 Thread Platon Potapov (JIRA)
Platon Potapov created SPARK-7122: - Summary: KafkaUtils.createDirectStream - unreasonable processing time in absence of load Key: SPARK-7122 URL: https://issues.apache.org/jira/browse/SPARK-7122

[jira] [Updated] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7122: - Priority: Minor (was: Major) As a matter of process -- let's ask questions on the list first rather than

[jira] [Comment Edited] (SPARK-5895) Add VectorSlicer

2015-04-24 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386365#comment-14386365 ] Xusen Yin edited comment on SPARK-5895 at 4/24/15 8:43 AM: --- I

[jira] [Assigned] (SPARK-7118) Add coalesce Spark SQL function to PySpark API

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7118: --- Assignee: Apache Spark Add coalesce Spark SQL function to PySpark API

[jira] [Commented] (SPARK-7118) Add coalesce Spark SQL function to PySpark API

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510674#comment-14510674 ] Apache Spark commented on SPARK-7118: - User 'ogirardot' has created a pull request for

[jira] [Assigned] (SPARK-7118) Add coalesce Spark SQL function to PySpark API

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7118: --- Assignee: (was: Apache Spark) Add coalesce Spark SQL function to PySpark API

[jira] [Commented] (SPARK-4705) Driver retries in cluster mode always fail if event logging is enabled

2015-04-24 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510823#comment-14510823 ] Emre Sevinç commented on SPARK-4705: Great! From the conversation on Github, it seems

[jira] [Comment Edited] (SPARK-7110) when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510900#comment-14510900 ] Sean Owen edited comment on SPARK-7110 at 4/24/15 11:46 AM: I

[jira] [Resolved] (SPARK-7117) SparkSQL and Spark sometimes throw exceptions when reading Parquet files.

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7117. -- Resolution: Duplicate Please set component, and use PRs not patches to submit changes. However, also

[jira] [Commented] (SPARK-7110) when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication

2015-04-24 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511007#comment-14511007 ] Thomas Graves commented on SPARK-7110: -- So with the NewApi's, the call to: val

[jira] [Commented] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-04-24 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511092#comment-14511092 ] Cody Koeninger commented on SPARK-7122: --- Does this actually have anything to do with

[jira] [Updated] (SPARK-7102) update apache hosted graphx-programming-guide doc

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7102: - Priority: Minor (was: Trivial) update apache hosted graphx-programming-guide doc

[jira] [Commented] (SPARK-7123) support table.star in sqlcontext

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511099#comment-14511099 ] Apache Spark commented on SPARK-7123: - User 'scwf' has created a pull request for this

[jira] [Assigned] (SPARK-7123) support table.star in sqlcontext

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7123: --- Assignee: (was: Apache Spark) support table.star in sqlcontext

[jira] [Commented] (SPARK-7119) ScriptTransform doesn't consider the output data type

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510940#comment-14510940 ] Apache Spark commented on SPARK-7119: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-7119) ScriptTransform doesn't consider the output data type

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7119: --- Assignee: (was: Apache Spark) ScriptTransform doesn't consider the output data type

[jira] [Assigned] (SPARK-7119) ScriptTransform doesn't consider the output data type

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7119: --- Assignee: Apache Spark ScriptTransform doesn't consider the output data type

[jira] [Resolved] (SPARK-7102) update apache hosted graphx-programming-guide doc

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7102. -- Resolution: Fixed Fix Version/s: 1.4.0 update apache hosted graphx-programming-guide doc

[jira] [Reopened] (SPARK-7102) update apache hosted graphx-programming-guide doc

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-7102: -- Assignee: Deborah Siegel Reopening by request to retroactively attach this to

[jira] [Created] (SPARK-7123) support table.star in sqlcontext

2015-04-24 Thread Fei Wang (JIRA)
Fei Wang created SPARK-7123: --- Summary: support table.star in sqlcontext Key: SPARK-7123 URL: https://issues.apache.org/jira/browse/SPARK-7123 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-3727) Trees and ensembles: More prediction functionality

2015-04-24 Thread Oscar Olmedo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511133#comment-14511133 ] Oscar Olmedo commented on SPARK-3727: - Hello, Here is a [link to my fork |

[jira] [Commented] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-04-24 Thread Platon Potapov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511134#comment-14511134 ] Platon Potapov commented on SPARK-7122: --- yes, this is specific to direct stream. in

[jira] [Updated] (SPARK-7122) KafkaUtils.createDirectStream - unreasonable processing time in absence of load

2015-04-24 Thread Platon Potapov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Platon Potapov updated SPARK-7122: -- Attachment: 10.second.window.fast.job.txt 5.second.window.slow.job.txt an

[jira] [Commented] (SPARK-7049) File does not exist in checkpoint directory

2015-04-24 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510993#comment-14510993 ] yangping wu commented on SPARK-7049: Ok, I will try to increase the TTL, Thank you.

[jira] [Assigned] (SPARK-7123) support table.star in sqlcontext

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7123: --- Assignee: Apache Spark support table.star in sqlcontext

[jira] [Created] (SPARK-7126) For spark.ml Classifiers, automatically index labels if they are not yet indexed

2015-04-24 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7126: Summary: For spark.ml Classifiers, automatically index labels if they are not yet indexed Key: SPARK-7126 URL: https://issues.apache.org/jira/browse/SPARK-7126

[jira] [Commented] (SPARK-5995) Make ML Prediction Developer APIs public

2015-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511468#comment-14511468 ] Joseph K. Bradley commented on SPARK-5995: -- Update on plan: We'll make these APIs

[jira] [Created] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-04-24 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7127: Summary: Broadcast spark.ml tree ensemble models for predict Key: SPARK-7127 URL: https://issues.apache.org/jira/browse/SPARK-7127 Project: Spark

[jira] [Updated] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7127: - Labels: starter (was: ) Broadcast spark.ml tree ensemble models for predict

[jira] [Created] (SPARK-7128) Add generic bagging algorithm to spark.ml

2015-04-24 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7128: Summary: Add generic bagging algorithm to spark.ml Key: SPARK-7128 URL: https://issues.apache.org/jira/browse/SPARK-7128 Project: Spark Issue Type:

[jira] [Commented] (SPARK-2516) Bootstrapping

2015-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511493#comment-14511493 ] Joseph K. Bradley commented on SPARK-2516: -- [~mengxr] Just to confirm, am I

[jira] [Comment Edited] (SPARK-2516) Bootstrapping

2015-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511493#comment-14511493 ] Joseph K. Bradley edited comment on SPARK-2516 at 4/24/15 6:37 PM:

[jira] [Commented] (SPARK-6900) spark ec2 script enters infinite loop when run-instance fails

2015-04-24 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511283#comment-14511283 ] Nicholas Chammas commented on SPARK-6900: - That is correct. So again the solution

[jira] [Commented] (SPARK-7124) Add functions to check for file and directory existence

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511387#comment-14511387 ] Sean Owen commented on SPARK-7124: -- There is already an HDFS API for this, which also

[jira] [Updated] (SPARK-7125) textFile().first() on empty files raises ENOENT

2015-04-24 Thread Sam Steingold (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sam Steingold updated SPARK-7125: - Description: The two calls: # {{sc.textFile(existing-empty-file).first()}} and #

[jira] [Resolved] (SPARK-7125) textFile().first() on empty files raises ENOENT

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7125. -- Resolution: Not A Problem For a non-existent input, some flavor of file not found exception is correct,

[jira] [Resolved] (SPARK-7033) Use JavaRDD.partitions() instead of JavaRDD.splits()

2015-04-24 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-7033. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request

[jira] [Created] (SPARK-7139) Allow received block metadata to be saved to WAL and recovered on driver failure

2015-04-24 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-7139: Summary: Allow received block metadata to be saved to WAL and recovered on driver failure Key: SPARK-7139 URL: https://issues.apache.org/jira/browse/SPARK-7139

[jira] [Updated] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Description: An implementation of Factorization Machines based on Scala and Spark MLlib. FM is a

[jira] [Comment Edited] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14512110#comment-14512110 ] zhengruifeng edited comment on SPARK-7008 at 4/25/15 12:46 AM:

[jira] [Resolved] (SPARK-1457) Change APIs for training algorithms to take optimizer as parameter

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1457. -- Resolution: Won't Fix I think this is not a bad idea but think this might have timed out, and will be

[jira] [Closed] (SPARK-7134) Add regParam and featureScaling options to Logistic regression 'train' methods

2015-04-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-7134. Resolution: Won't Fix I'm closing in favor of SPARK-6682. Add regParam and featureScaling options

[jira] [Issue Comment Deleted] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-04-24 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-6980: Comment: was deleted (was: Modified ActorWordCount example to produce akka timeout) Akka timeout

[jira] [Updated] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-04-24 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-6980: Attachment: Spark-6980-Test.scala Modified ActorWordCount example to produce akka timeout Akka

[jira] [Comment Edited] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14512110#comment-14512110 ] zhengruifeng edited comment on SPARK-7008 at 4/25/15 12:44 AM:

[jira] [Commented] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14512110#comment-14512110 ] zhengruifeng commented on SPARK-7008: - The convergence curves of Binary Classification

[jira] [Comment Edited] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-04-24 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14512148#comment-14512148 ] Imran Rashid edited comment on SPARK-6980 at 4/25/15 1:20 AM: --

[jira] [Commented] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-04-24 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14512148#comment-14512148 ] Imran Rashid commented on SPARK-6980: - Hi [~bryanc] [~harshg], sorry I didn't notice

[jira] [Updated] (SPARK-6599) Improve reliability and usability of Kinesis-based Spark Streaming

2015-04-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6599: - Description: Currently, the KinesisReceiver can loose some data in the case of certain failures

[jira] [Updated] (SPARK-6599) Improve reliability and usability of Kinesis-based Spark Streaming

2015-04-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-6599: - Description: Currently, the KinesisReceiver can loose some data in the case of certain failures

[jira] [Updated] (SPARK-7138) Add method to BlockGenerator to add multiple records to BlockGenerator with single callback

2015-04-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7138: - Component/s: Streaming Add method to BlockGenerator to add multiple records to BlockGenerator

[jira] [Updated] (SPARK-5895) Add VectorSlicer

2015-04-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5895: - Assignee: Xusen Yin Add VectorSlicer Key: SPARK-5895

[jira] [Commented] (SPARK-7138) Add method to BlockGenerator to add multiple records to BlockGenerator with single callback

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511902#comment-14511902 ] Apache Spark commented on SPARK-7138: - User 'tdas' has created a pull request for this

[jira] [Assigned] (SPARK-3090) Avoid not stopping SparkContext with YARN Client mode

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3090: --- Assignee: (was: Apache Spark) Avoid not stopping SparkContext with YARN Client mode

[jira] [Assigned] (SPARK-3090) Avoid not stopping SparkContext with YARN Client mode

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3090: --- Assignee: Apache Spark Avoid not stopping SparkContext with YARN Client mode

[jira] [Commented] (SPARK-3090) Avoid not stopping SparkContext with YARN Client mode

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14512048#comment-14512048 ] Apache Spark commented on SPARK-3090: - User 'vanzin' has created a pull request for

[jira] [Assigned] (SPARK-6214) Allow configuration options to use a simple expression language

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6214: --- Assignee: (was: Apache Spark) Allow configuration options to use a simple expression

[jira] [Assigned] (SPARK-6214) Allow configuration options to use a simple expression language

2015-04-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6214: --- Assignee: Apache Spark Allow configuration options to use a simple expression language

[jira] [Commented] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-04-24 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14512069#comment-14512069 ] Bryan Cutler commented on SPARK-6980: - I'm working out of trunk. Changing the

[jira] [Resolved] (SPARK-6122) Upgrade Tachyon dependency to 0.6.0

2015-04-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6122. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5354

[jira] [Updated] (SPARK-7138) Add method to BlockGenerator to add multiple records to BlockGenerator with single callback

2015-04-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7138: - Issue Type: Improvement (was: Bug) Add method to BlockGenerator to add multiple records to

[jira] [Updated] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Attachment: FM_CR.xlsx An implementation of Factorization Machine (LibFM)

[jira] [Resolved] (SPARK-7115) Do not output 1 in PolynomialExpansion

2015-04-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7115. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5681

[jira] [Issue Comment Deleted] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-04-24 Thread Beniamino (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Beniamino updated SPARK-2344: - Comment: was deleted (was: Hi Alex, Sorry for the late response but I'm very busy lately. I think that

[jira] [Issue Comment Deleted] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-04-24 Thread Beniamino (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Beniamino updated SPARK-2344: - Comment: was deleted (was: Hi Alex, don't worry for the late response. Break a leg (for the test) I've

[jira] [Issue Comment Deleted] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-04-24 Thread Beniamino (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Beniamino updated SPARK-2344: - Comment: was deleted (was: Hi, yes the computation of the next centers are made on the fly avoiding to

  1   2   >