[jira] [Commented] (SPARK-7213) Exception while copying Hadoop config files due to permission issues

2015-04-28 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518188#comment-14518188 ] Nishkam Ravi commented on SPARK-7213: - Exception in thread main

[jira] [Created] (SPARK-7206) Gaussian Mixture Model (GMM) improvements

2015-04-28 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7206: Summary: Gaussian Mixture Model (GMM) improvements Key: SPARK-7206 URL: https://issues.apache.org/jira/browse/SPARK-7206 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-7207) Add new spark.ml subpackages to SparkBuild

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7207: --- Assignee: Joseph K. Bradley (was: Apache Spark) Add new spark.ml subpackages to SparkBuild

[jira] [Assigned] (SPARK-7207) Add new spark.ml subpackages to SparkBuild

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7207: --- Assignee: Apache Spark (was: Joseph K. Bradley) Add new spark.ml subpackages to SparkBuild

[jira] [Created] (SPARK-7208) Add SparseMatrix to __all__ list in linalg.py

2015-04-28 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7208: Summary: Add SparseMatrix to __all__ list in linalg.py Key: SPARK-7208 URL: https://issues.apache.org/jira/browse/SPARK-7208 Project: Spark Issue

[jira] [Commented] (SPARK-7207) Add new spark.ml subpackages to SparkBuild

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518069#comment-14518069 ] Apache Spark commented on SPARK-7207: - User 'jkbradley' has created a pull request for

[jira] [Created] (SPARK-7211) Improvements for FPGrowth

2015-04-28 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7211: Summary: Improvements for FPGrowth Key: SPARK-7211 URL: https://issues.apache.org/jira/browse/SPARK-7211 Project: Spark Issue Type: Umbrella

[jira] [Assigned] (SPARK-6756) Add compress() to Vector

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6756: --- Assignee: Apache Spark (was: Xiangrui Meng) Add compress() to Vector

[jira] [Resolved] (SPARK-7201) Move identifiable to ml.util

2015-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7201. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5749

[jira] [Created] (SPARK-7207) Add new spark.ml subpackages to SparkBuild

2015-04-28 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7207: Summary: Add new spark.ml subpackages to SparkBuild Key: SPARK-7207 URL: https://issues.apache.org/jira/browse/SPARK-7207 Project: Spark Issue Type:

[jira] [Updated] (SPARK-7209) Adding new Manning book Spark in Action to the official Spark Webpage

2015-04-28 Thread Aleksandar Dragosavljevic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aleksandar Dragosavljevic updated SPARK-7209: - Attachment: Spark in Action.jpg Book cover Adding new Manning book

[jira] [Created] (SPARK-7205) Support local ivy cache in --packages

2015-04-28 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7205: -- Summary: Support local ivy cache in --packages Key: SPARK-7205 URL: https://issues.apache.org/jira/browse/SPARK-7205 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-7204) Call sites in UI are not accurate for DataFrame operations

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7204: --- Assignee: Patrick Wendell (was: Apache Spark) Call sites in UI are not accurate for

[jira] [Commented] (SPARK-7204) Call sites in UI are not accurate for DataFrame operations

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518047#comment-14518047 ] Apache Spark commented on SPARK-7204: - User 'pwendell' has created a pull request for

[jira] [Assigned] (SPARK-7204) Call sites in UI are not accurate for DataFrame operations

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7204: --- Assignee: Apache Spark (was: Patrick Wendell) Call sites in UI are not accurate for

[jira] [Assigned] (SPARK-7213) Exception while copying Hadoop config files due to permission issues

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7213: --- Assignee: (was: Apache Spark) Exception while copying Hadoop config files due to

[jira] [Commented] (SPARK-7213) Exception while copying Hadoop config files due to permission issues

2015-04-28 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518197#comment-14518197 ] Nishkam Ravi commented on SPARK-7213: - PR: https://github.com/apache/spark/pull/5760/

[jira] [Commented] (SPARK-6756) Add compress() to Vector

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518008#comment-14518008 ] Apache Spark commented on SPARK-6756: - User 'mengxr' has created a pull request for

[jira] [Assigned] (SPARK-6756) Add compress() to Vector

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6756: --- Assignee: Xiangrui Meng (was: Apache Spark) Add compress() to Vector

[jira] [Created] (SPARK-7210) Test matrix decompositions for speed vs. numerical stability for Gaussians

2015-04-28 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7210: Summary: Test matrix decompositions for speed vs. numerical stability for Gaussians Key: SPARK-7210 URL: https://issues.apache.org/jira/browse/SPARK-7210

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-04-28 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518133#comment-14518133 ] Pedro Rodriguez commented on SPARK-5556: With the refactoring done, I can get to

[jira] [Created] (SPARK-7212) Frequent pattern mining for sequential item sets

2015-04-28 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7212: Summary: Frequent pattern mining for sequential item sets Key: SPARK-7212 URL: https://issues.apache.org/jira/browse/SPARK-7212 Project: Spark Issue

[jira] [Created] (SPARK-7213) Exception while copying Hadoop config files due to permission issues

2015-04-28 Thread Nishkam Ravi (JIRA)
Nishkam Ravi created SPARK-7213: --- Summary: Exception while copying Hadoop config files due to permission issues Key: SPARK-7213 URL: https://issues.apache.org/jira/browse/SPARK-7213 Project: Spark

[jira] [Resolved] (SPARK-7187) Exceptions in SerializationDebugger should not crash user code

2015-04-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7187. Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Target

[jira] [Resolved] (SPARK-7135) Expression for monotonically increasing IDs

2015-04-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7135. Resolution: Fixed Fix Version/s: 1.4.0 Expression for monotonically increasing IDs

[jira] [Updated] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-04-28 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-6980: Attachment: Spark-6980-Test.scala Akka timeout exceptions indicate which conf controls them

[jira] [Updated] (SPARK-7191) SharedParamsCodeGen doesn't import org.apache.spark.util.Utils

2015-04-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7191: --- Description: When we run `build/sbt mllib/runMain org.apache.spark.ml.param.shared.SharedParamsCodeGen`, the

[jira] [Closed] (SPARK-7191) SharedParamsCodeGen doesn't import org.apache.spark.util.Utils

2015-04-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-7191. -- Resolution: Not A Problem sorry, it's my rebase issue. not a bug. SharedParamsCodeGen doesn't import

[jira] [Commented] (SPARK-6530) ChiSqSelector transformer

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516626#comment-14516626 ] Apache Spark commented on SPARK-6530: - User 'yinxusen' has created a pull request for

[jira] [Assigned] (SPARK-6530) ChiSqSelector transformer

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6530: --- Assignee: (was: Apache Spark) ChiSqSelector transformer -

[jira] [Resolved] (SPARK-6352) Supporting non-default OutputCommitter when using saveAsParquetFile

2015-04-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6352. --- Resolution: Fixed Issue resolved by pull request 5525 [https://github.com/apache/spark/pull/5525]

[jira] [Commented] (SPARK-5529) BlockManager heartbeat expiration does not kill executor

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517498#comment-14517498 ] Apache Spark commented on SPARK-5529: - User 'alexrovner' has created a pull request

[jira] [Created] (SPARK-7199) Add date and timestamp support to UnsafeRow

2015-04-28 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-7199: - Summary: Add date and timestamp support to UnsafeRow Key: SPARK-7199 URL: https://issues.apache.org/jira/browse/SPARK-7199 Project: Spark Issue Type: New Feature

[jira] [Comment Edited] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-04-28 Thread Jeremy Hanna (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517546#comment-14517546 ] Jeremy Hanna edited comment on SPARK-5388 at 4/28/15 6:08 PM: --

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-04-28 Thread Jeremy Hanna (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517546#comment-14517546 ] Jeremy Hanna commented on SPARK-5388: - Would people be amenable to addition features

[jira] [Commented] (SPARK-6258) Python MLlib API missing items: Clustering

2015-04-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517470#comment-14517470 ] Joseph K. Bradley commented on SPARK-6258: -- About a question asked offline:

[jira] [Commented] (SPARK-5529) BlockManager heartbeat expiration does not kill executor

2015-04-28 Thread Alex Rovner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517499#comment-14517499 ] Alex Rovner commented on SPARK-5529: Sorry about all the pull requests. Here is one

[jira] [Created] (SPARK-7200) Tungsten test suites should fail if memory leak is detected

2015-04-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7200: -- Summary: Tungsten test suites should fail if memory leak is detected Key: SPARK-7200 URL: https://issues.apache.org/jira/browse/SPARK-7200 Project: Spark Issue

[jira] [Commented] (SPARK-6943) Graphically show RDD's included in a stage

2015-04-28 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517582#comment-14517582 ] Kay Ousterhout commented on SPARK-6943: --- After looking at the 2 PRs for this (which

[jira] [Assigned] (SPARK-7045) Word2Vec: avoid intermediate representation when creating model

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7045: --- Assignee: (was: Apache Spark) Word2Vec: avoid intermediate representation when creating

[jira] [Commented] (SPARK-7045) Word2Vec: avoid intermediate representation when creating model

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517621#comment-14517621 ] Apache Spark commented on SPARK-7045: - User 'MechCoder' has created a pull request for

[jira] [Assigned] (SPARK-7045) Word2Vec: avoid intermediate representation when creating model

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7045: --- Assignee: Apache Spark Word2Vec: avoid intermediate representation when creating model

[jira] [Commented] (SPARK-7201) Move identifiable to ml.util

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517642#comment-14517642 ] Apache Spark commented on SPARK-7201: - User 'mengxr' has created a pull request for

[jira] [Assigned] (SPARK-7201) Move identifiable to ml.util

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7201: --- Assignee: Xiangrui Meng (was: Apache Spark) Move identifiable to ml.util

[jira] [Assigned] (SPARK-7201) Move identifiable to ml.util

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7201: --- Assignee: Apache Spark (was: Xiangrui Meng) Move identifiable to ml.util

[jira] [Created] (SPARK-7202) Add SparseMatrixPickler to SerDe

2015-04-28 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-7202: -- Summary: Add SparseMatrixPickler to SerDe Key: SPARK-7202 URL: https://issues.apache.org/jira/browse/SPARK-7202 Project: Spark Issue Type: New Feature

[jira] [Assigned] (SPARK-7188) Support math functions in DataFrames in Python

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7188: --- Assignee: Apache Spark (was: Burak Yavuz) Support math functions in DataFrames in Python

[jira] [Commented] (SPARK-6257) Python MLlib API missing items: Recommendation

2015-04-28 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517675#comment-14517675 ] Manoj Kumar commented on SPARK-6257: Btw, I'm able to edit issues of other people as

[jira] [Commented] (SPARK-7188) Support math functions in DataFrames in Python

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517676#comment-14517676 ] Apache Spark commented on SPARK-7188: - User 'brkyvz' has created a pull request for

[jira] [Updated] (SPARK-7195) Can't start spark shell or pyspark in Windows 7

2015-04-28 Thread Mark Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Smiley updated SPARK-7195: --- Attachment: spark_bug.png Sean, I did look around, and had found the old bug and its duplicate, but

[jira] [Updated] (SPARK-5389) spark-shell.cmd does not run from DOS Windows 7

2015-04-28 Thread Mark Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Smiley updated SPARK-5389: --- Attachment: spark_bug.png Related python shell error messages on startup. spark-shell.cmd does not

[jira] [Commented] (SPARK-6257) Python MLlib API missing items: Recommendation

2015-04-28 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517626#comment-14517626 ] Manoj Kumar commented on SPARK-6257: Sounds great to me. ! Thanks a lot :) Python

[jira] [Assigned] (SPARK-7188) Support math functions in DataFrames in Python

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7188: --- Assignee: Burak Yavuz (was: Apache Spark) Support math functions in DataFrames in Python

[jira] [Updated] (SPARK-7202) Add SparseMatrixPickler to SerDe

2015-04-28 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-7202: --- Priority: Major (was: Minor) Add SparseMatrixPickler to SerDe

[jira] [Resolved] (SPARK-7185) Python API for math functions in DataFrames

2015-04-28 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz resolved SPARK-7185. Resolution: Duplicate Python API for math functions in DataFrames

[jira] [Commented] (SPARK-5389) spark-shell.cmd does not run from DOS Windows 7

2015-04-28 Thread Mark Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517695#comment-14517695 ] Mark Smiley commented on SPARK-5389: I have the same issue on Spark 1.3.1 using

[jira] [Created] (SPARK-7201) Move identifiable to ml.util

2015-04-28 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7201: Summary: Move identifiable to ml.util Key: SPARK-7201 URL: https://issues.apache.org/jira/browse/SPARK-7201 Project: Spark Issue Type: Improvement

[jira] [Closed] (SPARK-4286) Support External Shuffle Service with Mesos integration

2015-04-28 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4286. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Iulian Dragos (was: Timothy Chen)

[jira] [Comment Edited] (SPARK-5389) spark-shell.cmd does not run from DOS Windows 7

2015-04-28 Thread Mark Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517695#comment-14517695 ] Mark Smiley edited comment on SPARK-5389 at 4/28/15 7:13 PM: -

[jira] [Updated] (SPARK-7179) Add pattern after show tables to filter desire tablename

2015-04-28 Thread baishuo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] baishuo updated SPARK-7179: --- Priority: Minor (was: Major) Add pattern after show tables to filter desire tablename

[jira] [Resolved] (SPARK-6829) Support math functions in DataFrames

2015-04-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6829. Resolution: Fixed Fix Version/s: 1.4.0 Support math functions in DataFrames

[jira] [Commented] (SPARK-7181) External Sorter merge with aggregation go to an infinite loop when we have a total ordering

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516489#comment-14516489 ] Apache Spark commented on SPARK-7181: - User 'chouqin' has created a pull request for

[jira] [Assigned] (SPARK-7181) External Sorter merge with aggregation go to an infinite loop when we have a total ordering

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7181: --- Assignee: Apache Spark External Sorter merge with aggregation go to an infinite loop when

[jira] [Assigned] (SPARK-7181) External Sorter merge with aggregation go to an infinite loop when we have a total ordering

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7181: --- Assignee: (was: Apache Spark) External Sorter merge with aggregation go to an infinite

[jira] [Created] (SPARK-7188) Support math functions in DataFrames in Python

2015-04-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7188: -- Summary: Support math functions in DataFrames in Python Key: SPARK-7188 URL: https://issues.apache.org/jira/browse/SPARK-7188 Project: Spark Issue Type:

[jira] [Commented] (SPARK-7182) [SQL] Can't remove columns from DataFrame or save DataFrame from a join due to duplicate columns

2015-04-28 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516551#comment-14516551 ] Adrian Wang commented on SPARK-7182: you should use like j = t1.join(t2, t1.a==t2.a

[jira] [Updated] (SPARK-7181) External Sorter merge with aggregation go to an infinite loop when we have a total ordering

2015-04-28 Thread Qiping Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qiping Li updated SPARK-7181: - Description: In the function {{mergeWithAggregation}} of {{ExternalSorter.scala}}, when there is a total

[jira] [Updated] (SPARK-7181) External Sorter merge with aggregation go to an infinite loop when we have a total ordering

2015-04-28 Thread Qiping Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qiping Li updated SPARK-7181: - Summary: External Sorter merge with aggregation go to an infinite loop when we have a total ordering

[jira] [Resolved] (SPARK-5946) Add Python API for Kafka direct stream

2015-04-28 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-5946. -- Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Saisai Shao Add Python API

[jira] [Created] (SPARK-7189) History server will always reload the same file if no log file is updated

2015-04-28 Thread Zhang, Liye (JIRA)
Zhang, Liye created SPARK-7189: -- Summary: History server will always reload the same file if no log file is updated Key: SPARK-7189 URL: https://issues.apache.org/jira/browse/SPARK-7189 Project: Spark

[jira] [Assigned] (SPARK-7183) Memory leak in netty shuffle with spark standalone cluster

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7183: --- Assignee: (was: Apache Spark) Memory leak in netty shuffle with spark standalone

[jira] [Assigned] (SPARK-7183) Memory leak in netty shuffle with spark standalone cluster

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7183: --- Assignee: Apache Spark Memory leak in netty shuffle with spark standalone cluster

[jira] [Commented] (SPARK-7183) Memory leak in netty shuffle with spark standalone cluster

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516765#comment-14516765 ] Apache Spark commented on SPARK-7183: - User 'viirya' has created a pull request for

[jira] [Created] (SPARK-7193) Spark on Mesos may need more tests for spark 1.3.1 release

2015-04-28 Thread Littlestar (JIRA)
Littlestar created SPARK-7193: - Summary: Spark on Mesos may need more tests for spark 1.3.1 release Key: SPARK-7193 URL: https://issues.apache.org/jira/browse/SPARK-7193 Project: Spark Issue

[jira] [Updated] (SPARK-7192) Pyspark casts hive bigint to int

2015-04-28 Thread Tamas Jambor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tamas Jambor updated SPARK-7192: Component/s: PySpark Pyspark casts hive bigint to int

[jira] [Created] (SPARK-7192) Pyspark casts hive bigint to int

2015-04-28 Thread Tamas Jambor (JIRA)
Tamas Jambor created SPARK-7192: --- Summary: Pyspark casts hive bigint to int Key: SPARK-7192 URL: https://issues.apache.org/jira/browse/SPARK-7192 Project: Spark Issue Type: Bug Affects

[jira] [Commented] (SPARK-3808) PySpark fails to start in Windows

2015-04-28 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516705#comment-14516705 ] Masayoshi TSUZUKI commented on SPARK-3808: -- It seems to fail to run jvm. Perhaps

[jira] [Updated] (SPARK-7192) Pyspark casts hive bigint to int

2015-04-28 Thread Tamas Jambor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tamas Jambor updated SPARK-7192: Component/s: SQL Pyspark casts hive bigint to int

[jira] [Assigned] (SPARK-6965) StringIndexer should convert input to Strings

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6965: --- Assignee: Apache Spark (was: Xiangrui Meng) StringIndexer should convert input to Strings

[jira] [Closed] (SPARK-5932) Use consistent naming for byte properties

2015-04-28 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5932. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Ilya Ganelin (was: Andrew Or) Use

[jira] [Updated] (SPARK-7202) Add SparseMatrixPickler to SerDe

2015-04-28 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-7202: --- Priority: Minor (was: Major) Add SparseMatrixPickler to SerDe

[jira] [Updated] (SPARK-5389) spark-shell.cmd does not run from DOS Windows 7

2015-04-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5389: - Component/s: Windows PySpark spark-shell.cmd does not run from DOS Windows 7

[jira] [Updated] (SPARK-6197) handle json parse exception for eventlog file not finished writing

2015-04-28 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6197: - Fix Version/s: 1.3.1 handle json parse exception for eventlog file not finished writing

[jira] [Commented] (SPARK-6994) Allow to fetch field values by name in sql.Row

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517881#comment-14517881 ] Apache Spark commented on SPARK-6994: - User 'szheng79' has created a pull request for

[jira] [Comment Edited] (SPARK-6994) Allow to fetch field values by name in sql.Row

2015-04-28 Thread Shuai Zheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517882#comment-14517882 ] Shuai Zheng edited comment on SPARK-6994 at 4/28/15 7:48 PM: -

[jira] [Commented] (SPARK-4414) SparkContext.wholeTextFiles Doesn't work with S3 Buckets

2015-04-28 Thread Tristan Nixon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517886#comment-14517886 ] Tristan Nixon commented on SPARK-4414: -- Thanks, [~petedmarsh], I was having this same

[jira] [Commented] (SPARK-6965) StringIndexer should convert input to Strings

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517700#comment-14517700 ] Apache Spark commented on SPARK-6965: - User 'mengxr' has created a pull request for

[jira] [Assigned] (SPARK-6965) StringIndexer should convert input to Strings

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6965: --- Assignee: Xiangrui Meng (was: Apache Spark) StringIndexer should convert input to Strings

[jira] [Commented] (SPARK-7195) Can't start spark shell or pyspark in Windows 7

2015-04-28 Thread Mark Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517826#comment-14517826 ] Mark Smiley commented on SPARK-7195: Sean, I added my bug as a comment to the old bug

[jira] [Commented] (SPARK-7178) Improve DataFrame documentation and code samples

2015-04-28 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517858#comment-14517858 ] Chris Fregly commented on SPARK-7178: - added this to the forums to address the AND and

[jira] [Commented] (SPARK-6197) handle json parse exception for eventlog file not finished writing

2015-04-28 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517866#comment-14517866 ] Andrew Or commented on SPARK-6197: -- https://github.com/apache/spark/pull/5736 handle

[jira] [Commented] (SPARK-6994) Allow to fetch field values by name in sql.Row

2015-04-28 Thread Shuai Zheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14517882#comment-14517882 ] Shuai Zheng commented on SPARK-6994: I create one more pull request:

[jira] [Commented] (SPARK-6314) Failed to load application log data from FileStatus

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516452#comment-14516452 ] Apache Spark commented on SPARK-6314: - User 'liyezhang556520' has created a pull

[jira] [Commented] (SPARK-7180) SerializationDebugger fails with ArrayOutOfBoundsException

2015-04-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516536#comment-14516536 ] Patrick Wendell commented on SPARK-7180: /cc [~rxin] SerializationDebugger fails

[jira] [Assigned] (SPARK-7176) Add validation functionality to individual Param

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7176: --- Assignee: Joseph K. Bradley (was: Apache Spark) Add validation functionality to individual

[jira] [Commented] (SPARK-7176) Add validation functionality to individual Param

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516544#comment-14516544 ] Apache Spark commented on SPARK-7176: - User 'jkbradley' has created a pull request for

[jira] [Assigned] (SPARK-7176) Add validation functionality to individual Param

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7176: --- Assignee: Apache Spark (was: Joseph K. Bradley) Add validation functionality to individual

[jira] [Updated] (SPARK-7189) History server will always reload the same file even when no log file is updated

2015-04-28 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhang, Liye updated SPARK-7189: --- Summary: History server will always reload the same file even when no log file is updated (was:

[jira] [Created] (SPARK-7190) UTF8String backed by binary data

2015-04-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7190: -- Summary: UTF8String backed by binary data Key: SPARK-7190 URL: https://issues.apache.org/jira/browse/SPARK-7190 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-04-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516600#comment-14516600 ] Apache Spark commented on SPARK-6980: - User 'BryanCutler' has created a pull request

<    1   2   3   >