[jira] [Updated] (SPARK-17617) Remainder(%) expression.eval returns incorrect result

2016-09-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-17617: Fix Version/s: 1.6.3 > Remainder(%) expression.eval returns incorrect result >

[jira] [Commented] (SPARK-17621) Accumulator value is doubled when using DataFrame.orderBy()

2016-09-21 Thread Sreelal S L (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509301#comment-15509301 ] Sreelal S L commented on SPARK-17621: - Hi. Our actual code is bit different from what i have given.

[jira] [Resolved] (SPARK-17599) Folder deletion after globbing may fail StructuredStreaming jobs

2016-09-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-17599. - Resolution: Fixed Assignee: Burak Yavuz Fix Version/s: 2.1.0 > Folder deletion

[jira] [Updated] (SPARK-17219) QuantileDiscretizer does strange things with NaN values

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17219: -- Assignee: Vincent > QuantileDiscretizer does strange things with NaN values >

[jira] [Resolved] (SPARK-17583) Remove unused rowSeparator variable and set auto-expanding buffer as default for maxCharsPerColumn option in CSV

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17583. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15138

[jira] [Updated] (SPARK-17583) Remove unused rowSeparator variable and set auto-expanding buffer as default for maxCharsPerColumn option in CSV

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17583: -- Assignee: Hyukjin Kwon > Remove unused rowSeparator variable and set auto-expanding buffer as default

[jira] [Commented] (SPARK-17621) Accumulator value is doubled when using DataFrame.orderBy()

2016-09-21 Thread Sreelal S L (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509213#comment-15509213 ] Sreelal S L commented on SPARK-17621: - Hi Sean, Thanks for your quick reply. I didnt understand

[jira] [Updated] (SPARK-17617) Remainder(%) expression.eval returns incorrect result

2016-09-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-17617: Fix Version/s: 2.0.1 > Remainder(%) expression.eval returns incorrect result >

[jira] [Updated] (SPARK-17017) Add a chiSquare Selector based on False Positive Rate (FPR) test

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17017: -- Assignee: Peng Meng > Add a chiSquare Selector based on False Positive Rate (FPR) test >

[jira] [Commented] (SPARK-17621) Accumulator value is doubled when using DataFrame.orderBy()

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509168#comment-15509168 ] Sean Owen commented on SPARK-17621: --- I think you've found the issue. You're actually evaluating

[jira] [Assigned] (SPARK-11918) Better error from WLS for cases like singular input

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11918: Assignee: Apache Spark (was: Sean Owen) > Better error from WLS for cases like singular

[jira] [Assigned] (SPARK-11918) Better error from WLS for cases like singular input

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11918: Assignee: Sean Owen (was: Apache Spark) > Better error from WLS for cases like singular

[jira] [Updated] (SPARK-11918) Better error from WLS for cases like singular input

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11918: -- Assignee: Sean Owen Labels: (was: starter) Summary: Better error from WLS for cases like

[jira] [Assigned] (SPARK-17556) Executor side broadcast for broadcast joins

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17556: Assignee: Apache Spark > Executor side broadcast for broadcast joins >

[jira] [Commented] (SPARK-17596) Streaming job lacks Scala runtime methods

2016-09-21 Thread Evgeniy Tsvigun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509229#comment-15509229 ] Evgeniy Tsvigun commented on SPARK-17596: - Thanks Sean! One more check revealed I had SPARK_HOME

[jira] [Commented] (SPARK-10835) Change Output of NGram to Array(String, True)

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509249#comment-15509249 ] Apache Spark commented on SPARK-10835: -- User 'srowen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10835) Change Output of NGram to Array(String, True)

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10835: Assignee: Apache Spark (was: yuhao yang) > Change Output of NGram to Array(String, True)

[jira] [Assigned] (SPARK-10835) Change Output of NGram to Array(String, True)

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10835: Assignee: yuhao yang (was: Apache Spark) > Change Output of NGram to Array(String, True)

[jira] [Commented] (SPARK-17556) Executor side broadcast for broadcast joins

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509228#comment-15509228 ] Apache Spark commented on SPARK-17556: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17556) Executor side broadcast for broadcast joins

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17556: Assignee: (was: Apache Spark) > Executor side broadcast for broadcast joins >

[jira] [Resolved] (SPARK-17595) Inefficient selection in Word2VecModel.findSynonyms

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17595. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15150

[jira] [Updated] (SPARK-17595) Inefficient selection in Word2VecModel.findSynonyms

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17595: -- Assignee: William Benton > Inefficient selection in Word2VecModel.findSynonyms >

[jira] [Updated] (SPARK-17617) Remainder(%) expression.eval returns incorrect result

2016-09-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-17617: Assignee: Sean Zhong > Remainder(%) expression.eval returns incorrect result >

[jira] [Resolved] (SPARK-17617) Remainder(%) expression.eval returns incorrect result

2016-09-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-17617. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15171

[jira] [Resolved] (SPARK-17017) Add a chiSquare Selector based on False Positive Rate (FPR) test

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17017. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14597

[jira] [Assigned] (SPARK-17585) PySpark SparkContext.addFile supports adding files recursively

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-17585: --- Assignee: Yanbo Liang > PySpark SparkContext.addFile supports adding files recursively >

[jira] [Resolved] (SPARK-17585) PySpark SparkContext.addFile supports adding files recursively

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17585. - Resolution: Fixed Fix Version/s: 2.1.0 > PySpark SparkContext.addFile supports adding

[jira] [Commented] (SPARK-11918) Better error from WLS for cases like singular input

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509222#comment-15509222 ] Apache Spark commented on SPARK-11918: -- User 'srowen' has created a pull request for this issue:

[jira] [Updated] (SPARK-17622) Cannot run SparkR function on Windows- Spark 2.0.0

2016-09-21 Thread renzhi he (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] renzhi he updated SPARK-17622: -- Description: sc <- sparkR.session(master="local[*]", appName="sparkR", sparkConfig =

[jira] [Updated] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17614: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) > sparkSession.read()

[jira] [Commented] (SPARK-17621) Accumulator value is doubled when using DataFrame.orderBy()

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509234#comment-15509234 ] Sean Owen commented on SPARK-17621: --- I think you're generally relying on the RDD being evaluated once,

[jira] [Closed] (SPARK-17596) Streaming job lacks Scala runtime methods

2016-09-21 Thread Evgeniy Tsvigun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Evgeniy Tsvigun closed SPARK-17596. --- Resolution: Not A Problem Found that my SPARK_HOME environment variable was pointing to a

[jira] [Created] (SPARK-17622) Cannot run SparkR function on Windows- Spark 2.0.0

2016-09-21 Thread renzhi he (JIRA)
renzhi he created SPARK-17622: - Summary: Cannot run SparkR function on Windows- Spark 2.0.0 Key: SPARK-17622 URL: https://issues.apache.org/jira/browse/SPARK-17622 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-17219) QuantileDiscretizer does strange things with NaN values

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17219. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14858

[jira] [Updated] (SPARK-17057) ProbabilisticClassifierModels' thresholds should be > 0

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17057: -- Summary: ProbabilisticClassifierModels' thresholds should be > 0 (was: ProbabilisticClassifierModels'

[jira] [Commented] (SPARK-15071) Check the result of all TPCDS queries

2016-09-21 Thread Nirman Narang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509559#comment-15509559 ] Nirman Narang commented on SPARK-15071: --- Started working on this. > Check the result of all TPCDS

[jira] [Resolved] (SPARK-17590) Analyze CTE definitions at once and allow CTE subquery to define CTE

2016-09-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-17590. --- Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 2.1.0 >

[jira] [Commented] (SPARK-9686) Spark Thrift server doesn't return correct JDBC metadata

2016-09-21 Thread Shawn Lavelle (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510098#comment-15510098 ] Shawn Lavelle commented on SPARK-9686: -- It's been a few months, any progress on this bug? > Spark

[jira] [Updated] (SPARK-17622) Cannot run create or load DF on Windows- Spark 2.0.0

2016-09-21 Thread renzhi he (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] renzhi he updated SPARK-17622: -- Description: Under spark2.0.0- on Windows- when try to load or create data with the similar codes

[jira] [Closed] (SPARK-17610) The failed stage caused by FetchFailed may never be resubmitted

2016-09-21 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Wang closed SPARK-17610. Resolution: Not A Problem > The failed stage caused by FetchFailed may never be resubmitted >

[jira] [Updated] (SPARK-17622) Cannot run create or load DF on Windows- Spark 2.0.0

2016-09-21 Thread renzhi he (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] renzhi he updated SPARK-17622: -- Description: (was: sc <- sparkR.session(master="local[*]", sparkConfig = list(spark.driver.memory

[jira] [Commented] (SPARK-17610) The failed stage caused by FetchFailed may never be resubmitted

2016-09-21 Thread Tao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509712#comment-15509712 ] Tao Wang commented on SPARK-17610: -- As reason mentioned in https://github.com/apache/spark/pull/15176,

[jira] [Commented] (SPARK-17607) --driver-url doesn't point to my master_ip.

2016-09-21 Thread Sasi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509773#comment-15509773 ] Sasi commented on SPARK-17607: -- It's different, because I was able to start my master with ip 10.5.5.2 and I

[jira] [Updated] (SPARK-17622) Cannot run create or load DF on Windows- Spark 2.0.0

2016-09-21 Thread renzhi he (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] renzhi he updated SPARK-17622: -- Summary: Cannot run create or load DF on Windows- Spark 2.0.0 (was: Cannot run SparkR function on

[jira] [Updated] (SPARK-17622) Cannot run create or load DF on Windows- Spark 2.0.0

2016-09-21 Thread renzhi he (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] renzhi he updated SPARK-17622: -- Description: Under spark2.0.0- on Windows- when try to load or create data with the similar codes

[jira] [Commented] (SPARK-17606) New batches are not created when there are 1000 created after restarting streaming from checkpoint.

2016-09-21 Thread etienne (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509821#comment-15509821 ] etienne commented on SPARK-17606: - Sorry I ask Ops for logs, but they have been lost. I have to wait

[jira] [Created] (SPARK-17623) Failed tasks end reason is always a TaskFailedReason, types should reflect this

2016-09-21 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-17623: Summary: Failed tasks end reason is always a TaskFailedReason, types should reflect this Key: SPARK-17623 URL: https://issues.apache.org/jira/browse/SPARK-17623

[jira] [Assigned] (SPARK-17623) Failed tasks end reason is always a TaskFailedReason, types should reflect this

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17623: Assignee: Apache Spark (was: Imran Rashid) > Failed tasks end reason is always a

[jira] [Commented] (SPARK-17623) Failed tasks end reason is always a TaskFailedReason, types should reflect this

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510377#comment-15510377 ] Apache Spark commented on SPARK-17623: -- User 'squito' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17623) Failed tasks end reason is always a TaskFailedReason, types should reflect this

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17623: Assignee: Imran Rashid (was: Apache Spark) > Failed tasks end reason is always a

[jira] [Commented] (SPARK-17044) Add window function test in SQLQueryTestSuite

2016-09-21 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510464#comment-15510464 ] Dongjoon Hyun commented on SPARK-17044: --- Hi, [~rxin] Could you review this issue? > Add window

[jira] [Created] (SPARK-17624) Flaky test? StateStoreSuite maintenance

2016-09-21 Thread Adam Roberts (JIRA)
Adam Roberts created SPARK-17624: Summary: Flaky test? StateStoreSuite maintenance Key: SPARK-17624 URL: https://issues.apache.org/jira/browse/SPARK-17624 Project: Spark Issue Type: Test

[jira] [Updated] (SPARK-17622) Cannot run create or load DF on Windows- Spark 2.0.0

2016-09-21 Thread renzhi he (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] renzhi he updated SPARK-17622: -- Description: sc <- sparkR.session(master="local[*]", sparkConfig = list(spark.driver.memory = "2g"))

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-09-21 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510198#comment-15510198 ] Seth Hendrickson commented on SPARK-17134: -- Hmm, it would be nice to see this vs the old mlor in

[jira] [Assigned] (SPARK-17625) expectedOutputAttributes should be set when converting SimpleCatalogRelation to LogicalRelation

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17625: Assignee: Apache Spark > expectedOutputAttributes should be set when converting

[jira] [Commented] (SPARK-17625) expectedOutputAttributes should be set when converting SimpleCatalogRelation to LogicalRelation

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510658#comment-15510658 ] Apache Spark commented on SPARK-17625: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Issue Comment Deleted] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Paul Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Wu updated SPARK-17614: Comment: was deleted (was: Create pull request: https://github.com/apache/spark/pull/15183) >

[jira] [Comment Edited] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Paul Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510626#comment-15510626 ] Paul Wu edited comment on SPARK-17614 at 9/21/16 5:42 PM: -- No, Custom

[jira] [Created] (SPARK-17625) expectedOutputAttributes should be set when converting SimpleCatalogRelation to LogicalRelation

2016-09-21 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-17625: Summary: expectedOutputAttributes should be set when converting SimpleCatalogRelation to LogicalRelation Key: SPARK-17625 URL: https://issues.apache.org/jira/browse/SPARK-17625

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-09-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510672#comment-15510672 ] DB Tsai commented on SPARK-17134: - I'll try the old mlor in rdd tonight when the cluster is not busy.

[jira] [Created] (SPARK-17626) TPC-DS performance improvements using star-schema heuristics

2016-09-21 Thread Ioana Delaney (JIRA)
Ioana Delaney created SPARK-17626: - Summary: TPC-DS performance improvements using star-schema heuristics Key: SPARK-17626 URL: https://issues.apache.org/jira/browse/SPARK-17626 Project: Spark

[jira] [Commented] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510707#comment-15510707 ] Sean Owen commented on SPARK-17614: --- Yup, that much is clearly a bug. Go for a fix, anyone who wants to

[jira] [Commented] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Paul Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510709#comment-15510709 ] Paul Wu commented on SPARK-17614: - Create pull request: https://github.com/apache/spark/pull/15183 >

[jira] [Commented] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510704#comment-15510704 ] Apache Spark commented on SPARK-17614: -- User 'paulzwu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17614: Assignee: (was: Apache Spark) > sparkSession.read() .jdbc(***) use the sql syntax

[jira] [Assigned] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17614: Assignee: Apache Spark > sparkSession.read() .jdbc(***) use the sql syntax "where 1=0"

[jira] [Updated] (SPARK-17626) TPC-DS performance improvements using star-schema heuristics

2016-09-21 Thread Ioana Delaney (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ioana Delaney updated SPARK-17626: -- Description: *TPC-DS performance improvements using star-schema heuristics* \\ \\ TPC-DS

[jira] [Commented] (SPARK-11702) Guava ClassLoading Issue When Using Different Hive Metastore Version

2016-09-21 Thread Joey Paskhay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510748#comment-15510748 ] Joey Paskhay commented on SPARK-11702: -- Apologies for the super late response, Sabs. In case you or

[jira] [Commented] (SPARK-14849) shuffle broken when accessing standalone cluster through NAT

2016-09-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510745#comment-15510745 ] Shixiong Zhu commented on SPARK-14849: -- [~skyluc] do you still see the error in Spark 2.0.0? >

[jira] [Commented] (SPARK-11918) Better error from WLS for cases like singular input

2016-09-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510792#comment-15510792 ] DB Tsai commented on SPARK-11918: - +1 on QR decomposition. We may add a feature that using LBFGS/OWLQN to

[jira] [Commented] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Paul Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510525#comment-15510525 ] Paul Wu commented on SPARK-17614: - Thanks. I tried to register my custom dialect as following, but it

[jira] [Updated] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Paul Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Wu updated SPARK-17614: Priority: Major (was: Minor) > sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that

[jira] [Commented] (SPARK-17614) sparkSession.read() .jdbc(***) use the sql syntax "where 1=0" that Cassandra does not support

2016-09-21 Thread Paul Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510626#comment-15510626 ] Paul Wu commented on SPARK-17614: - No, Custom JdbcDialect won't resolve the problem since DataFrameReader

[jira] [Assigned] (SPARK-17625) expectedOutputAttributes should be set when converting SimpleCatalogRelation to LogicalRelation

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17625: Assignee: (was: Apache Spark) > expectedOutputAttributes should be set when

[jira] [Updated] (SPARK-17626) TPC-DS performance improvements using star-schema heuristics

2016-09-21 Thread Ioana Delaney (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ioana Delaney updated SPARK-17626: -- Attachment: StarSchemaJoinReordering.pptx > TPC-DS performance improvements using star-schema

[jira] [Updated] (SPARK-11702) Guava ClassLoading Issue When Using Different Hive Metastore Version

2016-09-21 Thread Joey Paskhay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joey Paskhay updated SPARK-11702: - Description: A Guava classloading error can occur when using a different version of the Hive

[jira] [Commented] (SPARK-16407) Allow users to supply custom StreamSinkProviders

2016-09-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510787#comment-15510787 ] Michael Armbrust commented on SPARK-16407: -- I'm still a little unclear on the use cases we are

[jira] [Resolved] (SPARK-17418) Spark release must NOT distribute Kinesis related assembly artifact

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-17418. Resolution: Fixed Assignee: Josh Rosen Fix Version/s: 2.1.0

[jira] [Resolved] (SPARK-17616) Getting "java.lang.RuntimeException: Distinct columns cannot exist in Aggregate "

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-17616. Resolution: Duplicate > Getting "java.lang.RuntimeException: Distinct columns cannot exist in >

[jira] [Resolved] (SPARK-11918) Better error from WLS for cases like singular input

2016-09-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-11918. - Resolution: Fixed > Better error from WLS for cases like singular input >

[jira] [Updated] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17618: --- Description: We were getting incorrect results from the DataFrame except method - all rows were

[jira] [Updated] (SPARK-17592) SQL: CAST string as INT inconsistent with Hive

2016-09-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17592: Labels: (was: correctness) > SQL: CAST string as INT inconsistent with Hive >

[jira] [Updated] (SPARK-17592) SQL: CAST string as INT inconsistent with Hive

2016-09-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17592: Fix Version/s: (was: 2.0.1) (was: 2.1.0) > SQL: CAST string as INT

[jira] [Updated] (SPARK-17019) Expose off-heap memory usage in various places

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17019: --- Target Version/s: 2.1.0 (was: 2.0.1, 2.1.0) > Expose off-heap memory usage in various places >

[jira] [Updated] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17618: --- Affects Version/s: 1.6.2 > Dataframe except returns incorrect results when combined with coalesce >

[jira] [Updated] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17618: --- Labels: correctness (was: ) > Dataframe except returns incorrect results when combined with

[jira] [Updated] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17618: --- Priority: Blocker (was: Minor) > Dataframe except returns incorrect results when combined with

[jira] [Updated] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17618: --- Target Version/s: 1.6.3 > Dataframe except returns incorrect results when combined with coalesce >

[jira] [Commented] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510934#comment-15510934 ] Josh Rosen commented on SPARK-17618: Yep, the problem is that {{Coalesce}} advertises that it accepts

[jira] [Commented] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15510874#comment-15510874 ] Josh Rosen commented on SPARK-17618: It looks like this affects 1.6.2 as well, but I was unable to

[jira] [Commented] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15511034#comment-15511034 ] Apache Spark commented on SPARK-17618: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17618: Assignee: Apache Spark (was: Josh Rosen) > Dataframe except returns incorrect results

[jira] [Assigned] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17618: Assignee: Josh Rosen (was: Apache Spark) > Dataframe except returns incorrect results

[jira] [Assigned] (SPARK-17618) Dataframe except returns incorrect results when combined with coalesce

2016-09-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-17618: -- Assignee: Josh Rosen > Dataframe except returns incorrect results when combined with coalesce

[jira] [Assigned] (SPARK-17628) Name of "object StreamingExamples" should be more self-explanatory

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17628: Assignee: Apache Spark > Name of "object StreamingExamples" should be more

[jira] [Reopened] (SPARK-14536) NPE in JDBCRDD when array column contains nulls (postgresql)

2016-09-21 Thread Suresh Thalamati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Thalamati reopened SPARK-14536: -- SPARK-10186 added array data type support for postgres in 1.6. NPE issues still

[jira] [Commented] (SPARK-15717) Cannot perform RDD operations on a checkpointed VertexRDD.

2016-09-21 Thread Asher Krim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15511727#comment-15511727 ] Asher Krim commented on SPARK-15717: Any update on this issue? We are experiencing

[jira] [Commented] (SPARK-14536) NPE in JDBCRDD when array column contains nulls (postgresql)

2016-09-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15511837#comment-15511837 ] Hyukjin Kwon commented on SPARK-14536: -- I see. I rushed to read this and didn't noticed that this is

[jira] [Assigned] (SPARK-17616) Getting "java.lang.RuntimeException: Distinct columns cannot exist in Aggregate "

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17616: Assignee: Herman van Hovell (was: Apache Spark) > Getting "java.lang.RuntimeException:

[jira] [Assigned] (SPARK-17616) Getting "java.lang.RuntimeException: Distinct columns cannot exist in Aggregate "

2016-09-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17616: Assignee: Apache Spark (was: Herman van Hovell) > Getting "java.lang.RuntimeException:

  1   2   >