[jira] [Commented] (SPARK-21118) OOM with 2 handred million vertex when mitrx multply

2017-06-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16051641#comment-16051641 ] Lorenz Bühmann commented on SPARK-21118: The first point would be to u

[jira] [Commented] (SPARK-650) Add a "setup hook" API for running initialization code on each executor

2017-06-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16054162#comment-16054162 ] Michael Schmeißer commented on SPARK-650: - [~riteshtijoriwala] - Sorry, but

[jira] [Updated] (SPARK-21287) Cannot use Int.MIN_VALUE as Spark SQL fetchsize

2017-07-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21287: --- Summary: Cannot use Int.MIN_VALUE as Spark SQL fetchsize (was: Cannot use Iint.MIN_VALUE as

[jira] [Created] (SPARK-21287) Cannot use Iint.MIN_VALUE as Spark SQL fetchsize

2017-07-03 Thread JIRA
Maciej Bryński created SPARK-21287: -- Summary: Cannot use Iint.MIN_VALUE as Spark SQL fetchsize Key: SPARK-21287 URL: https://issues.apache.org/jira/browse/SPARK-21287 Project: Spark Issue

[jira] [Updated] (SPARK-21287) Cannot use Int.MIN_VALUE as Spark SQL fetchsize

2017-07-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21287: --- Description: MySQL JDBC driver gives possibility to not store ResultSet in memory. We can do

[jira] [Commented] (SPARK-21287) Cannot use Int.MIN_VALUE as Spark SQL fetchsize

2017-07-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16072404#comment-16072404 ] Maciej Bryński commented on SPARK-21287: No. It's not the same like

[jira] [Commented] (SPARK-21287) Cannot use Int.MIN_VALUE as Spark SQL fetchsize

2017-07-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16072474#comment-16072474 ] Maciej Bryński commented on SPARK-21287: Quote {code} By default, ResultSets

[jira] [Comment Edited] (SPARK-21287) Cannot use Int.MIN_VALUE as Spark SQL fetchsize

2017-07-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16072474#comment-16072474 ] Maciej Bryński edited comment on SPARK-21287 at 7/3/17 1:5

[jira] [Created] (SPARK-21439) Cannot use Spark with Python ABCmeta (exception from cloudpickle)

2017-07-17 Thread JIRA
Maciej Bryński created SPARK-21439: -- Summary: Cannot use Spark with Python ABCmeta (exception from cloudpickle) Key: SPARK-21439 URL: https://issues.apache.org/jira/browse/SPARK-21439 Project: Spark

[jira] [Updated] (SPARK-21439) Cannot use Spark with Python ABCmeta (exception from cloudpickle)

2017-07-17 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21439: --- Component/s: PySpark > Cannot use Spark with Python ABCmeta (exception from cloudpic

[jira] [Commented] (SPARK-21439) Cannot use Spark with Python ABCmeta (exception from cloudpickle)

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092744#comment-16092744 ] Maciej Bryński commented on SPARK-21439: I think is a Spark problem with Py

[jira] [Commented] (SPARK-12717) pyspark broadcast fails when using multiple threads

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092756#comment-16092756 ] Maciej Bryński commented on SPARK-12717: Any progress with this error ? I

[jira] [Created] (SPARK-21470) Spark History server doesn't support HDFS HA

2017-07-19 Thread JIRA
Maciej Bryński created SPARK-21470: -- Summary: Spark History server doesn't support HDFS HA Key: SPARK-21470 URL: https://issues.apache.org/jira/browse/SPARK-21470 Project: Spark Issue

[jira] [Created] (SPARK-21471) Read binary file error in Spark Streaming

2017-07-19 Thread JIRA
Lê Văn Thanh created SPARK-21471: Summary: Read binary file error in Spark Streaming Key: SPARK-21471 URL: https://issues.apache.org/jira/browse/SPARK-21471 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-21471) Read binary file error in Spark Streaming

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lê Văn Thanh updated SPARK-21471: - Description: My client using GZIPOutputStream to compressed the data and push to my server

[jira] [Updated] (SPARK-21470) [SPARK 2.2 Regression] Spark History server doesn't support HDFS HA

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21470: --- Summary: [SPARK 2.2 Regression] Spark History server doesn't support HDFS HA (was:

[jira] [Commented] (SPARK-21439) Cannot use Spark with Python ABCmeta (exception from cloudpickle)

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092888#comment-16092888 ] Maciej Bryński commented on SPARK-21439: https://github.com/cloud

[jira] [Comment Edited] (SPARK-21439) Cannot use Spark with Python ABCmeta (exception from cloudpickle)

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092888#comment-16092888 ] Maciej Bryński edited comment on SPARK-21439 at 7/19/17 10:1

[jira] [Updated] (SPARK-11248) Spark hivethriftserver is using the wrong user to while getting HDFS permissions

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-11248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-11248: --- Affects Version/s: 2.1.1 > Spark hivethriftserver is using the wrong user to while gett

[jira] [Updated] (SPARK-11248) Spark hivethriftserver is using the wrong user to while getting HDFS permissions

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-11248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-11248: --- Affects Version/s: 2.2.0 > Spark hivethriftserver is using the wrong user to while gett

[jira] [Commented] (SPARK-11248) Spark hivethriftserver is using the wrong user to while getting HDFS permissions

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-11248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093055#comment-16093055 ] Maciej Bryński commented on SPARK-11248: I have similar issue in Spark 2

[jira] [Commented] (SPARK-5159) Thrift server does not respect hive.server2.enable.doAs=true

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-5159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093105#comment-16093105 ] Maciej Bryński commented on SPARK-5159: --- Still existed in Spark 2.2.0. Prob

[jira] [Commented] (SPARK-21470) [SPARK 2.2 Regression] Spark History server doesn't support HDFS HA

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093716#comment-16093716 ] Maciej Bryński commented on SPARK-21470: [~vanzin] I tried. {code} /etc/ha

[jira] [Comment Edited] (SPARK-21470) [SPARK 2.2 Regression] Spark History server doesn't support HDFS HA

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093716#comment-16093716 ] Maciej Bryński edited comment on SPARK-21470 at 7/19/17 8:0

[jira] [Closed] (SPARK-21470) [SPARK 2.2 Regression] Spark History server doesn't support HDFS HA

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński closed SPARK-21470. -- Resolution: Invalid > [SPARK 2.2 Regression] Spark History server doesn't support

[jira] [Commented] (SPARK-21470) [SPARK 2.2 Regression] Spark History server doesn't support HDFS HA

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093750#comment-16093750 ] Maciej Bryński commented on SPARK-21470: OK. I think I found the reason. T

[jira] [Comment Edited] (SPARK-21470) [SPARK 2.2 Regression] Spark History server doesn't support HDFS HA

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093750#comment-16093750 ] Maciej Bryński edited comment on SPARK-21470 at 7/19/17 9:0

[jira] [Updated] (SPARK-19743) Exception when creating more than one implicit Encoder in REPL

2017-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-19743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-19743: --- Affects Version/s: 2.1.1 2.2.0 > Exception when creating more t

[jira] [Created] (SPARK-21507) Exception when using spark.jars.packages

2017-07-22 Thread JIRA
Maciej Bryński created SPARK-21507: -- Summary: Exception when using spark.jars.packages Key: SPARK-21507 URL: https://issues.apache.org/jira/browse/SPARK-21507 Project: Spark Issue Type

[jira] [Updated] (SPARK-20712) [SPARK 2.1 REGRESSION][SQL] Spark can't read Hive table when column type has length greater than 4000 bytes

2017-07-23 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-20712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-20712: --- Summary: [SPARK 2.1 REGRESSION][SQL] Spark can't read Hive table when column type has l

[jira] [Updated] (SPARK-21507) Exception when using spark.jars.packages

2017-07-23 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21507: --- Description: When more than one process is using packages option it's possible to c

[jira] [Updated] (SPARK-21507) Exception when using spark.jars.packages

2017-07-23 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21507: --- Description: When more than one process is using packages option it's possible to c

[jira] [Commented] (SPARK-20392) Slow performance when calling fit on ML pipeline for dataset with many columns but few rows

2017-07-24 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16098612#comment-16098612 ] Maciej Bryński commented on SPARK-20392: Is it safe to merge it to 2.2 ?

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2017-07-25 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16099686#comment-16099686 ] Paul Magnus Sørensen-Clark commented on SPARK-12261: I ha

[jira] [Comment Edited] (SPARK-12261) pyspark crash for large dataset

2017-07-25 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16099686#comment-16099686 ] Paul Magnus Sørensen-Clark edited comment on SPARK-12261 at 7/25/17 8:1

[jira] [Created] (SPARK-21534) Exception when creating dataframe from python row with empty bytearray

2017-07-25 Thread JIRA
Maciej Bryński created SPARK-21534: -- Summary: Exception when creating dataframe from python row with empty bytearray Key: SPARK-21534 URL: https://issues.apache.org/jira/browse/SPARK-21534 Project

[jira] [Updated] (SPARK-21534) PickleException when creating dataframe from python row with empty bytearray

2017-07-25 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21534: --- Summary: PickleException when creating dataframe from python row with empty bytearray (was

[jira] [Created] (SPARK-21558) Kinesis lease failover time should be increased or made configurable

2017-07-28 Thread JIRA
Clément MATHIEU created SPARK-21558: --- Summary: Kinesis lease failover time should be increased or made configurable Key: SPARK-21558 URL: https://issues.apache.org/jira/browse/SPARK-21558 Project

[jira] [Commented] (SPARK-650) Add a "setup hook" API for running initialization code on each executor

2017-08-01 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16110006#comment-16110006 ] Michael Schmeißer commented on SPARK-650: - Please see my comment from 05/Dec/1

[jira] [Commented] (SPARK-9999) Dataset API on top of Catalyst/DataFrame

2016-06-29 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356633#comment-15356633 ] Maciej Bryński commented on SPARK-: --- [~rxin] What about Python API ? What&#

[jira] [Commented] (SPARK-13233) Python Dataset

2016-06-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-13233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356645#comment-15356645 ] Maciej Bryński commented on SPARK-13233: [~holdenk] Did you find out wh

[jira] [Commented] (SPARK-9999) Dataset API on top of Catalyst/DataFrame

2016-06-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356647#comment-15356647 ] Maciej Bryński commented on SPARK-: --- OK. So what about this patch ? h

[jira] [Commented] (SPARK-5151) Parquet Predicate Pushdown Does Not Work with Nested Structures.

2016-06-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-5151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356648#comment-15356648 ] Maciej Bryński commented on SPARK-5151: --- CC: [~michael] Found this. >

[jira] [Commented] (SPARK-16226) change the way of JDBC commit

2016-06-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356653#comment-15356653 ] Maciej Bryński commented on SPARK-16226: -1 for this patch. > change the

[jira] [Created] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-06-30 Thread JIRA
Maciej Bryński created SPARK-16320: -- Summary: Spark 2.0 slower than 1.6 when querying nested columns Key: SPARK-16320 URL: https://issues.apache.org/jira/browse/SPARK-16320 Project: Spark

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-06-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-06-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Created] (SPARK-16321) Pyspark 2.0 performance drop vs pyspark 1.6

2016-06-30 Thread JIRA
Maciej Bryński created SPARK-16321: -- Summary: Pyspark 2.0 performance drop vs pyspark 1.6 Key: SPARK-16321 URL: https://issues.apache.org/jira/browse/SPARK-16321 Project: Spark Issue Type

[jira] [Updated] (SPARK-16321) Pyspark 2.0 performance drop vs pyspark 1.6

2016-06-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16321: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-06-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15357213#comment-15357213 ] Maciej Bryński commented on SPARK-16320: OK. I'll try to confirm this

[jira] [Commented] (SPARK-16321) Pyspark 2.0 performance drop vs pyspark 1.6

2016-07-01 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15359702#comment-15359702 ] Maciej Bryński commented on SPARK-16321: [~zjffdu] This query executes in a

[jira] [Commented] (SPARK-16206) Defining our own folds using CrossValidator

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361737#comment-15361737 ] Rémi Delassus commented on SPARK-16206: --- >You can implement whatever you

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361781#comment-15361781 ] Maciej Bryński commented on SPARK-16320: [~rxin] I created benchmark script

[jira] [Comment Edited] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361781#comment-15361781 ] Maciej Bryński edited comment on SPARK-16320 at 7/4/16 9:1

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Priority: Critical (was: Major) > Spark 2.0 slower than 1.6 when querying nested colu

[jira] [Commented] (SPARK-13645) DAG Diagram not shown properly in Chrome

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-13645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361807#comment-15361807 ] Maciej Bryński commented on SPARK-13645: I have the same problem. > DAG

[jira] [Created] (SPARK-16371) IS NOT NULL clause gives false for nested column

2016-07-04 Thread JIRA
Maciej Bryński created SPARK-16371: -- Summary: IS NOT NULL clause gives false for nested column Key: SPARK-16371 URL: https://issues.apache.org/jira/browse/SPARK-16371 Project: Spark Issue

[jira] [Updated] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-04 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16371: --- Summary: IS NOT NULL clause gives false for nested not empty column (was: IS NOT NULL

[jira] [Commented] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362225#comment-15362225 ] Maciej Bryński commented on SPARK-16371: I tried your example and it'

[jira] [Comment Edited] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362225#comment-15362225 ] Maciej Bryński edited comment on SPARK-16371 at 7/5/16 8:5

[jira] [Comment Edited] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362225#comment-15362225 ] Maciej Bryński edited comment on SPARK-16371 at 7/5/16 9:0

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-07-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Description: I did some test on parquet file with many nested columns (about 30G in 400

[jira] [Commented] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362305#comment-15362305 ] Maciej Bryński commented on SPARK-16371: I forget to add import. I repair

[jira] [Comment Edited] (SPARK-16371) IS NOT NULL clause gives false for nested not empty column

2016-07-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15362305#comment-15362305 ] Maciej Bryński edited comment on SPARK-16371 at 7/5/16 2:1

[jira] [Created] (SPARK-16416) Logging in shutdown hook does not work properly with Log4j 2.x

2016-07-07 Thread JIRA
Mikael Ståldal created SPARK-16416: -- Summary: Logging in shutdown hook does not work properly with Log4j 2.x Key: SPARK-16416 URL: https://issues.apache.org/jira/browse/SPARK-16416 Project: Spark

[jira] [Commented] (SPARK-16416) Logging in shutdown hook does not work properly with Log4j 2.x

2016-07-08 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15367442#comment-15367442 ] Mikael Ståldal commented on SPARK-16416: Maybe just add a log statemen

[jira] [Comment Edited] (SPARK-16416) Logging in shutdown hook does not work properly with Log4j 2.x

2016-07-08 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15367442#comment-15367442 ] Mikael Ståldal edited comment on SPARK-16416 at 7/8/16 9:2

[jira] [Created] (SPARK-16439) Incorrect information on SQL Query details

2016-07-08 Thread JIRA
Maciej Bryński created SPARK-16439: -- Summary: Incorrect information on SQL Query details Key: SPARK-16439 URL: https://issues.apache.org/jira/browse/SPARK-16439 Project: Spark Issue Type

[jira] [Updated] (SPARK-16439) Incorrect information on SQL Query details

2016-07-08 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16439: --- Attachment: spark.jpg > Incorrect information on SQL Query deta

[jira] [Updated] (SPARK-16439) Incorrect information on SQL Query details

2016-07-08 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16439: --- Description: One picture is worth a thousand words. Please see attachment Incorrect values

[jira] [Updated] (SPARK-16439) Incorrect information in SQL Query details

2016-07-08 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16439: --- Summary: Incorrect information in SQL Query details (was: Incorrect information on SQL

[jira] [Commented] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370435#comment-15370435 ] Maciej Bryński commented on SPARK-16439: I think that the problem exist

[jira] [Commented] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370463#comment-15370463 ] Maciej Bryński commented on SPARK-16439: I'll try to prepare sth. Thi

[jira] [Commented] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370555#comment-15370555 ] Maciej Bryński commented on SPARK-16439: OK. Got this. Using spark-shell {

[jira] [Updated] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16439: --- Attachment: sample.png > Incorrect information in SQL Query deta

[jira] [Comment Edited] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370555#comment-15370555 ] Maciej Bryński edited comment on SPARK-16439 at 7/11/16 11:0

[jira] [Comment Edited] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370555#comment-15370555 ] Maciej Bryński edited comment on SPARK-16439 at 7/11/16 11:0

[jira] [Comment Edited] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370555#comment-15370555 ] Maciej Bryński edited comment on SPARK-16439 at 7/11/16 11:0

[jira] [Comment Edited] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370555#comment-15370555 ] Maciej Bryński edited comment on SPARK-16439 at 7/11/16 11:1

[jira] [Comment Edited] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370555#comment-15370555 ] Maciej Bryński edited comment on SPARK-16439 at 7/11/16 11:2

[jira] [Commented] (SPARK-16478) strongly connected components doesn't cache returned RDD

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370595#comment-15370595 ] Michał Wesołowski commented on SPARK-16478: --- If you run code that I prov

[jira] [Comment Edited] (SPARK-16478) strongly connected components doesn't cache returned RDD

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370595#comment-15370595 ] Michał Wesołowski edited comment on SPARK-16478 at 7/11/16 11:4

[jira] [Comment Edited] (SPARK-16478) strongly connected components doesn't cache returned RDD

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370595#comment-15370595 ] Michał Wesołowski edited comment on SPARK-16478 at 7/11/16 11:4

[jira] [Commented] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370647#comment-15370647 ] Maciej Bryński commented on SPARK-16439: [~proflin] I don't know if w

[jira] [Commented] (SPARK-16439) Incorrect information in SQL Query details

2016-07-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15371780#comment-15371780 ] Maciej Bryński commented on SPARK-16439: I found that problem is lo

[jira] [Commented] (SPARK-13645) DAG Diagram not shown properly in Chrome

2016-07-14 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-13645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15377022#comment-15377022 ] Maciej Bryński commented on SPARK-13645: I think that problem is resolve

[jira] [Commented] (SPARK-14234) Executor crashes for TaskRunner thread interruption

2016-07-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-14234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15379041#comment-15379041 ] Josef Lindman Hörnlund commented on SPARK-14234: +1 for backpor

[jira] [Created] (SPARK-16569) Use Cython in Pyspark internals

2016-07-15 Thread JIRA
Maciej Bryński created SPARK-16569: -- Summary: Use Cython in Pyspark internals Key: SPARK-16569 URL: https://issues.apache.org/jira/browse/SPARK-16569 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-16569) Use Cython in Pyspark internals

2016-07-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16569: --- Description: CC: [~davies] Many operations I do are like: {code} dataframe.rdd.map

[jira] [Updated] (SPARK-16569) Use Cython to speed up Pyspark internals

2016-07-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16569: --- Summary: Use Cython to speed up Pyspark internals (was: Use Cython in Pyspark internals

[jira] [Created] (SPARK-16571) DataFrame repartition leads to unexpected error during shuffle

2016-07-15 Thread JIRA
Björn-Elmar Macek created SPARK-16571: - Summary: DataFrame repartition leads to unexpected error during shuffle Key: SPARK-16571 URL: https://issues.apache.org/jira/browse/SPARK-16571 Project

[jira] [Updated] (SPARK-16571) DataFrame repartition leads to unexpected error during shuffle

2016-07-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Björn-Elmar Macek updated SPARK-16571: -- Description: When executing the following code, an exception is thrown. {code} val

[jira] [Commented] (SPARK-5484) Pregel should checkpoint periodically to avoid StackOverflowError

2016-07-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-5484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15380653#comment-15380653 ] Michał Wesołowski commented on SPARK-5484: -- [~ankurd] do you plan to pre

[jira] [Comment Edited] (SPARK-5484) Pregel should checkpoint periodically to avoid StackOverflowError

2016-07-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-5484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15380653#comment-15380653 ] Michał Wesołowski edited comment on SPARK-5484 at 7/16/16 9:3

  1   2   3   4   5   6   7   8   9   10   >