[jira] [Updated] (SPARK-35511) Spark computes all rows during count() on a parquet file

2021-05-25 Thread Ivan Tsukanov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-35511: -- Description: We expect spark uses parquet metadata to fetch the rows count of a parquet

[jira] [Created] (SPARK-35511) Spark computes all rows during count() on a parquet file

2021-05-25 Thread Ivan Tsukanov (Jira)
Ivan Tsukanov created SPARK-35511: - Summary: Spark computes all rows during count() on a parquet file Key: SPARK-35511 URL: https://issues.apache.org/jira/browse/SPARK-35511 Project: Spark

[jira] [Updated] (SPARK-32758) Spark ignores limit(1) and starts tasks for all partition

2020-08-31 Thread Ivan Tsukanov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-32758: -- Description: If we run the following code {code:scala} val sparkConf = new SparkConf()

[jira] [Updated] (SPARK-32758) Spark ignores limit(1) and starts tasks for all partition

2020-08-31 Thread Ivan Tsukanov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-32758: -- Environment: (was:     должен  ) > Spark ignores limit(1) and starts tasks for all

[jira] [Updated] (SPARK-32758) Spark ignores limit(1) and starts tasks for all partition

2020-08-31 Thread Ivan Tsukanov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-32758: -- Description: If we run the following code {code:scala} val sparkConf = new SparkConf()

[jira] [Updated] (SPARK-32758) Spark ignores limit(1) and starts tasks for all partition

2020-08-31 Thread Ivan Tsukanov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-32758: -- Attachment: image-2020-09-01-10-51-09-417.png > Spark ignores limit(1) and starts tasks for

[jira] [Created] (SPARK-32758) Spark ignores limit(1) and starts tasks for all partition

2020-08-31 Thread Ivan Tsukanov (Jira)
Ivan Tsukanov created SPARK-32758: - Summary: Spark ignores limit(1) and starts tasks for all partition Key: SPARK-32758 URL: https://issues.apache.org/jira/browse/SPARK-32758 Project: Spark

[jira] [Updated] (SPARK-28742) StackOverflowError when using otherwise(col()) in a loop

2019-08-15 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-28742: -- Description: The following code {code:java} val rdd = sparkContext.makeRDD(Seq(Row("1"))) val

[jira] [Updated] (SPARK-28742) StackOverflowError when using otherwise(col()) in a loop

2019-08-15 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-28742: -- Description: The following code {code:java} val rdd = sparkContext.makeRDD(Seq(Row("1"))) val

[jira] [Updated] (SPARK-28742) StackOverflowError when using otherwise(col()) in a loop

2019-08-15 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-28742: -- Description: The following code {code:java} val rdd = sparkContext.makeRDD(Seq(Row("1"))) val

[jira] [Updated] (SPARK-28742) StackOverflowError when using otherwise(col()) in a loop

2019-08-15 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-28742: -- Description: The following code {code:java} val rdd = sparkContext.makeRDD(Seq(Row("1"))) val

[jira] [Updated] (SPARK-28742) StackOverflowError when using otherwise(col()) in a loop

2019-08-15 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-28742: -- Attachment: image-2019-08-15-15-19-33-319.png > StackOverflowError when using

[jira] [Updated] (SPARK-28742) StackOverflowError when using otherwise(col()) in a loop

2019-08-15 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-28742: -- Attachment: (was: image-2019-08-15-15-19-33-319.png) > StackOverflowError when using

[jira] [Created] (SPARK-28742) StackOverflowError when using otherwise(col()) in a loop

2019-08-15 Thread Ivan Tsukanov (JIRA)
Ivan Tsukanov created SPARK-28742: - Summary: StackOverflowError when using otherwise(col()) in a loop Key: SPARK-28742 URL: https://issues.apache.org/jira/browse/SPARK-28742 Project: Spark

[jira] [Commented] (SPARK-28480) Types of input parameters of a UDF affect the ability to cache the result

2019-07-23 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16890694#comment-16890694 ] Ivan Tsukanov commented on SPARK-28480: --- ok, let's close the ticket. [~shivuson...@gmail.com],

[jira] [Updated] (SPARK-28480) Types of input parameters of a UDF affect the ability to cache the result

2019-07-22 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-28480: -- Fix Version/s: 2.4.3 > Types of input parameters of a UDF affect the ability to cache the

[jira] [Created] (SPARK-28480) Types of input parameters of a UDF affect the ability to cache the result

2019-07-22 Thread Ivan Tsukanov (JIRA)
Ivan Tsukanov created SPARK-28480: - Summary: Types of input parameters of a UDF affect the ability to cache the result Key: SPARK-28480 URL: https://issues.apache.org/jira/browse/SPARK-28480 Project:

[jira] [Updated] (SPARK-28480) Types of input parameters of a UDF affect the ability to cache the result

2019-07-22 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-28480: -- Description: When I define a parameter in a UDF as Boolean or Int the result DataFrame can't

[jira] [Updated] (SPARK-25987) StackOverflowError when executing many operations on a table with many columns

2018-11-08 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-25987: -- Description: When I execute {code:java} val columnsCount = 100 val columns = (1 to

[jira] [Updated] (SPARK-25987) StackOverflowError when executing many operations on a table with many columns

2018-11-08 Thread Ivan Tsukanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Tsukanov updated SPARK-25987: -- Description: When I execute {code:java} val columnsCount = 100 val columns = (1 to

[jira] [Created] (SPARK-25987) StackOverflowError when executing many operations on a table with many columns

2018-11-08 Thread Ivan Tsukanov (JIRA)
Ivan Tsukanov created SPARK-25987: - Summary: StackOverflowError when executing many operations on a table with many columns Key: SPARK-25987 URL: https://issues.apache.org/jira/browse/SPARK-25987