[jira] [Commented] (SPARK-24959) Do not invoke the CSV/JSON parser for empty schema

2019-01-30 Thread Apache Spark (JIRA)


[ 
https://issues.apache.org/jira/browse/SPARK-24959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16756983#comment-16756983
 ] 

Apache Spark commented on SPARK-24959:
--

User 'HyukjinKwon' has created a pull request for this issue:
https://github.com/apache/spark/pull/23708

> Do not invoke the CSV/JSON parser for empty schema
> --
>
> Key: SPARK-24959
> URL: https://issues.apache.org/jira/browse/SPARK-24959
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 2.3.1
>Reporter: Maxim Gekk
>Priority: Major
> Fix For: 2.4.0
>
>
> Currently JSON and CSV parsers are called even if required schema is empty. 
> Invoking the parser per each line has some non-zero overhead. The action can 
> be skipped. Such optimization should speed up count(), for example.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-24959) Do not invoke the CSV/JSON parser for empty schema

2019-01-29 Thread Sean Owen (JIRA)


[ 
https://issues.apache.org/jira/browse/SPARK-24959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16755088#comment-16755088
 ] 

Sean Owen commented on SPARK-24959:
---

Looks like this may need to be reverted: 
https://issues.apache.org/jira/browse/SPARK-26745

> Do not invoke the CSV/JSON parser for empty schema
> --
>
> Key: SPARK-24959
> URL: https://issues.apache.org/jira/browse/SPARK-24959
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 2.3.1
>Reporter: Maxim Gekk
>Assignee: Maxim Gekk
>Priority: Major
> Fix For: 2.4.0
>
>
> Currently JSON and CSV parsers are called even if required schema is empty. 
> Invoking the parser per each line has some non-zero overhead. The action can 
> be skipped. Such optimization should speed up count(), for example.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-24959) Do not invoke the CSV/JSON parser for empty schema

2019-01-27 Thread Apache Spark (JIRA)


[ 
https://issues.apache.org/jira/browse/SPARK-24959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16753697#comment-16753697
 ] 

Apache Spark commented on SPARK-24959:
--

User 'HyukjinKwon' has created a pull request for this issue:
https://github.com/apache/spark/pull/23667

> Do not invoke the CSV/JSON parser for empty schema
> --
>
> Key: SPARK-24959
> URL: https://issues.apache.org/jira/browse/SPARK-24959
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 2.3.1
>Reporter: Maxim Gekk
>Assignee: Maxim Gekk
>Priority: Major
> Fix For: 2.4.0
>
>
> Currently JSON and CSV parsers are called even if required schema is empty. 
> Invoking the parser per each line has some non-zero overhead. The action can 
> be skipped. Such optimization should speed up count(), for example.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-24959) Do not invoke the CSV/JSON parser for empty schema

2018-11-01 Thread Apache Spark (JIRA)


[ 
https://issues.apache.org/jira/browse/SPARK-24959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16671553#comment-16671553
 ] 

Apache Spark commented on SPARK-24959:
--

User 'MaxGekk' has created a pull request for this issue:
https://github.com/apache/spark/pull/22920

> Do not invoke the CSV/JSON parser for empty schema
> --
>
> Key: SPARK-24959
> URL: https://issues.apache.org/jira/browse/SPARK-24959
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 2.3.1
>Reporter: Maxim Gekk
>Assignee: Maxim Gekk
>Priority: Major
> Fix For: 2.4.0
>
>
> Currently JSON and CSV parsers are called even if required schema is empty. 
> Invoking the parser per each line has some non-zero overhead. The action can 
> be skipped. Such optimization should speed up count(), for example.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-24959) Do not invoke the CSV/JSON parser for empty schema

2018-07-28 Thread Apache Spark (JIRA)


[ 
https://issues.apache.org/jira/browse/SPARK-24959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16560797#comment-16560797
 ] 

Apache Spark commented on SPARK-24959:
--

User 'MaxGekk' has created a pull request for this issue:
https://github.com/apache/spark/pull/21909

> Do not invoke the CSV/JSON parser for empty schema
> --
>
> Key: SPARK-24959
> URL: https://issues.apache.org/jira/browse/SPARK-24959
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 2.3.1
>Reporter: Maxim Gekk
>Priority: Minor
>
> Currently JSON and CSV parsers are called even if required schema is empty. 
> Invoking the parser per each line has some non-zero overhead. The action can 
> be skipped. Such optimization should speed up count(), for example.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org