Eric Liang created SPARK-20450:
----------------------------------
Summary: Unexpected first-query schema inference cost with 2.1.1 RC
Key: SPARK-20450
URL: https://issues.apache.org/jira/browse/SPARK-20450
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 2.1.1
Reporter: Eric Liang
Priority: Blocker
https://issues.apache.org/jira/browse/SPARK-19611 fixes a regression from 2.0
where Spark silently fails to read case-sensitive fields missing a
case-sensitive schema in the table properties. The fix is to detect this
situation, infer the schema, and write the case-sensitive schema into the
metastore.
However this can incur an unexpected performance hit the first time such a
problematic table is queried (and there is a high false-positive rate here
since most tables don't actually have case-sensitive fields).
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]