srowen opened a new pull request, #42844:
URL: https://github.com/apache/spark/pull/42844

   ### What changes were proposed in this pull request?
   
   Partial back-port of 
https://github.com/databricks/spark-xml/commit/994e357f7666956b5d0e63627716b2c092d9abbd?diff=split
 from spark-xml
   
   ### Why are the changes needed?
   
   Though no more development was intended on spark-xml, there was a 
non-trivial improvement to inference speed that I committed anyway to resolve a 
customer issue. Part of it can be 'backported' here to sync the code. I 
attached this as a follow-up to the main code port JIRA.
   
   There is still, in general, no intent to commit more to spark-xml in the 
meantime unless it's significantly important.
   
   
   ### Does this PR introduce _any_ user-facing change?
   
   No, this should only speed up schema inference without behavior change.
   
   ### How was this patch tested?
   
   Tested in spark-xml, and will be tested by tests here too


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to