Hi Spark community,
after upgrading to pyspark 4.1.1 from 4.0.1 I'm experiencing issues with the XML
import:
If I'm setting the ignoreNamespace option either to false or true and the xml
file contains namespaces in the tags, an empty dataframe is returned. I created
a minimal working example under
https://gist.github.com/bockj/cf27c7c6fd1b7c26db14fef8b9ade6b0 .
The option is documented under
https://spark.apache.org/docs/4.1.2/sql-data-sources-xml.html . Am I doing
something wrong?
Thank you for your help.
Kind gerads
Johannes
---------------------------------------------------------------------
To unsubscribe e-mail: [email protected]