Hi Spark community,

after upgrading to pyspark 4.1.1 from 4.0.1 I'm experiencing issues with the XML import:

If I'm setting the ignoreNamespace option either to false or true and the xml file contains namespaces in the tags, an empty dataframe is returned. I created a minimal working example under https://gist.github.com/bockj/cf27c7c6fd1b7c26db14fef8b9ade6b0 .

The option is documented under https://spark.apache.org/docs/4.1.2/sql-data-sources-xml.html . Am I doing something wrong?

Thank you for your help.

Kind gerads
Johannes


---------------------------------------------------------------------
To unsubscribe e-mail: [email protected]

Reply via email to