[Spark XML] Reading an XML after setting ignoreNamespace option leads to an empty DataFrame in pyspark 4.1.1

Johannes Bock Wed, 27 May 2026 08:29:44 -0700

Hi Spark community,

after upgrading to pyspark 4.1.1 from 4.0.1 I'm experiencing issues with the XMLimport:

If I'm setting the ignoreNamespace option either to false or true and the xmlfile contains namespaces in the tags, an empty dataframe is returned. I createda minimal working example underhttps://gist.github.com/bockj/cf27c7c6fd1b7c26db14fef8b9ade6b0 .

The option is documented underhttps://spark.apache.org/docs/4.1.2/sql-data-sources-xml.html . Am I doingsomething wrong?


Thank you for your help.

Kind gerads
Johannes


---------------------------------------------------------------------
To unsubscribe e-mail: [email protected]

[Spark XML] Reading an XML after setting ignoreNamespace option leads to an empty DataFrame in pyspark 4.1.1

Reply via email to