[
https://issues.apache.org/jira/browse/SPARK-47218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yousof Hosny updated SPARK-47218:
---------------------------------
Description:
The following returns rows that were inside an XML comment:
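{code:scala}
// BUG: the <ROW> inside the XML comment is incorrectly returned as a row
// (write is the reporter's helper; presumably it saves the string to a file and returns the path)
display(spark.read.xml(write("""<root> <ROW><a>1</a></ROW>
<!--<ROW><a>2</a></ROW>--> </root>""")))
{code}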
This has been reported before: [How to Ignore XML comments like this · Issue #208 · databricks/spark-xml|https://github.com/databricks/spark-xml/issues/208]
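For illustration only, here is a minimal standalone sketch of the failure mode, written in plain Python rather than against Spark. It does not reproduce Spark's XML reader; it only assumes that a row locator which matches the rowTag as plain text would behave like the hypothetical rows_by_naive_scan below, while a comment-aware parser would behave like the hypothetical rows_by_parser. Both function names are made up for this example.

```python
import re
import xml.etree.ElementTree as ET

# Same document as in the report: one real row and one row inside a comment.
XML_DOC = """<root> <ROW><a>1</a></ROW>
<!--<ROW><a>2</a></ROW>--> </root>"""


def rows_by_naive_scan(text, row_tag="ROW"):
    """Locate rows by plain text matching; this also matches inside comments."""
    pattern = re.compile(
        r"<{0}>(.*?)</{0}>".format(re.escape(row_tag)), re.DOTALL
    )
    return pattern.findall(text)


def rows_by_parser(text, row_tag="ROW"):
    """Locate rows with an XML parser; ElementTree drops comments by default."""
    root = ET.fromstring(text)
    return [
        {child.tag: child.text for child in row}
        for row in root.iter(row_tag)
    ]


naive_rows = rows_by_naive_scan(XML_DOC)
parsed_rows = rows_by_parser(XML_DOC)

print(len(naive_rows))   # number of rows found by text matching
print(parsed_rows)       # rows found by the comment-aware parser
```

The text scan finds two rows because it cannot tell that the second <ROW> sits between <!-- and -->, whereas the parser yields only the row with a = 1, which is the result this issue asks for.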
was:
The following returns rows that were inside an XML comment:
```
{{// BUG: rowTag in comment -- incorrectly processed
display(spark.read.xml(write("""<root> <ROW><a>1</a></ROW>
<!--<ROW><a>2</a></ROW>--> </root>""")))}}
```
This has been reported before: [How to Ignore XML comments like this · Issue #208 · databricks/spark-xml|https://github.com/databricks/spark-xml/issues/208]
> XML: Skip rowTag in a comment
> -----------------------------
>
> Key: SPARK-47218
> URL: https://issues.apache.org/jira/browse/SPARK-47218
> Project: Spark
> Issue Type: Sub-task
> Components: Spark Core
> Affects Versions: 4.0.0
> Reporter: Yousof Hosny
> Priority: Major
>
> The following returns rows that were inside an XML comment:
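> {code:scala}
> // BUG: the <ROW> inside the XML comment is incorrectly returned as a row
> // (write is the reporter's helper; presumably it saves the string to a file and returns the path)
> display(spark.read.xml(write("""<root> <ROW><a>1</a></ROW>
> <!--<ROW><a>2</a></ROW>--> </root>""")))
> {code}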
> This has been reported before: [How to Ignore XML comments like this · Issue #208 · databricks/spark-xml|https://github.com/databricks/spark-xml/issues/208]
--
This message was sent by Atlassian Jira
(v8.20.10#820010)