[
https://issues.apache.org/jira/browse/YETUS-457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Wang updated YETUS-457:
------------------------------
Attachment: YETUS-457.001.patch
Patch attached. Characters caught include quotes, angle brackets, and $. I
tested it by generating the Hadoop 3.0.0-alpha1 changelog and converting it to
HTML with pandoc; the output now looks properly formatted.
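
For illustration only, here is a minimal sketch of what escaping those
characters could look like. It is not the code in YETUS-457.001.patch; the
names ENTITY_MAP and escape_entities are hypothetical, and the choice of HTML
entities as the escape form is an assumption.

```python
# Hypothetical sketch, not the attached patch: map each troublesome
# character to an HTML entity so it survives markdown-to-HTML conversion.
ENTITY_MAP = {
    "&": "&amp;",
    "<": "&lt;",
    ">": "&gt;",
    '"': "&quot;",
    "'": "&#39;",
    "$": "&#36;",
}


def escape_entities(text):
    """Replace every character found in ENTITY_MAP with its entity.

    Working character by character avoids double-escaping the '&'
    that the replacements themselves introduce.
    """
    return "".join(ENTITY_MAP.get(ch, ch) for ch in text)


print(escape_entities('Fix "foo" for <T> costing $5'))
# prints: Fix &quot;foo&quot; for &lt;T&gt; costing &#36;5
```

A single pass over the input keeps the function safe to reason about, although
calling it twice on the same string would still escape the text twice.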
A few questions though:
* markdown_sanitize isn't used outside of utils.py and doesn't do what I
consider markdown sanitization. Should we just inline it into text_sanitize?
At a minimum, renaming it seems appropriate.
* I don't follow the comment about additional Doxia escaping. Does this mean
additional escaping for the Doxia flavor of markdown, or for Doxia's apt
format? I added the apt escaping since it seems harmless and is very similar to
markdown's backslash escaping, but ideally we would handle escaping for
different formats with different methods.
* I would prefer to do all this with a third-party escaping library, but we
have some unusual escaping requirements, so I'm not sure one would fit.
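
To make the "different methods for different formats" idea concrete, here is a
rough sketch. Everything in it is hypothetical: escape_markdown, escape_apt,
ESCAPERS, and escape_for are not existing functions in utils.py, and both
character sets are assumptions that would need checking against the markdown
flavor and the Doxia apt documentation.

```python
import re

# Assumed markdown metacharacters; the right set depends on the flavor.
_MARKDOWN_SPECIALS = re.compile(r'([\\`*_{}\[\]<>#|$"])')

# Assumed apt metacharacters; verify against the Doxia apt reference.
_APT_SPECIALS = re.compile(r'([\\~=\-+*\[\]<>{}])')


def escape_markdown(text):
    """Backslash-escape the assumed markdown metacharacters."""
    return _MARKDOWN_SPECIALS.sub(r"\\\1", text)


def escape_apt(text):
    """Backslash-escape the assumed apt metacharacters."""
    return _APT_SPECIALS.sub(r"\\\1", text)


# One escaper per output format, selected by name.
ESCAPERS = {
    "markdown": escape_markdown,
    "apt": escape_apt,
}


def escape_for(fmt, text):
    """Escape text for the named output format."""
    return ESCAPERS[fmt](text)


print(escape_for("markdown", "use *args* here"))
print(escape_for("apt", "a~b"))
```

Keeping one function per format means a change to the apt rules cannot alter
the markdown output by accident, and an unknown format name fails loudly with a
KeyError instead of silently skipping the escaping.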
> RDM does not properly escape entities
> -------------------------------------
>
> Key: YETUS-457
> URL: https://issues.apache.org/jira/browse/YETUS-457
> Project: Yetus
> Issue Type: Bug
> Affects Versions: 0.3.0
> Reporter: Andrew Wang
> Assignee: Andrew Wang
> Priority: Critical
> Attachments: YETUS-457.001.patch
>
>
> Noticed while browsing the Hadoop 3.0.0-alpha1 changelog. Quotes and possibly
> some other entities are not escaped properly, leading to malformed markdown
> output.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)