[ 
https://issues.apache.org/jira/browse/YETUS-457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Wang updated YETUS-457:
------------------------------
    Attachment: YETUS-457.001.patch

Patch attached. Characters caught include quotes, angle brackets, and $. Tested 
by generating the Hadoop 3.0.0-alpha1 changelog and converting to HTML with 
pandoc, which now looks properly formatted.

A few questions though:

* markdown_sanitize isn't used outside of utils.py and doesn't do what I 
consider markdown sanitization. Should we just inline it into text_sanitize? 
Renaming would at least be appropriate.
* I don't follow the comment about additional Doxia escaping. Does this mean 
additional escaping for the Doxia flavor of markdown, or for Doxia's apt 
format? I added the apt escaping since it seems harmless and is very similar to 
markdown's slash escaping, but ideally we handle escaping for different formats 
with different methods.
* I want to do all this with some third-party escaping library, but we have 
some weird escaping requirements. Not sure.

> RDM does not properly escape entities
> -------------------------------------
>
>                 Key: YETUS-457
>                 URL: https://issues.apache.org/jira/browse/YETUS-457
>             Project: Yetus
>          Issue Type: Bug
>    Affects Versions: 0.3.0
>            Reporter: Andrew Wang
>            Assignee: Andrew Wang
>            Priority: Critical
>         Attachments: YETUS-457.001.patch
>
>
> Noticed while browsing the Hadoop 3.0.0-alpha1 changelog. Quotes and possibly 
> some other entities are not escaped properly, leading to malformed markdown 
> output.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to