[ https://issues.apache.org/jira/browse/CONNECTORS-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446104#comment-17446104 ]
Julien Massiera commented on CONNECTORS-1679: --------------------------------------------- Patch submitted > HTML Extractor: output has escaped entities > ------------------------------------------- > > Key: CONNECTORS-1679 > URL: https://issues.apache.org/jira/browse/CONNECTORS-1679 > Project: ManifoldCF > Issue Type: Bug > Components: HTML extractor > Affects Versions: ManifoldCF 2.20 > Reporter: Julien Massiera > Priority: Major > Fix For: ManifoldCF 2.21 > > Attachments: patch-CONNECTORS-1679.txt > > > The output of the HTML extractor is generated with escaped entities (eg '&' > becomes '&'), which is not the wanted behavior as we want this connector > to extract text from HTML as it is -- This message was sent by Atlassian Jira (v8.20.1#820001)