[
https://issues.apache.org/jira/browse/ANY23-328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16374668#comment-16374668
]
Hudson commented on ANY23-328:
------------------------------
SUCCESS: Integrated in Jenkins build Any23-trunk #1540 (See
[https://builds.apache.org/job/Any23-trunk/1540/])
ANY23-328 Strip comments from json-ld content to make parsing more (hans: rev
189bf260e74436860054469fde8192531cce6f14)
* (edit)
core/src/main/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractor.java
* (edit) core/src/main/java/org/apache/any23/extractor/rdf/BaseRDFExtractor.java
* (add) test-resources/src/test/resources/html/html-jsonld-strip-comments.html
* (edit)
core/src/test/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractorTest.java
> Problem parsing json-ld content surrounded by comments
> ------------------------------------------------------
>
> Key: ANY23-328
> URL: https://issues.apache.org/jira/browse/ANY23-328
> Project: Apache Any23
> Issue Type: Bug
> Components: core
> Affects Versions: 2.1
> Reporter: Hans Brende
> Assignee: Hans Brende
> Priority: Major
> Fix For: 2.2
>
>
> Sometimes in json-ld script blocks (e.g., on https://www.guthriegreen.com),
> you will see
> /*<![CDATA[*/
> ...json-ld content...
> /*]]>*/
> or
> //<![CDATA[
> ...json-ld content...
> //]]>
>
> Currently we are stripping CDATA markers, but we are not stripping leading &
> trailing comments, which will cause json-ld parsing to fail. This may be
> related to issue #17.
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)