[
https://issues.apache.org/jira/browse/ANY23-67?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621073#comment-16621073
]
Lewis John McGibbney commented on ANY23-67:
-------------------------------------------
Yes, this requires some TLC for sure and there is nothing else for it other
than for someone to stand up and take on the challenge.
Prior to this, I would suggest that we see how the current extraction is
working for the given W3C test suite -
http://w3c.github.io/microdata-rdf/tests/. It would be optimal if this could be
automated in our integration tests such that it can be easily implemented when
we create the branch for updating the extraction algorithm.
On a somewhat related note, regarding extraction noise, I would like to
refactor use of http://vocab.sindice.net/any23# as much as possible moving
forward. The namespace refers to a end of life service which is where Any23
stemmed from however this is a separate issue.
> Microdata extraction using obsolete RDF conversion scheme
> ---------------------------------------------------------
>
> Key: ANY23-67
> URL: https://issues.apache.org/jira/browse/ANY23-67
> Project: Apache Any23
> Issue Type: Bug
> Components: microdata
> Affects Versions: 0.7.0
> Reporter: Hannes Mühleisen
> Priority: Major
> Fix For: 2.3
>
>
> There is now a more-or-less final Microdata to RDF algorithm published[1]
> which is different than the one in the current, official HTML5 draft [2]
> (that Ian Hickson has publicly revoked). However, Any23s extractor uses the
> old scheme according to a comment in its source code, which refers to [2].
> However, this is exactly the algorithm that Ian Hickson rescinded at some
> point. Unfortunately, the official working drafts have not been updated for a
> very long time, but if you look at the editor's draft [3], you will see that
> that section has been entirely removed. Instead, there was a Semantic Web
> Interest group task force that discussed the issues, and [1] is the result of
> this discussion. It would be nice if this would be reflected in Any23 in the
> future.
> [Condensed from an E-Mail conversation with Ivan Herman]
> [1] http://www.w3.org/TR/microdata-rdf/
> [2] http://www.w3.org/TR/microdata/#rdf
> [3] http://dev.w3.org/html5/md/Overview.html
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)