[ 
https://issues.apache.org/jira/browse/ANY23-67?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621073#comment-16621073
 ] 

Lewis John McGibbney commented on ANY23-67:
-------------------------------------------

Yes, this needs some TLC for sure, and there is nothing for it but for 
someone to step up and take on the challenge. 

Before that, I would suggest we check how the current extraction performs 
against the W3C test suite at http://w3c.github.io/microdata-rdf/tests/. 
Ideally this would be automated in our integration tests, so that the same 
checks can easily be run when we create the branch for updating the 
extraction algorithm.

On a somewhat related note, regarding extraction noise, I would like to 
refactor out use of http://vocab.sindice.net/any23# as much as possible going 
forward. The namespace refers to an end-of-life service, which is where Any23 
stemmed from; however, this is a separate issue. 
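
To give a rough idea of how that noise could be measured before any 
refactoring, here is a minimal sketch in Python (chosen only for brevity, it is 
not Any23 code). It assumes the extractor output has already been parsed into 
plain (subject, predicate, object) string tuples. The function name and the 
sample property "exampleProperty" are hypothetical, made up for illustration.

```python
from typing import Iterable, List, Tuple

Triple = Tuple[str, str, str]

# Namespace mentioned above; everything else in this sketch is an assumption.
ANY23_NS = "http://vocab.sindice.net/any23#"


def partition_by_predicate_namespace(
    triples: Iterable[Triple], namespace: str = ANY23_NS
) -> Tuple[List[Triple], List[Triple]]:
    """Split triples into (in_namespace, others) based on the predicate IRI."""
    in_namespace: List[Triple] = []
    others: List[Triple] = []
    for triple in triples:
        _, predicate, _ = triple
        if predicate.startswith(namespace):
            in_namespace.append(triple)
        else:
            others.append(triple)
    return in_namespace, others


if __name__ == "__main__":
    # Hypothetical sample data, not real extractor output.
    sample = [
        ("_:item", ANY23_NS + "exampleProperty", "some value"),
        ("_:item", "http://schema.org/name", "Some name"),
    ]
    noise, kept = partition_by_predicate_namespace(sample)
    print(len(noise), "triple(s) use the any23 namespace;", len(kept), "do not")
```

Counting the first list per test document would show where the namespace is 
still being emitted, without deciding yet what should replace it.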

> Microdata extraction using obsolete RDF conversion scheme
> ---------------------------------------------------------
>
>                 Key: ANY23-67
>                 URL: https://issues.apache.org/jira/browse/ANY23-67
>             Project: Apache Any23
>          Issue Type: Bug
>          Components: microdata
>    Affects Versions: 0.7.0
>            Reporter: Hannes Mühleisen
>            Priority: Major
>             Fix For: 2.3
>
>
> There is now a more-or-less final Microdata to RDF algorithm published[1] 
> which is different than the one in the current, official HTML5 draft [2] 
> (that Ian Hickson has publicly revoked). However, Any23's extractor uses the 
> old scheme according to a comment in its source code, which refers to [2], 
> and this is exactly the algorithm that Ian Hickson rescinded at some 
> point. Unfortunately, the official working drafts have not been updated for a 
> very long time, but if you look at the editor's draft [3], you will see that 
> that section has been entirely removed. Instead, there was a Semantic Web 
> Interest group task force that discussed the issues, and [1] is the result of 
> this discussion. It would be nice if this would be reflected in Any23 in the 
> future.
> [Condensed from an E-Mail conversation with Ivan Herman]
> [1] http://www.w3.org/TR/microdata-rdf/
> [2] http://www.w3.org/TR/microdata/#rdf
> [3] http://dev.w3.org/html5/md/Overview.html



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
