[ 
https://issues.apache.org/jira/browse/ANY23-18?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13186888#comment-13186888
 ] 

Michele Mostarda commented on ANY23-18:
---------------------------------------

Ciao Paolo, it wasn't me who did the java-rdfa evaluation but a colleague on 
the Sindice team. 
I have only just seen the report. It seems nested items were handled incorrectly.

In general, since the RDFa 1.1 specification is still open (draft), we 
preferred to write an in-house programmatic RDFa parser in order to: 
 - keep control over how the specification is supported as it evolves;
 - be able to add non-standard features;
 - keep control over the performance of the extraction algorithm.
                
> Add a new extractor for RDFa using java-rdfa
> --------------------------------------------
>
>                 Key: ANY23-18
>                 URL: https://issues.apache.org/jira/browse/ANY23-18
>             Project: Apache Any23
>          Issue Type: Improvement
>            Reporter: Paolo Castagna
>            Assignee: Michele Mostarda
>            Priority: Minor
>              Labels: rdfa
>   Original Estimate: 8h
>  Remaining Estimate: 8h
>
> I wonder if it is possible to add a new RDFa extractor which uses java-rdfa 
> [1].
> java-rdfa is (according to its creator, Damian Steer :-)) "the cruftiest RDFa 
> parser in the world" (and he is probably right!). java-rdfa is currently 
> passing all conformance tests for XHTML, and the HTML 4 and 5 tests with one 
> exception [2]. An online demo service [3] is also available. java-rdfa, as 
> far as I understand, is currently licensed under a BSD license. The Maven 
> artifacts are available in the Maven central repository [4].
> From my little understanding of Any23, in order to do this one needs to 
> implement BlindExtractor (which extends Extractor<URI>) and ContentExtractor 
> (which extends Extractor<InputStream>).
> See also: [5].
>  [1] https://github.com/shellac/java-rdfa
>  [2] http://github.com/shellac/java-rdfa/issues#issue/15
>  [3] http://rdf-in-html.appspot.com/
>  [4] http://repo1.maven.org/maven2/net/rootdev/java-rdfa/
>  [5] https://github.com/shellac/java-rdfa/issues/35

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
