Github user lewismc commented on the issue:
https://github.com/apache/any23/pull/49
In the case where one encounters a parse and/or extraction error hence an
unsuccessful extraction, one would see the following result.
As you can see, the partial extraction is now included at the bottom of the
servlet response which is much better e.g. more forgiving, than a plain stack
trace and error message.
```
Failed to fully parse input. The extraction result, at the bottom of this
response, if any, will contain extractions only up until the extraction error.
================================================================
------------ BEGIN Exception context ------------
ExtractionContext(urn:x-any23:html-rdfa11:root-extraction-result-id:http://any23.apache.org/)
Errors {
ERROR: 'The entity "copy" was referenced, but not declared.' (-1,-1)
}
------------ END Exception context ------------
org.apache.any23.extractor.ExtractionException: Error while parsing RDF
document.
at
org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:109)
at
org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:41)
at
org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:467)
at
org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:256)
at org.apache.any23.Any23.extract(Any23.java:300)
at org.apache.any23.Any23.extract(Any23.java:452)
at
org.apache.any23.servlet.WebResponder.runExtraction(WebResponder.java:117)
at org.apache.any23.servlet.Servlet.doGet(Servlet.java:82)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:624)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:731)
at
org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:303)
at
org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
at
org.apache.tomcat.websocket.server.WsFilter.doFilter(WsFilter.java:52)
at
org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:241)
at
org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
at
org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:218)
at
org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:122)
at
org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:505)
at
org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:169)
at
org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:103)
at
org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:956)
at
org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:116)
at
org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:442)
at
org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1083)
at
org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:640)
at
org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:318)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at
org.apache.tomcat.util.threads.TaskThread$WrappingRunnable.run(TaskThread.java:61)
at java.lang.Thread.run(Thread.java:748)
Caused by: org.eclipse.rdf4j.rio.RDFParseException:
org.xml.sax.SAXParseException; lineNumber: 306; columnNumber: 55; The entity
"copy" was referenced, but not declared.
at
org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:111)
at
org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:95)
at
org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:105)
... 29 more
Caused by: org.semarglproject.rdf.ParseException:
org.xml.sax.SAXParseException; lineNumber: 306; columnNumber: 55; The entity
"copy" was referenced, but not declared.
at
org.semarglproject.rdf.rdfa.RdfaParser.processException(RdfaParser.java:1141)
at org.semarglproject.source.XmlSource.process(XmlSource.java:50)
at
org.semarglproject.source.StreamProcessor.processInternal(StreamProcessor.java:87)
at
org.semarglproject.source.BaseStreamProcessor.process(BaseStreamProcessor.java:167)
at
org.semarglproject.source.BaseStreamProcessor.process(BaseStreamProcessor.java:154)
at
org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:109)
... 31 more
Caused by: org.xml.sax.SAXParseException; lineNumber: 306; columnNumber:
55; The entity "copy" was referenced, but not declared.
at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
at org.semarglproject.source.XmlSource.process(XmlSource.java:48)
... 35 more
================================================================
<data>
<![CDATA[
@prefix sindice: <http://vocab.sindice.net/> .
<http://any23.apache.org/>
<http://vocab.sindice.net/any23#Date-Revision-yyyymmdd> "20171101"@en ;
<http://vocab.sindice.net/any23#Content-Language> "en"@en ;
<http://vocab.sindice.net/any23#viewport> "width=device-width,
initial-scale=1.0"@en ;
<http://vocab.sindice.net/any23#author> "The Apache Software
Foundation"@en .
@prefix dcterms: <http://purl.org/dc/terms/> .
<http://any23.apache.org/> dcterms:title "Apache Any23 ââ¬â Apache
Any23 - Introduction"@en .
]]>
</data>
```
---