Hi Rupert, There is also problem with web page. That means that text works just fine, but I have problems with documents and web pages. When I try to annotate web page, this is what I get:
http://en.wikipedia.org/wiki/Semantic_Web // URL to annotate **** 174030 // file length - this means that you were right about GetMethod, and it works now 174030 text/html // file length and content type just before calling annotate method ERROR <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/> <title>Error 500 INTERNAL_SERVER_ERROR</title> </head> <body><h2>HTTP ERROR 500</h2> <p>Problem accessing /engines. Reason: <pre> INTERNAL_SERVER_ERROR</pre></p><h3>Caused by:</h3><pre>org.apache.stanbol.enhancer.servicesapi.EngineException at org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine.computeEnhancements(MetaxaEngine.java:191) at org.apache.stanbol.enhancer.jobmanager.impl.WeightedJobManager.enhanceContent(WeightedJobManager.java:80) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceAndBuildResponse(EnginesRootResource.java:175) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceFromData(EnginesRootResource.java:167) at sun.reflect.GeneratedMethodAccessor35.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) at java.lang.reflect.Method.invoke(Unknown Source) at com.sun.jersey.spi.container.JavaMethodInvokerFactory$1.invoke(JavaMethodInvokerFactory.java:60) at com.sun.jersey.server.impl.model.method.dispatch.AbstractResourceMethodDispatchProvider$ResponseOutInvoker._dispatch(AbstractResourceMethodDispatchProvider.java:205) at com.sun.jersey.server.impl.model.method.dispatch.ResourceJavaMethodDispatcher.dispatch(ResourceJavaMethodDispatcher.java:75) at com.sun.jersey.server.impl.uri.rules.HttpMethodRule.accept(HttpMethodRule.java:288) at com.sun.jersey.server.impl.uri.rules.ResourceClassRule.accept(ResourceClassRule.java:108) at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:147) at com.sun.jersey.server.impl.uri.rules.RootResourceClassesRule.accept(RootResourceClassesRule.java:84) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1465) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1396) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1345) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1335) at com.sun.jersey.spi.container.servlet.WebComponent.service(WebComponent.java:416) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:537) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:699) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.apache.felix.http.base.internal.handler.ServletHandler.doHandle(ServletHandler.java:96) at org.apache.felix.http.base.internal.handler.ServletHandler.handle(ServletHandler.java:79) at org.apache.felix.http.base.internal.dispatch.ServletPipeline.handle(ServletPipeline.java:42) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:49) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.stanbol.commons.httpqueryheaders.impl.QueryHeadersFilter.doFilter(QueryHeadersFilter.java:75) at org.apache.felix.http.base.internal.handler.FilterHandler.doHandle(FilterHandler.java:88) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:76) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:78) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.dispatch.FilterPipeline.dispatch(FilterPipeline.java:48) at org.apache.felix.http.base.internal.dispatch.Dispatcher.dispatch(Dispatcher.java:39) at org.apache.felix.http.base.internal.DispatcherServlet.service(DispatcherServlet.java:67) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:511) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:390) at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182) at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:765) at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152) at org.mortbay.jetty.Server.handle(Server.java:326) at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542) at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:943) at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756) at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212) at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404) at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:410) at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582) Caused by: org.semanticdesktop.aperture.extractor.ExtractorException at org.apache.stanbol.enhancer.engines.metaxa.core.html.XsltExtractor.extract(XsltExtractor.java:147) at org.apache.stanbol.enhancer.engines.metaxa.core.html.IksHtmlExtractor.extract(IksHtmlExtractor.java:123) at org.apache.stanbol.enhancer.engines.metaxa.core.MetaxaCore.extract(MetaxaCore.java:120) at org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine.computeEnhancements(MetaxaEngine.java:157) ... 51 more Caused by: java.io.IOException at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:661) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:652) at org.apache.stanbol.enhancer.engines.metaxa.core.html.XsltExtractor.extract(XsltExtractor.java:140) ... 54 more Caused by: org.openrdf.rio.RDFParseException: Not a valid (absolute) URI: // creativecommons.org/licenses/by-sa/3.0/ [line 4, column 179] at org.openrdf.rio.helpers.RDFParserBase.reportFatalError(RDFParserBase.java:533) at org.openrdf.rio.rdfxml.RDFXMLParser.reportFatalError(RDFXMLParser.java:1068) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:285) at org.openrdf.rio.helpers.RDFParserBase.resolveURI(RDFParserBase.java:272) at org.openrdf.rio.rdfxml.RDFXMLParser.getPropertyResource(RDFXMLParser.java:751) at org.openrdf.rio.rdfxml.RDFXMLParser.processPropertyElt(RDFXMLParser.java:674) at org.openrdf.rio.rdfxml.RDFXMLParser.emptyElement(RDFXMLParser.java:378) at org.openrdf.rio.rdfxml.SAXFilter.endElement(SAXFilter.java:359) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.endElement(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractXMLDocumentParser.emptyElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.scanStartElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:260) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:244) at org.openrdf.repository.base.RepositoryConnectionBase.addInputStreamOrReader(RepositoryConnectionBase.java:357) at org.openrdf.repository.base.RepositoryConnectionBase.add(RepositoryConnectionBase.java:312) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:659) ... 56 more Caused by: java.lang.IllegalArgumentException: Not a valid (absolute) URI: //creativecommons.org/licenses/by-sa/3.0/ at org.openrdf.model.impl.URIImpl.setURIString(URIImpl.java:68) at org.openrdf.model.impl.URIImpl.<init>(URIImpl.java:57) at org.openrdf.sail.memory.model.MemValueFactory.createURI(MemValueFactory.java:345) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:282) ... 78 more </pre> <h3>Caused by:</h3><pre>org.semanticdesktop.aperture.extractor.ExtractorException at org.apache.stanbol.enhancer.engines.metaxa.core.html.XsltExtractor.extract(XsltExtractor.java:147) at org.apache.stanbol.enhancer.engines.metaxa.core.html.IksHtmlExtractor.extract(IksHtmlExtractor.java:123) at org.apache.stanbol.enhancer.engines.metaxa.core.MetaxaCore.extract(MetaxaCore.java:120) at org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine.computeEnhancements(MetaxaEngine.java:157) at org.apache.stanbol.enhancer.jobmanager.impl.WeightedJobManager.enhanceContent(WeightedJobManager.java:80) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceAndBuildResponse(EnginesRootResource.java:175) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceFromData(EnginesRootResource.java:167) at sun.reflect.GeneratedMethodAccessor35.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) at java.lang.reflect.Method.invoke(Unknown Source) at com.sun.jersey.spi.container.JavaMethodInvokerFactory$1.invoke(JavaMethodInvokerFactory.java:60) at com.sun.jersey.server.impl.model.method.dispatch.AbstractResourceMethodDispatchProvider$ResponseOutInvoker._dispatch(AbstractResourceMethodDispatchProvider.java:205) at com.sun.jersey.server.impl.model.method.dispatch.ResourceJavaMethodDispatcher.dispatch(ResourceJavaMethodDispatcher.java:75) at com.sun.jersey.server.impl.uri.rules.HttpMethodRule.accept(HttpMethodRule.java:288) at com.sun.jersey.server.impl.uri.rules.ResourceClassRule.accept(ResourceClassRule.java:108) at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:147) at com.sun.jersey.server.impl.uri.rules.RootResourceClassesRule.accept(RootResourceClassesRule.java:84) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1465) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1396) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1345) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1335) at com.sun.jersey.spi.container.servlet.WebComponent.service(WebComponent.java:416) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:537) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:699) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.apache.felix.http.base.internal.handler.ServletHandler.doHandle(ServletHandler.java:96) at org.apache.felix.http.base.internal.handler.ServletHandler.handle(ServletHandler.java:79) at org.apache.felix.http.base.internal.dispatch.ServletPipeline.handle(ServletPipeline.java:42) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:49) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.stanbol.commons.httpqueryheaders.impl.QueryHeadersFilter.doFilter(QueryHeadersFilter.java:75) at org.apache.felix.http.base.internal.handler.FilterHandler.doHandle(FilterHandler.java:88) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:76) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:78) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.dispatch.FilterPipeline.dispatch(FilterPipeline.java:48) at org.apache.felix.http.base.internal.dispatch.Dispatcher.dispatch(Dispatcher.java:39) at org.apache.felix.http.base.internal.DispatcherServlet.service(DispatcherServlet.java:67) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:511) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:390) at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182) at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:765) at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152) at org.mortbay.jetty.Server.handle(Server.java:326) at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542) at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:943) at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756) at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212) at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404) at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:410) at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582) Caused by: java.io.IOException at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:661) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:652) at org.apache.stanbol.enhancer.engines.metaxa.core.html.XsltExtractor.extract(XsltExtractor.java:140) ... 54 more Caused by: org.openrdf.rio.RDFParseException: Not a valid (absolute) URI: // creativecommons.org/licenses/by-sa/3.0/ [line 4, column 179] at org.openrdf.rio.helpers.RDFParserBase.reportFatalError(RDFParserBase.java:533) at org.openrdf.rio.rdfxml.RDFXMLParser.reportFatalError(RDFXMLParser.java:1068) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:285) at org.openrdf.rio.helpers.RDFParserBase.resolveURI(RDFParserBase.java:272) at org.openrdf.rio.rdfxml.RDFXMLParser.getPropertyResource(RDFXMLParser.java:751) at org.openrdf.rio.rdfxml.RDFXMLParser.processPropertyElt(RDFXMLParser.java:674) at org.openrdf.rio.rdfxml.RDFXMLParser.emptyElement(RDFXMLParser.java:378) at org.openrdf.rio.rdfxml.SAXFilter.endElement(SAXFilter.java:359) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.endElement(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractXMLDocumentParser.emptyElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.scanStartElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:260) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:244) at org.openrdf.repository.base.RepositoryConnectionBase.addInputStreamOrReader(RepositoryConnectionBase.java:357) at org.openrdf.repository.base.RepositoryConnectionBase.add(RepositoryConnectionBase.java:312) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:659) ... 56 more Caused by: java.lang.IllegalArgumentException: Not a valid (absolute) URI: //creativecommons.org/licenses/by-sa/3.0/ at org.openrdf.model.impl.URIImpl.setURIString(URIImpl.java:68) at org.openrdf.model.impl.URIImpl.<init>(URIImpl.java:57) at org.openrdf.sail.memory.model.MemValueFactory.createURI(MemValueFactory.java:345) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:282) ... 78 more </pre> <h3>Caused by:</h3><pre>java.io.IOException at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:661) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:652) at org.apache.stanbol.enhancer.engines.metaxa.core.html.XsltExtractor.extract(XsltExtractor.java:140) at org.apache.stanbol.enhancer.engines.metaxa.core.html.IksHtmlExtractor.extract(IksHtmlExtractor.java:123) at org.apache.stanbol.enhancer.engines.metaxa.core.MetaxaCore.extract(MetaxaCore.java:120) at org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine.computeEnhancements(MetaxaEngine.java:157) at org.apache.stanbol.enhancer.jobmanager.impl.WeightedJobManager.enhanceContent(WeightedJobManager.java:80) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceAndBuildResponse(EnginesRootResource.java:175) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceFromData(EnginesRootResource.java:167) at sun.reflect.GeneratedMethodAccessor35.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) at java.lang.reflect.Method.invoke(Unknown Source) at com.sun.jersey.spi.container.JavaMethodInvokerFactory$1.invoke(JavaMethodInvokerFactory.java:60) at com.sun.jersey.server.impl.model.method.dispatch.AbstractResourceMethodDispatchProvider$ResponseOutInvoker._dispatch(AbstractResourceMethodDispatchProvider.java:205) at com.sun.jersey.server.impl.model.method.dispatch.ResourceJavaMethodDispatcher.dispatch(ResourceJavaMethodDispatcher.java:75) at com.sun.jersey.server.impl.uri.rules.HttpMethodRule.accept(HttpMethodRule.java:288) at com.sun.jersey.server.impl.uri.rules.ResourceClassRule.accept(ResourceClassRule.java:108) at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:147) at com.sun.jersey.server.impl.uri.rules.RootResourceClassesRule.accept(RootResourceClassesRule.java:84) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1465) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1396) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1345) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1335) at com.sun.jersey.spi.container.servlet.WebComponent.service(WebComponent.java:416) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:537) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:699) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.apache.felix.http.base.internal.handler.ServletHandler.doHandle(ServletHandler.java:96) at org.apache.felix.http.base.internal.handler.ServletHandler.handle(ServletHandler.java:79) at org.apache.felix.http.base.internal.dispatch.ServletPipeline.handle(ServletPipeline.java:42) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:49) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.stanbol.commons.httpqueryheaders.impl.QueryHeadersFilter.doFilter(QueryHeadersFilter.java:75) at org.apache.felix.http.base.internal.handler.FilterHandler.doHandle(FilterHandler.java:88) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:76) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:78) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.dispatch.FilterPipeline.dispatch(FilterPipeline.java:48) at org.apache.felix.http.base.internal.dispatch.Dispatcher.dispatch(Dispatcher.java:39) at org.apache.felix.http.base.internal.DispatcherServlet.service(DispatcherServlet.java:67) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:511) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:390) at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182) at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:765) at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152) at org.mortbay.jetty.Server.handle(Server.java:326) at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542) at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:943) at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756) at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212) at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404) at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:410) at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582) Caused by: org.openrdf.rio.RDFParseException: Not a valid (absolute) URI: // creativecommons.org/licenses/by-sa/3.0/ [line 4, column 179] at org.openrdf.rio.helpers.RDFParserBase.reportFatalError(RDFParserBase.java:533) at org.openrdf.rio.rdfxml.RDFXMLParser.reportFatalError(RDFXMLParser.java:1068) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:285) at org.openrdf.rio.helpers.RDFParserBase.resolveURI(RDFParserBase.java:272) at org.openrdf.rio.rdfxml.RDFXMLParser.getPropertyResource(RDFXMLParser.java:751) at org.openrdf.rio.rdfxml.RDFXMLParser.processPropertyElt(RDFXMLParser.java:674) at org.openrdf.rio.rdfxml.RDFXMLParser.emptyElement(RDFXMLParser.java:378) at org.openrdf.rio.rdfxml.SAXFilter.endElement(SAXFilter.java:359) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.endElement(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractXMLDocumentParser.emptyElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.scanStartElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:260) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:244) at org.openrdf.repository.base.RepositoryConnectionBase.addInputStreamOrReader(RepositoryConnectionBase.java:357) at org.openrdf.repository.base.RepositoryConnectionBase.add(RepositoryConnectionBase.java:312) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:659) ... 56 more Caused by: java.lang.IllegalArgumentException: Not a valid (absolute) URI: //creativecommons.org/licenses/by-sa/3.0/ at org.openrdf.model.impl.URIImpl.setURIString(URIImpl.java:68) at org.openrdf.model.impl.URIImpl.<init>(URIImpl.java:57) at org.openrdf.sail.memory.model.MemValueFactory.createURI(MemValueFactory.java:345) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:282) ... 78 more </pre> <h3>Caused by:</h3><pre>org.openrdf.rio.RDFParseException: Not a valid (absolute) URI: //creativecommons.org/licenses/by-sa/3.0/ [line 4, column 179] at org.openrdf.rio.helpers.RDFParserBase.reportFatalError(RDFParserBase.java:533) at org.openrdf.rio.rdfxml.RDFXMLParser.reportFatalError(RDFXMLParser.java:1068) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:285) at org.openrdf.rio.helpers.RDFParserBase.resolveURI(RDFParserBase.java:272) at org.openrdf.rio.rdfxml.RDFXMLParser.getPropertyResource(RDFXMLParser.java:751) at org.openrdf.rio.rdfxml.RDFXMLParser.processPropertyElt(RDFXMLParser.java:674) at org.openrdf.rio.rdfxml.RDFXMLParser.emptyElement(RDFXMLParser.java:378) at org.openrdf.rio.rdfxml.SAXFilter.endElement(SAXFilter.java:359) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.endElement(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractXMLDocumentParser.emptyElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.scanStartElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:260) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:244) at org.openrdf.repository.base.RepositoryConnectionBase.addInputStreamOrReader(RepositoryConnectionBase.java:357) at org.openrdf.repository.base.RepositoryConnectionBase.add(RepositoryConnectionBase.java:312) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:659) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:652) at org.apache.stanbol.enhancer.engines.metaxa.core.html.XsltExtractor.extract(XsltExtractor.java:140) at org.apache.stanbol.enhancer.engines.metaxa.core.html.IksHtmlExtractor.extract(IksHtmlExtractor.java:123) at org.apache.stanbol.enhancer.engines.metaxa.core.MetaxaCore.extract(MetaxaCore.java:120) at org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine.computeEnhancements(MetaxaEngine.java:157) at org.apache.stanbol.enhancer.jobmanager.impl.WeightedJobManager.enhanceContent(WeightedJobManager.java:80) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceAndBuildResponse(EnginesRootResource.java:175) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceFromData(EnginesRootResource.java:167) at sun.reflect.GeneratedMethodAccessor35.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) at java.lang.reflect.Method.invoke(Unknown Source) at com.sun.jersey.spi.container.JavaMethodInvokerFactory$1.invoke(JavaMethodInvokerFactory.java:60) at com.sun.jersey.server.impl.model.method.dispatch.AbstractResourceMethodDispatchProvider$ResponseOutInvoker._dispatch(AbstractResourceMethodDispatchProvider.java:205) at com.sun.jersey.server.impl.model.method.dispatch.ResourceJavaMethodDispatcher.dispatch(ResourceJavaMethodDispatcher.java:75) at com.sun.jersey.server.impl.uri.rules.HttpMethodRule.accept(HttpMethodRule.java:288) at com.sun.jersey.server.impl.uri.rules.ResourceClassRule.accept(ResourceClassRule.java:108) at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:147) at com.sun.jersey.server.impl.uri.rules.RootResourceClassesRule.accept(RootResourceClassesRule.java:84) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1465) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1396) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1345) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1335) at com.sun.jersey.spi.container.servlet.WebComponent.service(WebComponent.java:416) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:537) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:699) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.apache.felix.http.base.internal.handler.ServletHandler.doHandle(ServletHandler.java:96) at org.apache.felix.http.base.internal.handler.ServletHandler.handle(ServletHandler.java:79) at org.apache.felix.http.base.internal.dispatch.ServletPipeline.handle(ServletPipeline.java:42) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:49) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.stanbol.commons.httpqueryheaders.impl.QueryHeadersFilter.doFilter(QueryHeadersFilter.java:75) at org.apache.felix.http.base.internal.handler.FilterHandler.doHandle(FilterHandler.java:88) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:76) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:78) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.dispatch.FilterPipeline.dispatch(FilterPipeline.java:48) at org.apache.felix.http.base.internal.dispatch.Dispatcher.dispatch(Dispatcher.java:39) at org.apache.felix.http.base.internal.DispatcherServlet.service(DispatcherServlet.java:67) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:511) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:390) at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182) at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:765) at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152) at org.mortbay.jetty.Server.handle(Server.java:326) at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542) at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:943) at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756) at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212) at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404) at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:410) at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582) Caused by: java.lang.IllegalArgumentException: Not a valid (absolute) URI: //creativecommons.org/licenses/by-sa/3.0/ at org.openrdf.model.impl.URIImpl.setURIString(URIImpl.java:68) at org.openrdf.model.impl.URIImpl.<init>(URIImpl.java:57) at org.openrdf.sail.memory.model.MemValueFactory.createURI(MemValueFactory.java:345) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:282) ... 78 more </pre> <h3>Caused by:</h3><pre>java.lang.IllegalArgumentException: Not a valid (absolute) URI: //creativecommons.org/licenses/by-sa/3.0/ at org.openrdf.model.impl.URIImpl.setURIString(URIImpl.java:68) at org.openrdf.model.impl.URIImpl.<init>(URIImpl.java:57) at org.openrdf.sail.memory.model.MemValueFactory.createURI(MemValueFactory.java:345) at org.openrdf.rio.helpers.RDFParserBase.createURI(RDFParserBase.java:282) at org.openrdf.rio.helpers.RDFParserBase.resolveURI(RDFParserBase.java:272) at org.openrdf.rio.rdfxml.RDFXMLParser.getPropertyResource(RDFXMLParser.java:751) at org.openrdf.rio.rdfxml.RDFXMLParser.processPropertyElt(RDFXMLParser.java:674) at org.openrdf.rio.rdfxml.RDFXMLParser.emptyElement(RDFXMLParser.java:378) at org.openrdf.rio.rdfxml.SAXFilter.endElement(SAXFilter.java:359) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.endElement(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractXMLDocumentParser.emptyElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.scanStartElement(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:260) at org.openrdf.rio.rdfxml.RDFXMLParser.parse(RDFXMLParser.java:244) at org.openrdf.repository.base.RepositoryConnectionBase.addInputStreamOrReader(RepositoryConnectionBase.java:357) at org.openrdf.repository.base.RepositoryConnectionBase.add(RepositoryConnectionBase.java:312) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:659) at org.openrdf.rdf2go.RepositoryModel.readFrom(RepositoryModel.java:652) at org.apache.stanbol.enhancer.engines.metaxa.core.html.XsltExtractor.extract(XsltExtractor.java:140) at org.apache.stanbol.enhancer.engines.metaxa.core.html.IksHtmlExtractor.extract(IksHtmlExtractor.java:123) at org.apache.stanbol.enhancer.engines.metaxa.core.MetaxaCore.extract(MetaxaCore.java:120) at org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine.computeEnhancements(MetaxaEngine.java:157) at org.apache.stanbol.enhancer.jobmanager.impl.WeightedJobManager.enhanceContent(WeightedJobManager.java:80) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceAndBuildResponse(EnginesRootResource.java:175) at org.apache.stanbol.enhancer.jersey.resource.EnginesRootResource.enhanceFromData(EnginesRootResource.java:167) at sun.reflect.GeneratedMethodAccessor35.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) at java.lang.reflect.Method.invoke(Unknown Source) at com.sun.jersey.spi.container.JavaMethodInvokerFactory$1.invoke(JavaMethodInvokerFactory.java:60) at com.sun.jersey.server.impl.model.method.dispatch.AbstractResourceMethodDispatchProvider$ResponseOutInvoker._dispatch(AbstractResourceMethodDispatchProvider.java:205) at com.sun.jersey.server.impl.model.method.dispatch.ResourceJavaMethodDispatcher.dispatch(ResourceJavaMethodDispatcher.java:75) at com.sun.jersey.server.impl.uri.rules.HttpMethodRule.accept(HttpMethodRule.java:288) at com.sun.jersey.server.impl.uri.rules.ResourceClassRule.accept(ResourceClassRule.java:108) at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:147) at com.sun.jersey.server.impl.uri.rules.RootResourceClassesRule.accept(RootResourceClassesRule.java:84) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1465) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1396) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1345) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1335) at com.sun.jersey.spi.container.servlet.WebComponent.service(WebComponent.java:416) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:537) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:699) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.apache.felix.http.base.internal.handler.ServletHandler.doHandle(ServletHandler.java:96) at org.apache.felix.http.base.internal.handler.ServletHandler.handle(ServletHandler.java:79) at org.apache.felix.http.base.internal.dispatch.ServletPipeline.handle(ServletPipeline.java:42) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:49) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.stanbol.commons.httpqueryheaders.impl.QueryHeadersFilter.doFilter(QueryHeadersFilter.java:75) at org.apache.felix.http.base.internal.handler.FilterHandler.doHandle(FilterHandler.java:88) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:76) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.handler.FilterHandler.handle(FilterHandler.java:78) at org.apache.felix.http.base.internal.dispatch.InvocationFilterChain.doFilter(InvocationFilterChain.java:47) at org.apache.felix.http.base.internal.dispatch.HttpFilterChain.doFilter(HttpFilterChain.java:33) at org.apache.felix.http.base.internal.dispatch.FilterPipeline.dispatch(FilterPipeline.java:48) at org.apache.felix.http.base.internal.dispatch.Dispatcher.dispatch(Dispatcher.java:39) at org.apache.felix.http.base.internal.DispatcherServlet.service(DispatcherServlet.java:67) at javax.servlet.http.HttpServlet.service(HttpServlet.java:820) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:511) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:390) at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182) at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:765) at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152) at org.mortbay.jetty.Server.handle(Server.java:326) at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542) at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:943) at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756) at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212) at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404) at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:410) at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582) </pre> <hr /><i><small>Powered by Jetty://</small></i><br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> <br/> </body> </html> Best, Srecko On Thu, Jan 12, 2012 at 6:40 PM, srecko joksimovic < [email protected]> wrote: > Hi Rupert, > > I have another question, and I will finish soon. > > I tried to annotate pdf document, and I didn't get result I expected. Then > I put string you sent to me > "John Smith works for the Apple Inc. in Cupertino, California." > in MS Word document, and this is the result I got: > > <rdf:RDF > xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" > xmlns:j.0="http://www.semanticdesktop.org/ontologies/2007/01/19/nie#" > xmlns:j.1="http://purl.org/dc/terms/" > xmlns:j.2="http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#" > xmlns:j.3="http://fise.iks-project.eu/ontology/" > > <rdf:Description > rdf:about="urn:enhancement-55016818-eb97-7b98-521a-422e3742173b"> > <rdf:type rdf:resource=" > http://fise.iks-project.eu/ontology/TextAnnotation"/> > <j.1:creator rdf:datatype="http://www.w3.org/2001/XMLSchema#string > ">org.apache.stanbol.enhancer.engines.langid.LangIdEnhancementEngine</j.1:creator> > <j.1:created rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime > ">2012-01-12T17:34:20.288Z</j.1:created> > <j.3:extracted-from > rdf:resource="urn:content-item-sha1-835c8a5397d9b376a268b7bb5d3c8b4ab7e8b81f"/> > <rdf:type rdf:resource=" > http://fise.iks-project.eu/ontology/Enhancement"/> > <j.1:language>fr</j.1:language> > </rdf:Description> > <rdf:Description > rdf:about="urn:content-item-sha1-835c8a5397d9b376a268b7bb5d3c8b4ab7e8b81f"> > <rdf:type rdf:resource=" > http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#PaginatedTextDocument > "/> > <j.0:plainTextContent>Microsoft Word-Dokument
 > srecko</j.0:plainTextContent> > </rdf:Description> > <rdf:Description > rdf:about="urn:enhancement-0644a1ed-f1d8-334d-d4e9-690a0446cba8"> > <j.3:confidence rdf:datatype="http://www.w3.org/2001/XMLSchema#double > ">1.0</j.3:confidence> > <rdf:type rdf:resource=" > http://fise.iks-project.eu/ontology/TextAnnotation"/> > <j.1:creator rdf:datatype="http://www.w3.org/2001/XMLSchema#string > ">org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine</j.1:creator> > <j.1:created rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime > ">2012-01-12T17:34:20.273Z</j.1:created> > <j.3:extracted-from > rdf:resource="urn:content-item-sha1-835c8a5397d9b376a268b7bb5d3c8b4ab7e8b81f"/> > <rdf:type rdf:resource=" > http://fise.iks-project.eu/ontology/Enhancement"/> > </rdf:Description> > </rdf:RDF> > > > and this is the code: > > public List<String> Annotate(byte[] _stream_to_annotate, > ServiceUtils.MIMETypes _content_type, String _encoding) > { > List<String> _return_list = new ArrayList<String>(); > try > { > URL url = new URL(ServiceUtils.SERVICE_URL); > HttpURLConnection con = (HttpURLConnection)url.openConnection(); > > con.setDoOutput(true); > con.setRequestMethod("POST"); > con.setRequestProperty("Accept", "application/rdf+xml"); > > con.setRequestProperty("Content-type", _content_type.getValue()); > > java.io.OutputStream out = con.getOutputStream(); > > IOUtils.write(_stream_to_annotate, out); > IOUtils.closeQuietly(out); > > con.connect(); //send the request > > if(con.getResponseCode() > 299) > { > java.io.InputStream errorStream = con.getErrorStream(); > > if(errorStream != null) > { > String errorMessage = IOUtils.toString(errorStream); > > IOUtils.closeQuietly(errorStream); > } > else > { > //no error data > //write default error message with the status code > > } > } > else > { > Model model = ModelFactory.createDefaultModel(); > > java.io.InputStream enhancementResults = con.getInputStream(); > > model.read(enhancementResults, null); > > String queryStringForGraph = "PREFIX t: < > http://fise.iks-project.eu/ontology/> " + > "SELECT ?label WHERE {?alias t:entity-reference ?label}"; > > Query query = QueryFactory.create(queryStringForGraph); > > QueryExecution qe = QueryExecutionFactory.create(query, model); > > ResultSet results = qe.execSelect(); > while(results.hasNext()) > { > _return_list.add(results.next().toString()); > } > } > } > catch(Exception ex) > { > System.out.println(ex.getMessage()); > } > return _return_list; > } > > On Thu, Jan 12, 2012 at 8:32 AM, srecko joksimovic < > [email protected]> wrote: > >> >> Hi Rupert, >> >> Thank you for the answer. I've probably missed that. >> >> Best, >> Srecko >> >> >> On Thu, Jan 12, 2012 at 6:12 AM, Rupert Westenthaler < >> [email protected]> wrote: >> >>> Hi Srecko >>> >>> I think the last time I directly used this API is about 3-4 years ago, >>> but after a look at the http client tutorial [1] I think the reason for >>> your problem is that you do not execute the GetMethod. >>> >>> Based on this tutorial the code should look like >>> >>> // Create an instance of HttpClient. >>> HttpClient client = new HttpClient(); >>> GetMethod get = new GetMethod(url); >>> try { >>> // Execute the method. >>> int statusCode = client.executeMethod(get); >>> if (statusCode != HttpStatus.SC_OK) { >>> //handle the error >>> } >>> InputStream t_is = get.getResponseBodyAsStream(); >>> //read the data of the stream >>> } >>> >>> In addition you should not use a Reader if you want to read byte >>> oriented data from the input stream. >>> >>> hope this helps >>> best >>> Rupert >>> >>> [1] http://hc.apache.org/httpclient-3.x/tutorial.html >>> >>> On 11.01.2012, at 22:34, Srecko Joksimovic wrote: >>> >>> > That's it. Thank you! >>> > I have already configured KeywordLinkingEngine when I used my own >>> ontology. >>> > I think I'm familiar with that and I will try that option too. >>> > >>> > In meanwhile I found another interesting problem. I tried to annotate >>> > document and web page. With web page, I tried >>> > IOUtils.write(byte[], out) and I had to convert URL to byte[]: >>> > >>> > public static byte[] GetBytesFromURL(String _url) throws IOException >>> > { >>> > GetMethod get = new GetMethod(_url); >>> > InputStream t_is = get.getResponseBodyAsStream(); >>> > byte[] buffer = new byte[1024]; >>> > int count = -1; >>> > Reader t_url_reader = new BufferedReader(new >>> > InputStreamReader(t_is)); >>> > byte[] t_bytes = IOUtils.toByteArray(t_url_reader, "UTF-8"); >>> > >>> > return t_bytes; >>> > } >>> > >>> > But, the problem is that I'm getting null for InputStream. >>> > >>> > Any ideas? >>> > >>> > Best, >>> > Srecko >>> > >>> > >>> > >>> > -----Original Message----- >>> > From: Rupert Westenthaler [mailto:[email protected]] >>> > Sent: Wednesday, January 11, 2012 22:08 >>> > To: Srecko Joksimovic >>> > Cc: [email protected] >>> > Subject: Re: Annotating using DBPedia ontology >>> > >>> > >>> > On 11.01.2012, at 21:41, Srecko Joksimovic wrote: >>> >> Hi Rupert, >>> >> >>> >> When I load localhost:8080/engines it says this: >>> >> >>> >> There are currently 5 active engines. >>> >> org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine >>> >> org.apache.stanbol.enhancer.engines.langid.LangIdEnhancementEngine >>> >> >>> > >>> org.apache.stanbol.enhancer.engines.opennlp.impl.NamedEntityExtractionEnhanc >>> >> ementEngine >>> >> >>> > >>> org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEng >>> >> ine >>> >> >>> > >>> org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEng >>> >> ine >>> >> >>> >> Maybe this could tell you something? >>> >> >>> > >>> > This are exactly the 5 engines that are expected to run with the >>> default >>> > configuration. >>> > Based on this the Stanbol Enhnacer should just work fine. >>> > >>> > After looking at the the text you enhanced I noticed however that is >>> does >>> > not mention >>> > any named entities such as Persons, Organizations and Places. So I >>> checked >>> > it with >>> > my local Stanbol version and was also not any detected entities. >>> > >>> > So to check if Stanbol works as expected you should try to use an >>> other text >>> > the >>> > mentions some Named Entities such as >>> > >>> > "John Smith works for the Apple Inc. in Cupertino, California." >>> > >>> > >>> > If you want to search also for entities like "Bank", "Blog", >>> "Consumer", >>> > "Telephone" . >>> > you need to also configure a KeywordLinkingEngine for dbpedia. Part B >>> or [3] >>> > provides >>> > more information on how to do that. >>> > >>> > But let me mention that the KeywordLinkingEngine is more useful if >>> used in >>> > combination >>> > with an own domain specific thesaurus rather than a global data set >>> like >>> > dbpedia. When >>> > used with dbpedia you will also get a lot of false positives. >>> > >>> > best >>> > Rupert >>> > >>> > [3] >>> http://incubator.apache.org/stanbol/docs/trunk/customvocabulary.html >>> > >>> >>> >> >
