Hi Alfonso,

Can you please provide us with a URI which reproduces this issue? If we can reproduce it, then we can register a ticket over at https://issues.apache.org/jira/projects/ANY23

Thanks

On Thu, Nov 30, 2017 at 6:48 AM, <user-digest-h...@any23.apache.org> wrote:

> From: alfonso.debi...@libero.it
> To: user@any23.apache.org
> Cc:
> Bcc:
> Date: Thu, 30 Nov 2017 15:48:01 +0100 (CET)
> Subject: parse broken uri
>
> Hi users,
>
> I’m using any23 version 2.0 in my project, and I have tested the
> extraction of RDF microformats from HTML pages. One of these HTML pages
> contains an inconsistent URI, without a protocol specification (example:
> //any23.apache.org instead of https://any23.apache.org ).
>
> The library gives me the log:
>
> WARN rdf.Any23ValueFactoryWrapper: Not a valid (absolute) IRI:
>
> INFO extractor.SingleDocumentExtraction: Processing null
>
> I see that the method fixIRIWithException fixes some potentially
> broken relative or absolute URIs, but it doesn’t fix this case.
> Is it possible to integrate a patch to solve this problem? Thanks
>
> Best regards,
>
> Alfonso

--
http://home.apache.org/~lewismc/
@hectorMcSpector
http://www.linkedin.com/in/lmcgibbney
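
For reference, the protocol-relative case described in the quoted message (//any23.apache.org) is a "network-path reference" in RFC 3986 terms: it carries its own host and is meant to inherit only the scheme of the page it appears on. Below is a minimal sketch of that resolution using the JDK's java.net.URI. It is not Any23 code and makes no claim about how fixIRIWithException works; the class name, the helper resolveAgainstBase, and the base IRI https://example.org/page.html are all hypothetical and exist only for illustration.

```java
import java.net.URI;

// Hypothetical illustration class; not part of Any23.
final class ProtocolRelativeResolver {

    // Hypothetical helper: resolves a possibly scheme-less reference
    // against the IRI of the document in which it was found.
    static String resolveAgainstBase(String baseIri, String reference) {
        URI base = URI.create(baseIri);
        // A "//host/path" reference keeps its own authority and path
        // and takes only the scheme from the base (RFC 3986, section 5.2).
        return base.resolve(reference.trim()).toString();
    }

    public static void main(String[] args) {
        // Assumed base IRI of the HTML page being processed.
        String base = "https://example.org/page.html";
        // Prints: https://any23.apache.org
        System.out.println(resolveAgainstBase(base, "//any23.apache.org"));
    }
}
```

Whether this kind of resolution belongs in the extraction pipeline, and at which point the document IRI is available, would need to be checked against the Any23 source once a reproducing URI is at hand.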