Hans Brende created ANY23-351: --------------------------------- Summary: NullPointerException in HCardExtractor Key: ANY23-351 URL: https://issues.apache.org/jira/browse/ANY23-351 Project: Apache Any23 Issue Type: Bug Components: microformats Affects Versions: 2.3 Reporter: Hans Brende
When extracting from the url: https://cambridgewi.com/make-cambridge-home/char/V/ I get the following NullPointerException, which kills the entire extraction process: {code} java.lang.NullPointerException at org.apache.any23.extractor.html.HTMLDocument.readUrlField(HTMLDocument.java:119) at org.apache.any23.extractor.html.HTMLDocument.getPluralUrlField(HTMLDocument.java:288) at org.apache.any23.extractor.html.HCardExtractor.addLogo(HCardExtractor.java:267) at org.apache.any23.extractor.html.HCardExtractor.extractEntity(HCardExtractor.java:130) at org.apache.any23.extractor.html.EntityBasedMicroformatExtractor.extract(EntityBasedMicroformatExtractor.java:66) at org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:102) at org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:44) at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:480) at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:259) at org.apache.any23.Any23.extract(Any23.java:302) at org.apache.any23.Any23.extract(Any23.java:437) {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)