[
https://issues.apache.org/jira/browse/ANY23-351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16525269#comment-16525269
]
ASF GitHub Bot commented on ANY23-351:
--------------------------------------
Github user HansBrende commented on the issue:
https://github.com/apache/any23/pull/86
@lewismc any comments?
> NullPointerException in HCardExtractor
> --------------------------------------
>
> Key: ANY23-351
> URL: https://issues.apache.org/jira/browse/ANY23-351
> Project: Apache Any23
> Issue Type: Bug
> Components: microformats
> Affects Versions: 2.3
> Reporter: Hans Brende
> Priority: Major
>
> When extracting from the url:
> https://cambridgewi.com/make-cambridge-home/char/V/
> I get the following NullPointerException, which kills the entire extraction
> process:
> {code}
> java.lang.NullPointerException
> at
> org.apache.any23.extractor.html.HTMLDocument.readUrlField(HTMLDocument.java:119)
> at
> org.apache.any23.extractor.html.HTMLDocument.getPluralUrlField(HTMLDocument.java:288)
> at
> org.apache.any23.extractor.html.HCardExtractor.addLogo(HCardExtractor.java:267)
> at
> org.apache.any23.extractor.html.HCardExtractor.extractEntity(HCardExtractor.java:130)
> at
> org.apache.any23.extractor.html.EntityBasedMicroformatExtractor.extract(EntityBasedMicroformatExtractor.java:66)
> at
> org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:102)
> at
> org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:44)
> at
> org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:480)
> at
> org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:259)
> at org.apache.any23.Any23.extract(Any23.java:302)
> at org.apache.any23.Any23.extract(Any23.java:437)
> {code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)