[
https://issues.apache.org/jira/browse/ANY23-351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16525671#comment-16525671
]
Hudson commented on ANY23-351:
------------------------------
SUCCESS: Integrated in Jenkins build Any23-trunk #1570 (See
[https://builds.apache.org/job/Any23-trunk/1570/])
ANY23-351 fixed NullPointerException in HCardExtractor (hans: rev
31e1142d1c43ca06065d6d48dd929f16a60f7c12)
* (add) test-resources/src/test/resources/microformats/hcard/null-pointer.html
* (edit) core/src/main/java/org/apache/any23/extractor/html/HCardExtractor.java
* (edit)
core/src/test/java/org/apache/any23/extractor/html/HCardExtractorTest.java
* (edit) core/src/main/java/org/apache/any23/extractor/html/HTMLDocument.java
> NullPointerException in HCardExtractor
> --------------------------------------
>
> Key: ANY23-351
> URL: https://issues.apache.org/jira/browse/ANY23-351
> Project: Apache Any23
> Issue Type: Bug
> Components: microformats
> Affects Versions: 2.3
> Reporter: Hans Brende
> Assignee: Hans Brende
> Priority: Major
> Fix For: 2.3
>
>
> When extracting from the url:
> https://cambridgewi.com/make-cambridge-home/char/V/
> I get the following NullPointerException, which kills the entire extraction
> process:
> {code}
> java.lang.NullPointerException
> at
> org.apache.any23.extractor.html.HTMLDocument.readUrlField(HTMLDocument.java:119)
> at
> org.apache.any23.extractor.html.HTMLDocument.getPluralUrlField(HTMLDocument.java:288)
> at
> org.apache.any23.extractor.html.HCardExtractor.addLogo(HCardExtractor.java:267)
> at
> org.apache.any23.extractor.html.HCardExtractor.extractEntity(HCardExtractor.java:130)
> at
> org.apache.any23.extractor.html.EntityBasedMicroformatExtractor.extract(EntityBasedMicroformatExtractor.java:66)
> at
> org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:102)
> at
> org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:44)
> at
> org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:480)
> at
> org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:259)
> at org.apache.any23.Any23.extract(Any23.java:302)
> at org.apache.any23.Any23.extract(Any23.java:437)
> {code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)