[ https://issues.apache.org/jira/browse/ANY23-349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16478622#comment-16478622 ]
ASF GitHub Bot commented on ANY23-349: -------------------------------------- GitHub user HansBrende opened a pull request: https://github.com/apache/any23/pull/84 ANY23-349 repair syntactically invalid microdata links mvn clean test -> all tests pass You can merge this pull request into a Git repository by running: $ git pull https://github.com/HansBrende/any23 ANY23-349 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/any23/pull/84.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #84 ---- commit dae72f3edac0c3e23bfcea97fc031b92dd55c5ed Author: Hans <firedrake93@...> Date: 2018-05-17T06:11:49Z ANY23-349 repair syntactically invalid microdata links ---- > MicrodataExtractor errors for links that are telephone numbers > -------------------------------------------------------------- > > Key: ANY23-349 > URL: https://issues.apache.org/jira/browse/ANY23-349 > Project: Apache Any23 > Issue Type: Bug > Components: microdata > Affects Versions: 2.3 > Reporter: Hans Brende > Priority: Major > > I get the following error when extracting from > http://clubzone.com/ontario-los-angeles/places/ > This error kills the whole extraction process. > {code} > Exception in thread "main" org.apache.any23.extractor.ExtractionException: > Error while processing on subject '_:node1cb6a1b5jx5' the itemProp: '{ > "xpath" : > "/HTML[1]/BODY[1]/DIV[3]/DIV[3]/DIV[1]/DIV[1]/SECTION[1]/ARTICLE[2]/DIV[2]/DIV[1]/P[2]/A[1]", > "name" : "telephone", "value" : { "content" : "tel:(909) 484-2020", "type" : > "Link" } }' > at > org.apache.any23.extractor.microdata.MicrodataExtractor.processType(MicrodataExtractor.java:442) > at > org.apache.any23.extractor.microdata.MicrodataExtractor.run(MicrodataExtractor.java:116) > at > org.apache.any23.extractor.microdata.MicrodataExtractor.run(MicrodataExtractor.java:60) > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)