[
https://issues.apache.org/jira/browse/NUTCH-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13872106#comment-13872106
]
Canan Girgin commented on NUTCH-1703:
-------------------------------------
ok. A new patch Patch had been added which contains TestDOMContentUtils class.
(NUTCH_1703.patch_v1)
> Nutch ignores alt text of images
> --------------------------------
>
> Key: NUTCH-1703
> URL: https://issues.apache.org/jira/browse/NUTCH-1703
> Project: Nutch
> Issue Type: Bug
> Components: parser
> Affects Versions: 2.2.1
> Reporter: Canan Girgin
> Fix For: 2.3, 1.8
>
> Attachments: NUTCH_1703.patch, NUTCH_1703_v2.patch
>
>
> If you put image as link alt text of that image is equivalent to the anchor
> text of text link. During content parse nutch does not give image alt text
> and anchor text for that link is empty.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)