[ 
https://issues.apache.org/jira/browse/NUTCH-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14333206#comment-14333206
 ] 

Mohammad Al-Mohsin edited comment on NUTCH-1933 at 2/23/15 11:11 AM:
---------------------------------------------------------------------

As requested in http://www.mail-archive.com/dev%40nutch.apache.org/msg16617.html,
the patch has been updated so that Selenium handles only HTML and XHTML content
types. While I was at it, I also took care of the Tika 1.7 update.

Thanks,
Mohammad


was (Author: almohsin):
Takes care of Tika 1.7 update and handles only HTML and XHTML content types.

> nutch-selenium plugin
> ---------------------
>
>                 Key: NUTCH-1933
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1933
>             Project: Nutch
>          Issue Type: New Feature
>          Components: protocol
>            Reporter: Mo Omer
>            Assignee: Lewis John McGibbney
>             Fix For: 1.10
>
>         Attachments: NUTCH-selenium-trunk.patch, NUTCH-selenium-trunk.v2.patch
>
>
> I updated the [nutch-selenium|https://github.com/momer/nutch-selenium] plugin
> to run against trunk.
> I feel that there is a good bit of work still to be done here; however, early
> testing on my system indicates that it works.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
