[
https://issues.apache.org/jira/browse/NUTCH-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14335538#comment-14335538
]
Lewis John McGibbney commented on NUTCH-1933:
---------------------------------------------
Hi [~almohsin], GREAT :)
Further comments:
* The following code should be moved from $NUTCH_HOME/ivy/ivy.xml to
$NUTCH_HOME/src/plugin/protocol-selenium/ivy.xml:
{code}
+ <!-- begin selenium dependencies -->
+ <dependency org="org.seleniumhq.selenium" name="selenium-java" rev="2.44.0" />
+
+ <dependency org="com.opera" name="operadriver" rev="1.5">
+   <exclude org="org.seleniumhq.selenium" name="selenium-remote-driver" />
+ </dependency>
+ <!-- end selenium dependencies -->
{code}
You can consult parse-tika for an example implementation. Please also see plugin.xml.
* You can also remove
src/plugin/protocol-selenium/src/target/classes/org/apache/nutch/protocol/htmlunit/package.html
Once this is done I am +1 for this patch.
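For reference, here is a rough sketch of what the plugin-level ivy.xml might look like after the move. Only the two dependency entries come from the patch. The surrounding module layout, the relative include path and the conf mapping are assumptions based on how other Nutch plugins lay out their ivy.xml, so please check them against parse-tika before relying on them.
{code}
<?xml version="1.0" ?>
<!-- Sketch only: everything outside the selenium dependency block is assumed, not taken from the patch -->
<ivy-module version="1.0">
  <info organisation="org.apache.nutch" module="${ant.project.name}"/>

  <configurations>
    <!-- Assumed: reuse the shared configurations from $NUTCH_HOME/ivy -->
    <include file="../../../ivy/ivy-configurations.xml"/>
  </configurations>

  <dependencies>
    <!-- begin selenium dependencies (from the patch; conf="*->default" is an assumption) -->
    <dependency org="org.seleniumhq.selenium" name="selenium-java" rev="2.44.0" conf="*->default"/>

    <dependency org="com.opera" name="operadriver" rev="1.5" conf="*->default">
      <exclude org="org.seleniumhq.selenium" name="selenium-remote-driver"/>
    </dependency>
    <!-- end selenium dependencies -->
  </dependencies>
</ivy-module>
{code}
Keeping these entries in the plugin's own ivy.xml means the Selenium jars are resolved only when protocol-selenium is built, rather than for every Nutch build.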
> nutch-selenium plugin
> ---------------------
>
> Key: NUTCH-1933
> URL: https://issues.apache.org/jira/browse/NUTCH-1933
> Project: Nutch
> Issue Type: New Feature
> Components: protocol
> Reporter: Mo Omer
> Assignee: Lewis John McGibbney
> Fix For: 1.10
>
> Attachments: NUTCH-selenium-trunk.patch,
> NUTCH-selenium-trunk.v2.1.patch, NUTCH-selenium-trunk.v2.patch
>
>
> I updated the [nutch-selenium|https://github.com/momer/nutch-selenium]
> plugin to run against trunk.
> I feel that there is still a good bit of work to be done here; however, early
> testing on my system indicates that it works.