[ https://issues.apache.org/jira/browse/NUTCH-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14933845#comment-14933845 ]
Asitang Mishra commented on NUTCH-2110:
---------------------------------------
I think the question is whether to keep everything under a single URL in the
end (which is how it works in practice) or under some newly constructed URL. I
am not sure whether the data ultimately needs to be separated into distinct
parts. This needs more thought.
Meanwhile, I created two more sub-tasks that do more specific things by passing
standardized key-value pairs to the injector. Let us focus on them for now and
then come back to this issue, which is a little abstract.
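As a rough illustration of the key-value idea: the Nutch injector already
accepts tab-separated name=value metadata after the URL on each seed line. The
sketch below parses a line in that shape. The key names selenium.xpath and
selenium.searchTerm, the example URL, and the parse_seed_line helper are
hypothetical and are not taken from the sub-tasks.

```python
def parse_seed_line(line):
    """Split one seed line into (url, metadata).

    Assumes the layout: URL, then zero or more tab-separated name=value
    fields. Fields without '=' are ignored.
    """
    fields = line.rstrip("\n").split("\t")
    url = fields[0].strip()
    metadata = {}
    for field in fields[1:]:
        if "=" not in field:
            continue
        # Split on the first '=' only, since an XPath value may contain '='.
        key, value = field.split("=", 1)
        metadata[key.strip()] = value.strip()
    return url, metadata


# Hypothetical seed: the selenium.* keys are illustrative, not Nutch settings.
seed = (
    "http://example.com/search"
    "\tselenium.xpath=//input[@name='q']"
    "\tselenium.searchTerm=nutch"
)
url, meta = parse_seed_line(seed)
print(url)
print(meta)
```

With this shape the page keeps its single real URL, and the Selenium-specific
steps travel alongside it as metadata rather than being encoded into a new URL.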
> Create the capability to provide seeds in the form of "url+xpath(including
> option to enter search terms).selenium"
> ------------------------------------------------------------------------------------------------------------------
>
> Key: NUTCH-2110
> URL: https://issues.apache.org/jira/browse/NUTCH-2110
> Project: Nutch
> Issue Type: Sub-task
> Components: fetcher
> Affects Versions: 1.10
> Reporter: Asitang Mishra
> Labels: memex
>
> Create the capability to provide seeds in the form of "url+xpath(including
> option to enter search terms).selenium", to be used by Selenium
> protocols/plugins as URLs/flows to reach a specific AJAX-based page or to
> save the state of a Selenium operation for the next fetching round.