[ 
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12471231
 ] 

Dogacan Güney commented on NUTCH-443:
-------------------------------------

Here is a very initial patch. It is entirely untested and only changes stuff 
under src/java(so, code won't even compile:).

I am posting this, because while what we change here is trivial, it is also 
very intrusive. (I mean this patch is almost 
700 lines long, and it doesn't even change the plugins). So, I hope that this 
patch can get some early review, 
suggestions and corrections.

I will post the necessary changes to plugins too, as soon as I can.

> allow parsers to return multiple Parse object, this will speed up the rss 
> parser
> --------------------------------------------------------------------------------
>
>                 Key: NUTCH-443
>                 URL: https://issues.apache.org/jira/browse/NUTCH-443
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 0.9.0
>            Reporter: Renaud Richardet
>            Priority: Minor
>             Fix For: 0.9.0
>
>         Attachments: parse-map-core-untested.patch
>
>
> allow Parser#parse to return a Map<String,Parse>. This way, the RSS parser 
> can return multiple parse objects, that will all be indexed separately. 
> Advantage: no need to fetch all feed-items separately.
> see the discussion at 
> http://www.nabble.com/RSS-fecter-and-index-individul-how-can-i-realize-this-function-tf3146271.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to