Using Solr block joins would probably be the easiest approach these days,
unless you really need to process the pages in Nutch. If you still want to
process them simultaneously, you can write a custom Solr
UpdateRequestProcessor plugin and build the logic there.
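
To make the block join idea a bit more concrete, here is a rough sketch of how
a pair of pages might be indexed as one block and queried back. It assumes
Solr's JSON update format with nested `_childDocuments_` and the `{!parent}`
block join query parser. The field names `doc_type` and `content`, the example
URLs, and the helper functions are made up for illustration and are not
anything Nutch produces out of the box.

```python
import json


def make_pair_block(parent_url, parent_text, child_url, child_text):
    """Build one Solr block: a parent page with its paired page nested as a child."""
    return {
        "id": parent_url,
        "doc_type": "parent",  # hypothetical marker field for parent documents
        "content": parent_text,
        "_childDocuments_": [
            {
                "id": child_url,
                "doc_type": "child",  # hypothetical marker field for child documents
                "content": child_text,
            }
        ],
    }


def parent_query(child_query, parent_filter="doc_type:parent"):
    """Return a block join query that yields parents whose children match child_query."""
    return '{!parent which="%s"}%s' % (parent_filter, child_query)


# Example pair; the URLs and text are placeholders.
block = make_pair_block(
    "http://example.com/a", "first page of the pair",
    "http://example.com/b", "second page of the pair",
)

# JSON body that could be posted to a core's update handler.
payload = json.dumps([block])

# Query string that could be passed as the q parameter.
query = parent_query("content:pair")
```

Both pages have to be sent in the same block for the join to work, so the
pairing still has to be known at indexing time.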
 
-----Original message-----
> From:Lewis John Mcgibbney <[email protected]>
> Sent: Wednesday 26th November 2014 0:10
> To: [email protected]
> Subject: Re: Processing Pages in Pairs
> 
> Hi Iain,
> 
> On Tue, Nov 25, 2014 at 2:44 PM, <[email protected]> wrote:
> 
> >
> >
> > What would you recommend in this situation?  Are there other options that I
> > am missing?
> 
> 
> I think that our good friend Markus has previously provided some insight
> into the technical implementation of a task which may be synonymous with
> what you are trying to achieve.
> http://www.mail-archive.com/user%40nutch.apache.org/msg04695.html
> Sounds pretty hands on to me, it would be difficult to keep your version of
> Nutch up-to-date with trunk if you were doing that.
> hth
> Lewis
> 
