Hi Markus That's ok for me. I thought it would be good share data between different attempts of the same task (e.g. loaded plugins, rules of robot.txt, etc). That was just an idea. Thanks a lot for your answer.
Regards ----- Original Message ----- > From: "Markus Jelsma" <[email protected]> > To: [email protected] > Sent: Wednesday, January 11, 2017 6:00:37 PM > Subject: RE: [MASSMAIL]RE: Distributed cache > > Hello, > > This is in general a not-so-good idea to have in Hadoop world, where jobs > should be idempotent. Nutch will probably never have a distributed cache > because it is, in a plugin, so easy to implement yourself. And because it > would require a second set of daemons to manage the cache. Daemons like > Redis of Memcache already provide such a feature and are good at it. > > Implementing it in a plugin is always custom work, whether Nutch supports it > out-of-the-box, or not. Just connect your a client, set or get an item and > you are good to go. Memcache and Redis (and many others) are a perfect fit > for this job. > > Hadoop itself offers a distributed cache, but it is nothing more than just > sending the code and data to all nodes, still allowing it to be idempotent > because each node has the data and code. The Nutch code itself cannot run > (correct me if i am wrong) without Hadoop's distributed cache. > > Markus > > > -----Original message----- > > From:Roannel Fernández Hernández <[email protected]> > > Sent: Wednesday 11th January 2017 15:58 > > To: [email protected] > > Subject: Re: [MASSMAIL]RE: Distributed cache > > > > Hi > > > > Thanks for your advice. But now, what do you think if Nutch had a > > distributed cache out-of-the-box? I mean, what you propose to me, I think > > could be native to Nutch. > > > > Regards > > > > > > ----- Original Message ----- > > > From: "Markus Jelsma" <[email protected]> > > > To: [email protected] > > > Sent: Wednesday, January 11, 2017 10:32:51 AM > > > Subject: RE: [MASSMAIL]RE: Distributed cache > > > > > > Hi - just implement it in Java code in the plugins where you need it. > > > It's > > > just a simple telnet protocol or there are plenty of memcached clients > > > for > > > Java. > > > > > > Markus > > > > > > -----Original message----- > > > > From:Roannel Fernández Hernández <[email protected]> > > > > Sent: Wednesday 11th January 2017 13:38 > > > > To: [email protected] > > > > Subject: Re: [MASSMAIL]RE: Distributed cache > > > > > > > > Hi Markus: > > > > > > > > How can I use Memcached with Nutch? Do you know any way to do that? > > > > > > > > Thanks a lot. > > > > > > > > ----- Original Message ----- > > > > > From: "Markus Jelsma" <[email protected]> > > > > > To: [email protected] > > > > > Sent: Wednesday, January 11, 2017 7:50:31 AM > > > > > Subject: [MASSMAIL]RE: Distributed cache > > > > > > > > > > Hello - i think memcached is the easiest method to have something > > > > > cached > > > > > and > > > > > available everywhere. > > > > > Markus > > > > > > > > > > > > > > > > > > > > -----Original message----- > > > > > > From:Roannel Fernández Hernández <[email protected]> > > > > > > Sent: Wednesday 11th January 2017 3:03 > > > > > > To: [email protected] > > > > > > Subject: Distributed cache > > > > > > > > > > > > Hi folks: > > > > > > > > > > > > I need to share resources (values into variables) between deferent > > > > > > jobs, > > > > > > even could be useful between task in different nodes of a Hadoop > > > > > > cluster. > > > > > > There is something like a distributed cache or something like that > > > > > > in > > > > > > Nutch right now. > > > > > > > > > > > > To sum up I need something to share data into different task, jobs > > > > > > or > > > > > > even > > > > > > hosts. > > > > > > > > > > > > Regards > > > > > > > > > > > > > > > > > ----------- > > > > > > La @universidad_uci es Fidel. Los jóvenes no fallaremos > > > > > > > > > > > #HastaSiempreComandante > > > > > > > > > > > #HastalaVictoriaSiempre > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > La @universidad_uci es Fidel. Los jóvenes no fallaremos. > > > > #HastaSiempreComandante > > > > #HastalaVictoriaSiempre > > > > > > > > > > > > > La @universidad_uci es Fidel. Los jóvenes no fallaremos. > > #HastaSiempreComandante > > #HastalaVictoriaSiempre > > > > > La @universidad_uci es Fidel. Los jóvenes no fallaremos. #HastaSiempreComandante #HastalaVictoriaSiempre

