Hi Folks, Hadoop 2 support is ready for Nutch 2.x. I just wait Gora 0.6. My ideas,
Sitemap, Jsoup (HTML5 parser) , RDF Microformats Supports would be good. Talat 2015-02-05 13:03 GMT+02:00 Markus Jelsma <[email protected]>: > Well, Hadoop 2.x sounds right indeed! > > -----Original message----- > From: Julien Nioche<[email protected]> > Sent: Thursday 5th February 2015 1:34 > To: [email protected] > Subject: Re: GSoC 2015 > > Moving to Hadoop 2.x ? > > On 4 February 2015 at 14:42, Lewis John Mcgibbney <[email protected] > <mailto:[email protected]>> wrote: > > Hi Folks, > > Does anyone have any good ideas for GSoC? > > Seb mentioned moving Nutch towards Spark so potentially a pluggable runtime > execution engine abstraction? > > I am currently working on a lot of security and authentication related work > so I would possibly be tempted to overhaul and improve that aspect of Nutch. > > Any other ideas? > > Thanks folks > Lewis > > -- > > Lewis > > -- > > Open Source Solutions for Text Engineering > > http://digitalpebble.blogspot.com/ > <http://digitalpebble.blogspot.com/>http://www.digitalpebble.com > <http://www.digitalpebble.com> > http://twitter.com/digitalpebble <http://twitter.com/digitalpebble> > > -- Talat UYARER Websitesi: http://talat.uyarer.com Twitter: http://twitter.com/talatuyarer Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304

