Hi Folks,

Hadoop 2 support is ready for Nutch 2.x; I am just waiting for Gora 0.6. My ideas:

Support for Sitemap, Jsoup (HTML5 parser), and RDF Microformats would be good.

Talat


2015-02-05 13:03 GMT+02:00 Markus Jelsma <[email protected]>:
> Well, Hadoop 2.x sounds right indeed!
>
> -----Original message-----
> From: Julien Nioche<[email protected]>
> Sent: Thursday 5th February 2015 1:34
> To: [email protected]
> Subject: Re: GSoC 2015
>
> Moving to Hadoop 2.x ?
>
> On 4 February 2015 at 14:42, Lewis John Mcgibbney <[email protected]> wrote:
>
> Hi Folks,
>
> Does anyone have any good ideas for GSoC?
>
> Seb mentioned moving Nutch towards Spark so potentially a pluggable runtime 
> execution engine abstraction?
>
> I am currently working on a lot of security and authentication related work 
> so I would possibly be tempted to overhaul and improve that aspect of Nutch.
>
> Any other ideas?
>
> Thanks folks
> Lewis
>
> --
>
> Lewis
>
> --
>
> Open Source Solutions for Text Engineering
>
> http://digitalpebble.blogspot.com/
> http://www.digitalpebble.com
> http://twitter.com/digitalpebble
>
>



-- 
Talat UYARER
Website: http://talat.uyarer.com
Twitter: http://twitter.com/talatuyarer
Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304
