I've stuck up a little slideset with some thoughts on what we could be doing to get tighter engagement between universities and the Hadoop codebase
http://www.slideshare.net/steve_l/hadoop-and-universities

There are various levels of engagement

- teaching MapReduce and other datacentre-scale coding techniques.
This could be done by getting involved with the undergrad/grad teachers, helping with the lectures and the coursework. Remember, most universities do welcome outside lecturers giving guest talks to the students.
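For the teaching angle, the core idea is small enough to show on one slide. Here's a toy sketch of the map/reduce pattern in plain Java, with no Hadoop dependency; the class and method names are mine, not Hadoop's API, and a HashMap stands in for the shuffle that a real cluster does between the map and reduce phases:

```java
import java.util.HashMap;
import java.util.Map;

// Toy word count: "map" emits a (word, 1) pair per token, "reduce"
// sums the counts per key. On a real cluster the map and reduce
// steps run on different machines, with a shuffle/sort in between;
// here the HashMap plays the role of that shuffle.
public class WordCountSketch {

    public static Map<String, Integer> wordCount(String[] lines) {
        Map<String, Integer> counts = new HashMap<String, Integer>();
        for (String line : lines) {                      // each mapper gets some lines
            for (String word : line.toLowerCase().split("\\s+")) {
                if (word.isEmpty()) {
                    continue;
                }
                Integer n = counts.get(word);            // the "reduce": sum per key
                counts.put(word, n == null ? 1 : n + 1);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        String[] lines = { "hadoop at university", "teaching hadoop" };
        System.out.println(wordCount(lines).get("hadoop")); // prints 2
    }
}
```

The point for students is that because map is per-record and reduce is per-key, the same program scales from this HashMap to thousands of nodes without changing the logic.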

- encourage scientific computation to be done on top of Hadoop. There's one small problem there: cluster time. That means people with access to datacentres with CPU and storage to spare need to lend a hand here, or we help get Hadoop up on the existing clusters the physicists run. [see http://www.slideshare.net/steve_l/hadoop-hep for some plans here]

- encourage people doing maths and CS work to do it on Hadoop.
The plugins for scheduling and placement are a good low-risk place where we can get people involved; the other area of interest is the stuff on top, such as graphs and other leading-edge work.
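To give a flavour of why scheduling/placement is a nice self-contained research problem: the recurring decision is "given the nodes with a free slot, which one should run this task?". A heavily simplified sketch, in plain Java with made-up names (this is not Hadoop's plugin API), of a data-locality-first choice:

```java
import java.util.List;
import java.util.Set;

// Hypothetical sketch of the decision a placement plugin makes:
// prefer a node that already holds a replica of the task's input
// block, so the computation moves to the data rather than the data
// to the computation. Names and signatures are illustrative only.
public class LocalityPlacementSketch {

    // candidates: nodes that currently have a free task slot
    // replicaHolders: nodes holding a replica of the input split
    public static String pickNode(List<String> candidates,
                                  Set<String> replicaHolders) {
        for (String node : candidates) {
            if (replicaHolders.contains(node)) {
                return node;                  // data-local: the best case
            }
        }
        // No data-local slot free; fall back to any free node. A real
        // scheduler would try rack-local before going fully remote.
        return candidates.isEmpty() ? null : candidates.get(0);
    }

    public static void main(String[] args) {
        String chosen = pickNode(List.of("node1", "node2", "node3"),
                                 Set.of("node2", "node7"));
        System.out.println(chosen);           // node2 holds a replica
    }
}
```

Everything interesting for researchers lives in variations on that loop: fairness across users, waiting briefly for a local slot rather than taking a remote one, placement under failure, and so on.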

- get some of the people doing work in this area to talk at ApacheCon. Come on, you want to know how to debug a particle accelerator experiment that generates 1 PB/month of data, of which only 50 events/year are actually interesting.

Term-time is rushing up, so now might be a good time for Apache to get ready to bring the universities on board.

I'm going to start by proposing we create a hadoop-research list, for people doing more researchy stuff on top of and inside Hadoop. We can then start identifying who is interested in this area, which academic and industrial people are involved and where they are, and start meeting up. We had a good little workshop at Bristol University last month (http://wiki.apache.org/hadoop/BristolHadoopWorkshop); it was good to get the different groups together.

Thoughts?

-steve
