I apologize for the miss-send.

Hi all,

Here is my Mid-term update for the GSoC project "Map-Reduce support for Apache Tuscany." I will briefly explain the progress I have made so far, as well as the next steps I will take during the second phase of the program.

Progress thus far:

I have spent time getting to know the Tuscany code base. I have gone through the POJO sample implementation and have a good understanding of what is needed in order to complete a component implementation.

I have spent time getting to know the Hadoop Code base. The current Map-Reduce API supported by Hadoop uses a JAR file to pass the necessary classes from node to node. This requires all the functions to be put in a JAR file before the job is submitted. I have been looking through the code base to see if there is a way to execute a Map-Reduce job without a JAR file. I have not found a way so far. I have also created two test functions that interact with Hadoop and submit Map-Reduce jobs without the use of a shell script.

This first phase was marked by a large amount of reading documentation as well as code.

Plans for the second phase:

Setting aside Hadoop for the moment, the first step will be to represent the Map-Reduce programing model as a composite of java components using a simple example. This will be a good step in establishing how the model will be represented in Tuscany. I will then be able to receive feedback on wether the design is simple enough to make Map-Reduce programing easier, as well as flexible enough to handle important use cases. Once an acceptable design is implemented, I will work to create component types for each java component in the composite. Once the basic types are implemented, I will revisit Hadoop and start the integration with HDFS.

Finally, I would like to thank my mentor Jean-Sebastien, Luciano, and Raymond, as well as the rest of the Tuscany community, for their tremendous support during the first phase of the program.

Any questions or comments are always welcome.

Chris Trezzo



On Jul 12, 2008, at 12:30 AM, Chris Trezzo wrote:

Hi all,

Here is my Mid-term update for the GSoC project "Map-Reduce support for Apache Tuscany." I will briefly explain the progress I have made so far, as well as the next steps I will take during the second portion of the program.

Progress thus far:



On Jul 9, 2008, at 9:25 AM, Luciano Resende wrote:

To help Mentors with mid-term evaluation, I'd like to suggest students
to send a mid-term update describing high level
progress/acomplishments towards the original project proposal.

Mentors : Any thoughts on this ? other suggestions ?

--
Luciano Resende
Apache Tuscany Committer
http://people.apache.org/~lresende
http://lresende.blogspot.com/


Reply via email to