[Hadoop Wiki] Update of "GitAndHadoop" by SteveLoughran

Apache Wiki Fri, 27 Nov 2009 07:26:37 -0800

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change 
notification.


The "GitAndHadoop" page has been changed by SteveLoughran.
The comment on this change is: Page on Git.
http://wiki.apache.org/hadoop/GitAndHadoop

--------------------------------------------------

New page:
= Git And Hadoop =

A lot of people use Git with Hadoop because they have their own patches to make 
to Hadoop, and Git helps them manage those patches.

 * GitHub provides some good lessons on Git at [[http://learn.github.com]]
 * Apache serves up read-only Git versions of its source at 
[[http://git.apache.org/]]. People cannot commit changes with Git; instead, 
patches need to be applied to the SVN repositories.

== Before you begin ==

You need a copy of Git on your system. Some IDEs ship with Git support; this 
page assumes you are using the command line.

== Checking out the source ==

The first step is to create your own Git repository from the Apache 
repositories. There are separate repositories for all the different Hadoop 
sub-projects; this page looks at the core filesystem and MapReduce engine.

Make your base hadoop directory:
{{{
mkdir hadoop
}}}
Change into this directory:
{{{
cd hadoop
}}}
Create "clones" of the Apache Git repositories:
{{{
git clone git://git.apache.org/hadoop-common.git
git clone git://git.apache.org/hadoop-hdfs.git
git clone git://git.apache.org/hadoop-mapreduce.git
}}}
The total download is well over 100MB, so the initial checkout process works 
best when the network is fast. Once downloaded, Git works offline.
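
The three clone commands above can also be wrapped in a small loop. The following is a minimal sketch, not part of the original page: the function name `clone_all` and the `DRY_RUN` variable are hypothetical, introduced only for illustration. It skips any repository that already has a `.git` directory, so it can be re-run from the base hadoop directory after an interrupted download.

```shell
#!/bin/sh
# Sketch only: clone_all and DRY_RUN are illustrative names, not part of Git or Hadoop.
# The repository names and base URL are the ones listed on this page.
repos="hadoop-common hadoop-hdfs hadoop-mapreduce"
base_url="git://git.apache.org"

clone_all() {
    for repo in $repos; do
        if [ -d "$repo/.git" ]; then
            # A previous run already created this clone; leave it alone.
            echo "skip $repo (already cloned)"
        elif [ -n "${DRY_RUN:-}" ]; then
            # Dry run: print the command instead of touching the network.
            echo "git clone $base_url/$repo.git"
        else
            # Stop at the first failed clone so the error is not hidden.
            git clone "$base_url/$repo.git" || return 1
        fi
    done
}
```

Running `DRY_RUN=1 clone_all` prints the commands without downloading anything; running `clone_all` on its own performs the clones.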
