On 17/03/11 07:05, Matthew John wrote:
Hi,

Can someone give me some pointers on the following details of the
Hadoop code base:

1) A breakdown of the HDFS code base (approximate lines of code) into
the following modules:
          - HDFS at the Datanodes
          - Namenode
          - Zookeeper
          - MapReduce based
          - Any other relevant split

2) A breakdown of the HBase code into the following modules:
          - HMaster
          - RegionServers
          - MapReduce
          - Any other relevant split


You are free to check out the source code and run whatever analysis you want on it. The entire SVN history is available too, so you can do some really interesting analysis of how the code has changed over time, especially if you have data mining tooling to hand, such as a small Hadoop cluster.
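
As a starting point for the line counts, here is a rough sketch in Python. It assumes you already have a checkout on local disk and that you only care about .java files; the function name count_java_lines and its depth parameter are made up for this example and are not part of any Hadoop tooling. It counts raw lines (comments and blank lines included) and groups them by directory, so how well the groups match "Namenode", "Datanode" and so on depends entirely on how the tree you checked out is laid out.

```python
import os
import sys
from collections import Counter


def count_java_lines(root, depth=1):
    """Count raw lines in .java files under root.

    Lines are grouped by the first `depth` path components below root
    (hypothetical grouping rule; adjust it to the tree you checked out).
    Files sitting directly in root are grouped under ".".
    """
    totals = Counter()
    for dirpath, _dirnames, filenames in os.walk(root):
        rel = os.path.relpath(dirpath, root)
        group = os.sep.join(rel.split(os.sep)[:depth])
        for name in filenames:
            if not name.endswith(".java"):
                continue
            path = os.path.join(dirpath, name)
            # errors="replace" keeps the count going on odd encodings
            with open(path, encoding="utf-8", errors="replace") as fh:
                totals[group] += sum(1 for _ in fh)
    return totals


if __name__ == "__main__":
    # Usage: python count_lines.py <path-to-checkout> [depth]
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    depth = int(sys.argv[2]) if len(sys.argv) > 2 else 1
    for group, lines in count_java_lines(root, depth).most_common():
        print(f"{lines:10d}  {group}")
```

Raising depth gives a finer split once you know which sub-directories correspond to the modules you are interested in. For anything involving the SVN history you would need to repeat this per revision, which is where a cluster starts to be useful.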
