Re: Resources for understanding Hadoop

2022-06-23 Thread Brahma Reddy Battula
Please go through the following https://github.com/apache/hadoop/blob/trunk/BUILDING.txt and a specific command to generate the distribution which can be run after your changes. mvn package -Pdist -DskipTests -Dtar -Dmaven.javadoc.skip=true Hope this helps. On Thu, Jun 23, 2022 at 9:41 PM Ra

Re: Resources for understanding Hadoop

2022-06-23 Thread Rahul Bhardwaj
I am following this wiki to build and run hadoop locally in pseudo-dsitributed mode. But I am unable to figure out how to build my changes and generate similar binaries so that I can test my changes locall

Re: Resources for understanding Hadoop

2022-06-12 Thread Brahma Reddy Battula
Hi Rahul, Welcome to hadoop world. Apart from the gautham mentioned, you can check the following also. https://livebook.manning.com/book/hadoop-in-action/part-1/ Go through the following wiki for contributions https://cwiki.apache.org/confluence/display/HADOOP/How+To+Contribute Please subscrib

Re: Resources for understanding Hadoop

2022-06-12 Thread Gautham Banasandra
Hi Rahul, I was looking for something more detailed and low-level like how the code > for the various services in HDFS is organized, entrypoints etc. I found this book useful to get a good idea of Hadoop in general - Apache Hadoop™ YARN: Moving beyond MapReduce and Batch Processing with Apache Ha

Resources for understanding Hadoop

2022-06-12 Thread Rahul Bhardwaj
Hi all, I am a newbie wanting to start contributing to the hadoop ecosystem. I want to start by contributing to HDFS and was looking for resources to understand the architecture and I just found this - https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html which is a