Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Books" page has been changed by SteveLoughran:
https://wiki.apache.org/hadoop/Books?action=diff&rev1=26&rev2=27

Comment:
rm all packt publishing trackback args in their book URLs, cut the flume and hive books.

  == Hadoop Books ==
-
  These books are listed in order of publication, most recent first. The Apache Software Foundation does not endorse any specific book. The links to Amazon are affiliated with the specific author. That said, we also encourage you to support your local bookshops, by buying the book from any local outlet, especially independent ones.
  == Books in Print ==
-
  Here are the books that are currently in print -in order of publishing-, along with the Hadoop version they were written against. One problem anyone writing a book will encounter is that Hadoop is a very fast-moving target, and that things can change fast. Usually this is for the better, when a book says "Hadoop can't" they really mean "the version of Hadoop we worked with couldn't", and that the situation may have improved since then. If you have any query about Hadoop, don't be afraid to ask on the relevant user mailing lists.
-
  {{{#!wiki comment/dotted
  Attention people adding new entries.
@@ -15, +12 @@
  # Please write this in a neutral voice, not "this book will help you", as that implies that the ASF has opinions on the matter. Someone will just edit the claims out.
  # Please do not go overboard in exaggerating the outcome of reading a book, "readers of this book will become experts in advanced production-scale Hadoop MapReduce jobs". Such claims will be edited out.
+ # Please don't have tracking URLs. We'll only cut them.
  }}}
+ === YARN Essentials ===
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/yarn-essentials/|YARN Essentials]]
+ '''Authors:''' Amol Fasale, Nirmal Kumar
-
- === Apache Flume: Distributed Log Collection for Hadoop - Second Edition ===
-
- '''Name:''' [[https://www.packtpub.com/application-development/apache-flume-distributed-log-collection-hadoop-second-edition/?utm_source=Pgwiki.apache.org&utm_medium=pod&utm_campaign=1784392170|Apache Flume: Distributed Log Collection for Hadoop - Second Edition]]
-
- '''Author:''' Steve Hoffman
  '''Publisher:''' Packt Publishing
  '''Date of Publishing:''' February, 2015
- Apache Flume: Distributed Log Collection for Hadoop - Second Edition is for Hadoop programmers who want to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner.
+ YARN Essentials is for developers with little knowledge of Hadoop 1.x and want to start afresh with YARN.
- === YARN Essentials ===
+ === Learning YARN ===
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/learning-yarn/|Learning YARN]]
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/yarn-essentials/?utm_source=PGwiki.apache.org&utm_medium=pod&utm_campaign=1784391735|YARN Essentials]]
+ '''Authors:''' Akhil Arora, Shrey Mehrotra
- '''Authors:''' Amol Fasale, Nirmal Kumar
+ '''Publisher:''' Packt Publishing
+
+ '''Date of Publishing:''' August, 2015
+
+ Learning YARN is intended for those who want to understand what YARN is and how to efficiently use it for the resource management of large clusters.
+
+
+ === Big Data Forensics: Learning Hadoop Investigations ===
+ '''Name:''' [[https://www.packtpub.com/networking-and-servers/big-data-forensics-learning-hadoop-investigations/|Big Data Forensics: Learning Hadoop Investigations]]
+
+ '''Author:''' Joe Sremack
+
+ '''Publisher:''' Packt Publishing
+
+ '''Date of Publishing:''' August, 2015
+
+ Big Data Forensics: Learning Hadoop Investigations will guide statisticians and forensic analysts with basic knowledge of digital forensics to conduct Hadoop forensic investigations.
+
+ === Learning Hadoop 2 ===
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/learning-hadoop/|Learning Hadoop 2]]
+
+ '''Authors:''' Garry Turkington, Gabriele Modena
  '''Publisher:''' Packt Publishing
  '''Date of Publishing:''' February, 2015
- YARN Essentials is for developers with little knowledge of Hadoop 1.x and want to start afresh with YARN.
+ Learning Hadoop 2 is an introduction guide to building data-processing applications with the wide variety of tools supported by Hadoop 2.
- === Learning YARN ===
+ === Hadoop MapReduce v2 Cookbook - Second Edition ===
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/hadoop-mapreduce-v2-cookbook-second-edition/|Hadoop MapReduce v2 Cookbook - Second Edition]]
+ '''Authors:''' Thilina Gunarathne
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/learning-yarn/?utm_source=PGwiki.apache.org&utm_medium=pod&utm_campaign=1784393967|Learning YARN]]
-
- '''Authors:''' Akhil Arora, Shrey Mehrotra
-
- '''Publisher:''' Packt Publishing
-
- '''Date of Publishing:''' August, 2015
-
- Learning YARN is intended for those who want to understand what YARN is and how to efficiently use it for the resource management of large clusters.
-
- === Apache Hive Essentials ===
-
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/apache-hive-essentials/?utm_source=PGwiki.apache.org&utm_medium=pod&utm_campaign=1783558571|Apache Hive Essentials]]
-
- '''Author:''' Dayong Du
  '''Publisher:''' Packt Publishing
  '''Date of Publishing:''' February, 2015
- Apache Hive Essentials is for data analysts and developers who want to use Hive to explore and analyze data in Hadoop.
-
- === Big Data Forensics: Learning Hadoop Investigations ===
-
- '''Name:''' [[https://www.packtpub.com/networking-and-servers/big-data-forensics-learning-hadoop-investigations/?utm_source=PGwiki.apache.org&utm_medium=pod&utm_campaign=1785288105|Big Data Forensics: Learning Hadoop Investigations]]
-
- '''Author:''' Joe Sremack
-
- '''Publisher:''' Packt Publishing
-
- '''Date of Publishing:''' August, 2015
-
- Big Data Forensics: Learning Hadoop Investigations will guide statisticians and forensic analysts with basic knowledge of digital forensics to conduct Hadoop forensic investigations.
-
- === Learning Hadoop 2 ===
-
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/learning-hadoop/?utm_source=POD&utm_medium=referral&utm_campaign=1783285516|Learning Hadoop 2]]
-
- '''Authors:''' Garry Turkington, Gabriele Modena
-
- '''Publisher:''' Packt Publishing
-
- '''Date of Publishing:''' February, 2015
-
- Learning Hadoop 2 is an introduction guide to building data-processing applications with the wide variety of tools supported by Hadoop 2.
-
- === Hadoop MapReduce v2 Cookbook - Second Edition ===
-
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/hadoop-mapreduce-v2-cookbook-second-edition/?utm_source=POD&utm_medium=referral&utm_campaign=1783285478|Hadoop MapReduce v2 Cookbook - Second Edition]]
-
- '''Authors:''' Thilina Gunarathne
-
- '''Publisher:''' Packt Publishing
-
- '''Date of Publishing:''' February, 2015
-
  Hadoop MapReduce v2 Cookbook - Second Edition is a beginner's guide to explore the Hadoop MapReduce v2 ecosystem to gain insights from very large datasets.
  === Scaling Big Data with Hadoop and Solr - Second Edition ===
-
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/scaling-big-data-hadoop-and-solr-second-edition/?utm_source=POD&utm_medium=referral&utm_campaign=1783553391|Scaling Big Data with Hadoop and Solr - Second Edition]]
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/scaling-big-data-hadoop-and-solr-second-edition/|Scaling Big Data with Hadoop and Solr - Second Edition]]
  '''Authors:''' Hrishikesh Vijay Karambelkar
@@ -117, +84 @@
  Scaling Big Data with Hadoop and Solr - Second Edition is aimed at developers, designers, and architects who would like to build big data enterprise search solutions for their customers or organizations
  === Hadoop for Finance Essentials ===
-
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/hadoop-finance-essentials/?utm_source=POD&utm_medium=referral&utm_campaign=1784395161|Hadoop for Finance Essentials]]
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/hadoop-finance-essentials/|Hadoop for Finance Essentials]]
  '''Authors:''' Rajiv Tiwari
@@ -129, +95 @@
  Hadoop for Finance Essentials is for developers who would like to perform big data analytics with Hadoop for the financial sector.
  === Monitoring Hadoop ===
-
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/monitoring-hadoop/?utm_source=POD&utm_medium=referral&utm_campaign=1783281553|Monitoring Hadoop]]
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/monitoring-hadoop/|Monitoring Hadoop]]
  '''Authors:''' Gurmukh Singh
@@ -141, +106 @@
  Monitoring Hadoop is for Hadoop administrators who want to learn how to monitor and diagnose their clusters.
  === Hadoop Backup and Recovery Solutions ===
-
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/hadoop-backup-and-recovery-solutions/?utm_source=POD&utm_medium=referral&utm_campaign=178328904X|Hadoop Backup and Recovery Solutions]]
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/hadoop-backup-and-recovery-solutions/|Hadoop Backup and Recovery Solutions]]
  '''Authors:''' Gaurav Barot, Chintan Mehta, Amij Patel
@@ -155, +119 @@
  Hadoop Backup and Recovery Solutions demonstrates the strategies for data recovery from Hadoop backup clusters and troubleshoot problems.
  === Hadoop Essentials ===
-
- '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/hadoop-essentials/?utm_source=POD&utm_medium=referral&utm_campaign=1784396680|Hadoop Essentials]]
+ '''Name:''' [[https://www.packtpub.com/big-data-and-business-intelligence/hadoop-essentials/|Hadoop Essentials]]
  '''Authors:''' Shiva Achari
@@ -169, +132 @@
  Hadoop Essentials explains the key concepts of Hadoop and gives a thorough understanding of the Hadoop ecosystem.
  == Hadoop in Practice, Second Edition ==
-
  '''Name:''' [[http://www.manning.com/holmes2/|Hadoop in Practice, Second Edition]]
  '''Author:''' Alex Holmes
@@ -185, +147 @@
  The second edition of Hadoop in Practice includes over 100 Hadoop techniques. This edition covers Hadoop 2 (YARN and MapReduce 2) and updates include new techniques that show how to integrate Kafka, Impala, and Spark SQL with Hadoop.
  === Optimizing Hadoop for MapReduce ===
-
  '''Name:''' [[http://www.packtpub.com/learn-to-implement-and-use-hadoop-mapreduce-framework/book|Optimizing Hadoop for MapReduce]]
  '''Author:''' Khaled Tannir
@@ -199, +160 @@
  Optimizing Hadoop for !MapReduce book is an example-based tutorial that deals with Optimizing Hadoop for !MapReduce job performance.
  === Scaling Big Data with Hadoop and Solr ===
-
  '''Name:''' [[http://www.packtpub.com/scaling-big-data-with-hadoop-and-solr/book|Scaling Big Data with Hadoop and Solr]]
  '''Author:''' Hrishikesh Karambelkar
@@ -212, +172 @@
  Scaling Big Data with Hadoop and Solr is a step-by-step guide to building a search engine while scaling data. Starting with the basics of Apache Hadoop and Solr, this book then dives into advanced topics of optimizing search with some real-world use cases and sample Java code.
-
  === Hadoop Operations and Cluster Management Cookbook ===
-
  '''Name:''' [[http://www.packtpub.com/hadoop-operations-and-cluster-management-cookbook/book|Hadoop Operations and Cluster Management Cookbook]]
  '''Author:''' Shumin Guo
@@ -229, +187 @@
  Hadoop Operations and Cluster Management Cookbook is a guide for designing and managing a Hadoop cluster.
-
  === Hadoop Beginner's Guide ===
-
  '''Name:''' [[http://www.packtpub.com/hadoop-beginners-guide/book|Hadoop Beginner's Guide]]
  '''Author:''' Garry Turkington
@@ -247, +203 @@
  Written for complete beginners to Hadoop, covers how to install and run Hadoop on a local Ubuntu host or create an on-demand Hadoop cluster on Amazon Web Services (EC2), before getting to grips with !MapReduce.
  === Hadoop Real World Solutions Cookbook ===
-
  '''Name:''' [[http://www.packtpub.com/hadoop-real-world-solutions-cookbook/book|Hadoop Real World Solutions Cookbook]]
  '''Author:''' Jonathan Owens, Brian Femiano, Jon Lentz
@@ -262, +217 @@
  Collection of real world code analytics and design patterns using various tools from the Hadoop community.
  Each recipe walks the reader through the implementation, or in some cases debugging and configuration tuning. The book covers various tools including !MapReduce, Hive, Pig, MRUnit, serialization using Avro/Thrift/ProtoBuffs, Giraph, Accumulo and several others.
-
  === Hadoop MapReduce Cookbook ===
-
  '''Name:''' [[http://www.packtpub.com/hadoop-mapreduce-cookbook/book|Hadoop MapReduce Cookbook]]
  '''Author:''' Srinath Perera, Thilina Gunarathne
@@ -277, +230 @@
  '''Sample Chapter:''' [[https://www.packtpub.com/sites/default/files/9781849517287_Chapter_06.pdf|Chapter 6: Analytics]]
- Hadoop !MapReduce Cookbook is a guide to processing large and complex data sets using Hadoop MapReduce.
+ Hadoop !MapReduce Cookbook is a guide to processing large and complex data sets using Hadoop MapReduce.
  == Hadoop Operations ==
-
  '''Name:''' [[http://shop.oreilly.com/product/0636920025085.do|Hadoop Operations]]
  '''Author:''' Eric Sammers
@@ -294, +246 @@
  A guide to running large-scale Hadoop clusters, written by someone who has practical experience in such deployments.
  == Hadoop in Practice ==
-
  '''Name:''' [[http://www.manning.com/holmes/|Hadoop in Practice]]
  '''Author:''' Alex Holmes
@@ -307, +258 @@
  '''Sample Chapter:''' [[http://www.manning.com/holmes/HiPmeap_ch01.pdf|Chapter 1]]
-
  === Hadoop: The Definitive Guide, 3rd Edition ===
-
  '''Name:''' [[http://shop.oreilly.com/product/0636920021773.do|Hadoop: The Definitive Guide, 3rd Edition]]
  '''Author:''' Tom White
@@ -320, +269 @@
  '''Date of Publishing:''' May 2012
-
  '''Sample Chapter:''' [[http://cdn.oreilly.com/oreilly/booksamplers/9781449311520_sampler.pdf|Sample Chapter]]
  === Hadoop in Action ===
-
  '''Name:''' [[http://www.manning.com/lam/|Hadoop in Action]]
  '''Author:''' Chuck Lam
@@ -340, +287 @@
  Hadoop in Action introduces the subject and shows how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex data analysis tasks.
  Included are best practices and design patterns of MapReduce programming.
  === Hadoop: The Definitive Guide, 2nd Edition ===
-
  '''Name:''' [[http://shop.oreilly.com/product/0636920010388.do|Hadoop: The Definitive Guide, 2nd Edition]]
  '''Author:''' Tom White
@@ -352, +298 @@
  '''Date of Publishing:''' September 2010
  === Pro Hadoop ===
-
  '''Name:''' [[http://www.amazon.com/dp/1430219424?tag=jewlerymall|Pro Hadoop]]
  '''Author:''' Jason Venner
@@ -366, +311 @@
  Jason says "This book is a step by step guide to writing, running and debugging Map/Reduce jobs using Hadoop, and to installing and managing Hadoop Clusters. It is ideal for training new Map/Reduce users and Cluster administrators and for polishing existing Hadoop skills."
  === Hadoop: The Definitive Guide ===
-
  '''Name:''' [[http://www.amazon.com/Hadoop-Definitive-Guide-Tom-White/dp/0596521979/|Hadoop: The Definitive Guide]]
  '''Author:''' Tom White
@@ -378, +322 @@
  '''Date of Publishing:''' June 19, 2009
  == Forthcoming Books ==
-
-
  === Hadoop in Action, Second Edition ===
-
  '''Name:''' [[http://www.manning.com/lam2/|Hadoop in Action, Second Edition]]
  '''Author:''' Chuck P. Lam, Mark W. Davis