[Lucene-hadoop Wiki] Update of "ImportantConcepts" by TedDunning

Apache Wiki Thu, 19 Jul 2007 20:13:24 -0700

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for 
change notification.


The following page has been changed by TedDunning:
http://wiki.apache.org/lucene-hadoop/ImportantConcepts

------------------------------------------------------------------------------
  Some notable terms that may confuse you:
  
  * Hadoop - Hadoop itself refers to the overall system that runs jobs, 
distributes tasks (pieces of these jobs) and stores data in a parallel and 
distributed fashion.
+ 
+ * [:HadoopMapReduce:Map/reduce] - Is the style in which most programs running 
on Hadoop are written.  In this style, input is broken in tiny pieces which are 
processed independently (the map part).  The results of these independent 
processes are then collated into groups and processed as groups (the reduce 
part).  Follow the link for a much more complete description.
  
  * Job -  In hadoop, the combination of all of the jars and classes needed to 
run a map/reduce program is called a job.  All of these components are 
themselves collected into a jar which is usually referred to as a job file.  To 
execute a job, you normally will use the command:

[Lucene-hadoop Wiki] Update of "ImportantConcepts" by TedDunning

Reply via email to