Dear Wiki user, You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.
The following page has been changed by TedDunning: http://wiki.apache.org/lucene-hadoop/ImportantConcepts ------------------------------------------------------------------------------ Some notable terms that may confuse you: * Hadoop - Hadoop itself refers to the overall system that runs jobs, distributes tasks (pieces of these jobs) and stores data in a parallel and distributed fashion. + + * [:HadoopMapReduce:Map/reduce] - Is the style in which most programs running on Hadoop are written. In this style, input is broken in tiny pieces which are processed independently (the map part). The results of these independent processes are then collated into groups and processed as groups (the reduce part). Follow the link for a much more complete description. * Job - In hadoop, the combination of all of the jars and classes needed to run a map/reduce program is called a job. All of these components are themselves collected into a jar which is usually referred to as a job file. To execute a job, you normally will use the command: