Hadoop
Dosed by Rajkumar Parthasarathi – M & E Team (from MphasiS Software Services)
What is Hadoop?
Apache Hadoop is a free Java software framework that supports data-intensive
distributed applications running on large clusters of commodity computers.
Hadoop was inspired by Google's MapReduce, in which an application is broken
down into numerous small parts. Any of these parts can be run on any node in
the cluster.
In layman's terms, Hadoop breaks up a computation problem into manageable
chunks and distributes them to different processors. Once all the individual
results are in, they are combined into one big result.
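The split, distribute, and combine idea above can be sketched in a few lines of
Python. This is only an illustration under stated assumptions, not Hadoop's API:
the function names (split_into_chunks, count_words, combine, word_count) are
hypothetical, word counting is just a convenient sample problem, and a local
thread pool stands in for the nodes of a cluster.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(lines, chunk_size):
    """Break the input into chunks of at most chunk_size lines each."""
    return [lines[i:i + chunk_size] for i in range(0, len(lines), chunk_size)]


def count_words(chunk):
    """'Map' step: count the words in one chunk, independently of the others."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def combine(partial_counts):
    """'Reduce' step: merge the per-chunk results into one overall result."""
    total = Counter()
    for counts in partial_counts:
        total.update(counts)
    return total


def word_count(lines, chunk_size=2, workers=4):
    """Split the input, process the chunks in parallel, then combine them."""
    chunks = split_into_chunks(lines, chunk_size)
    # Assumption: worker threads stand in for the machines of a real cluster.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_counts = list(pool.map(count_words, chunks))
    return combine(partial_counts)
```

Calling word_count(["the quick brown fox", "the lazy dog"]) returns a Counter
holding the combined totals for every word. Because each chunk is processed
without reference to the others, any chunk could in principle be handled by any
worker, which is the property the description above relies on.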
Hadoop is a top-level Apache project, built and used by a community of
contributors from all over the world. Yahoo! has been the largest contributor
to the project and uses Hadoop extensively in its Web Search and Advertising
businesses. IBM and Google have announced a major initiative to use Hadoop to
support university courses in distributed computer programming.
Hadoop was named after a stuffed elephant belonging to the child of its
creator, Doug Cutting (now a Yahoo! employee). It was originally developed to
support distribution for the Nutch search engine project.
Core Features:
It enables applications to easily scale out to thousands of nodes and petabytes
of data.
Work can be split into tasks that run across multiple nodes of a cluster, and
Hadoop takes care of the fail-over mechanism and of monitoring the tasks.
Further References
Hadoop website
http://hadoop.apache.org
Hadoop wiki
http://wiki.apache.org/hadoop/