Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.
The following page has been changed by ZhengShao: http://wiki.apache.org/hadoop/Hive ------------------------------------------------------------------------------ - = Welcome to the Hive Wiki! = - = What is Hive = [http://hadoop.apache.org/hive/ Hive] is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to be able to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built in capabilities of the language. @@ -13, +11 @@ If your input data is small you can execute a query in a short time. For example, if a table has 100 rows you can 'set mapred.reduce.tasks=1' and 'set mapred.map.tasks=1' and the query time will be ~15 seconds. = Information = - Following are some useful links for users and developers interested in Hive: * General information about Hive * [wiki:/GettingStarted Getting Started] * [wiki:/Presentations Presentations About Hive]
