Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Pig Wiki" for change notification.
The following page has been changed by GregStein:
http://wiki.apache.org/pig/FrontPage

The comment on the change is:
better branding

------------------------------------------------------------------------------
  #language en
  #pragma section-numbers off

- = Pig Wiki =
+ = Apache Pig Wiki =

- [http://incubator.apache.org/pig/ Pig] is a platform for analyzing large data sets. Pig's language, Pig Latin, lets you specify a sequence of data transformations such as merging data sets, filtering them, and applying functions to records or groups of records. Pig comes with many built-in functions but you can also create your own user-defined functions to do special-purpose processing.
+ [http://incubator.apache.org/pig/ Apache Pig] is a platform for analyzing large data sets. Pig's language, Pig Latin, lets you specify a sequence of data transformations such as merging data sets, filtering them, and applying functions to records or groups of records. Pig comes with many built-in functions but you can also create your own user-defined functions to do special-purpose processing.

  Pig Latin programs run in a distributed fashion on a cluster (programs are compiled into Map/Reduce jobs and executed using Hadoop). For quick prototyping, Pig Latin programs can also run in "local mode" without a cluster (all processing takes place in a single local JVM).

@@ -20, +20 @@
  '''Why Pig Latin instead of SQL?''' [http://www.cs.cmu.edu/~olston/publications/sigmod08.pdf Pig Latin: A Not-So-Foreign Language ...]

- '''Pig Has Grown Up!'''. On 10/22/08 Pig graduated from the [http://incubator.apache.org/ Incubator] and joined [http://hadoop.apache.org/ Hadoop] as a subproject.
+ '''Pig Has Grown Up!'''. On 10/22/08 Pig graduated from the [http://incubator.apache.org/ Incubator] and joined [http://hadoop.apache.org/ Apache Hadoop] as a subproject.

  '''Pig is Getting Faster!''' 2-6 times faster, for many queries.
  We've created a set of benchmarks and run them against the pig 0.1.0 release (modified to run on hadoop 0.18) and against the current trunk (previously `types` branch.) Joins and order bys in particular made large performance gains. For complete details see PigMix.

@@ -57, +57 @@
  * A Vim plugin for Pig Latin - [http://www.vim.org/scripts/script.php?script_id=2186]
  * More Pig
- * [http://hadoop.apache.org/pig/docs/r0.3.0/cookbook.html Pig Cookbook] - Want Pig to fly? Tips and tricks on how to write efficient Pig scripts
+ * [http://hadoop.apache.org/pig/docs/r0.3.0/cookbook.html Apache Pig Cookbook] - Want Pig to fly? Tips and tricks on how to write efficient Pig scripts
  * [http://hadoop.apache.org/pig/javadoc/docs/api/ Javadocs] - Refer to the Javadocs for embedded Pig and UDFs
  * [http://wiki.apache.org/pig/FAQ FAQ] - The answer to your question may be here
