Author: buildbot
Date: Sun Nov 18 20:45:33 2012
New Revision: 838771
Log:
Staging update by buildbot for crunch
Added:
websites/staging/crunch/trunk/content/crunch/getting-started.html
Modified:
websites/staging/crunch/trunk/content/ (props changed)
websites/staging/crunch/trunk/content/crunch/download.html
websites/staging/crunch/trunk/content/crunch/future-work.html
websites/staging/crunch/trunk/content/crunch/index.html
websites/staging/crunch/trunk/content/crunch/intro.html
websites/staging/crunch/trunk/content/crunch/mailing-lists.html
websites/staging/crunch/trunk/content/crunch/pipelines.html
websites/staging/crunch/trunk/content/crunch/scrunch.html
websites/staging/crunch/trunk/content/crunch/source-repository.html
Propchange: websites/staging/crunch/trunk/content/
------------------------------------------------------------------------------
--- cms:source-revision (original)
+++ cms:source-revision Sun Nov 18 20:45:33 2012
@@ -1 +1 @@
-1410721
+1410987
Modified: websites/staging/crunch/trunk/content/crunch/download.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/download.html (original)
+++ websites/staging/crunch/trunk/content/crunch/download.html Sun Nov 18
20:45:33 2012
@@ -58,6 +58,12 @@
+ <li><a href="/crunch/getting-started.html">Getting
Started</a></li>
+
+
+
+
+
<li><b>Download</b></li>
Modified: websites/staging/crunch/trunk/content/crunch/future-work.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/future-work.html (original)
+++ websites/staging/crunch/trunk/content/crunch/future-work.html Sun Nov 18
20:45:33 2012
@@ -58,6 +58,12 @@
+ <li><a href="/crunch/getting-started.html">Getting
Started</a></li>
+
+
+
+
+
<li><a href="/crunch/download.html">Download</a></li>
Added: websites/staging/crunch/trunk/content/crunch/getting-started.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/getting-started.html (added)
+++ websites/staging/crunch/trunk/content/crunch/getting-started.html Sun Nov
18 20:45:33 2012
@@ -0,0 +1,219 @@
+<!DOCTYPE html>
+
+
+<html xmlns="http://www.w3.org/1999/xhtml" lang="en">
+ <head>
+ <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
+ <meta name="viewport" content="width=device-width, initial-scale=1.0" />
+ <meta http-equiv="Content-Language" content="en" />
+
+ <title>Apache Crunch - Getting Started</title>
+
+ <link rel="stylesheet" href="/crunch/css/bootstrap-2.1.0.min.css" />
+ <link rel="stylesheet" href="/crunch/css/crunch.css" type="text/css">
+ <script type="text/javascript"
src="/crunch/js/bootstrap-2.1.0.min.js"></script>
+ </head>
+ <body>
+
+ <div class="navbar navbar-inverse navbar-static-top">
+
+ <div class="container-fluid">
+
+ <a class="nav pull-right brand" href="http://incubator.apache.org">
+ <img src="http://incubator.apache.org/images/egg-logo.png"
alt="apache Incubator Logo" />
+ </a>
+
+ </div>
+
+ </div>
+
+ <ul class="breadcrumb">
+ <li>
+ <a href="/">Incubator</a>
+ <span class="divider">»</span>
+ </li>
+ <li>
+ <a href="/crunch/">Crunch</a>
+ </li>
+
+ </ul>
+
+ <div class="container-fluid">
+ <div class="row-fluid">
+
+ <!-- SIDEBAR AREA -->
+ <div class="span2">
+ <div class="sidebar-nav">
+ <ul class="nav nav-list">
+
+
+ <li class="nav-header">Apache Crunch</li>
+
+
+
+
+ <li><a href="/crunch/index.html">Overview</a></li>
+
+
+
+
+
+ <li><b>Getting Started</b></li>
+
+
+
+
+
+ <li><a href="/crunch/download.html">Download</a></li>
+
+
+
+
+
+ <li><a href="/crunch/apidocs/0.4.0/">API</a></li>
+
+
+
+
+
+ <li><a
href="https://cwiki.apache.org/confluence/display/CRUNCH/">Wiki</a></li>
+
+
+
+
+ <li class="nav-header">Project</li>
+
+
+
+
+ <li><a href="/crunch/source-repository.html">Source
Code</a></li>
+
+
+
+
+
+ <li><a href="/crunch/mailing-lists.html">Mailing
Lists</a></li>
+
+
+
+
+
+ <li><a
href="http://issues.apache.org/jira/browse/CRUNCH">Issue Tracking</a></li>
+
+
+
+
+
+ <li><a
href="http://apache.org/licenses/LICENSE-2.0.html">License</a></li>
+
+
+
+ </ul>
+ </div> <!-- /well -->
+ </div> <!-- /span -->
+
+ <!-- CONTENT AREA -->
+ <div class="span10">
+ <h1 class="title">
+ Getting Started
+
+ </h1>
+
+ <p>Crunch is developed against Apache Hadoop version 1.0.3 and is
also tested against
+Apache Hadoop 2.0.0-alpha. Crunch should work with any version of Hadoop
+after 1.0.3 or 2.0.0-alpha, and is also known to work with distributions from
+vendors like Cloudera, Hortonworks, and IBM. Crunch is <em>not</em> compatible
with
+versions of Hadoop prior to 1.0.x or 2.0.x, such as Apache Hadoop 0.20.x.</p>
+<p>The easiest way to get started with Crunch is to use its Maven archetype
+to generate a simple project. The archetype is available from Maven Central;
+just enter the following command, answer a few questions, and you're ready to
+go:</p>
+<pre>
+$ <strong>mvn archetype:generate
-Dfilter=org.apache.crunch:crunch-archetype</strong>
+[...]
+1: remote -> org.apache.crunch:crunch-archetype (Create a basic,
self-contained job for Apache Crunch.)
+Choose a number or apply filter (format: [groupId:]artifactId, case sensitive
contains): : <strong>1</strong>
+Define value for property 'groupId': : <strong>com.example</strong>
+Define value for property 'artifactId': : <strong>crunch-demo</strong>
+Define value for property 'version': 1.0-SNAPSHOT: : <strong>[HIT
ENTER]</strong>
+Define value for property 'package': com.example: : <strong>[HIT
ENTER]</strong>
+Confirm properties configuration:
+groupId: com.example
+artifactId: crunch-demo
+version: 1.0-SNAPSHOT
+package: com.example
+ Y: : <strong>[HIT ENTER]</strong>
+[...]
+$
+</pre>
+
+<p>The generated Maven project contains an example application that counts
+word frequencies in text files:</p>
+<pre>
+$ <strong>cd crunch-demo</strong>
+$ <strong>tree</strong>
+.
+|-- pom.xml
+`-- src
+ |-- main
+ | |-- assembly
+ | | `-- <strong>hadoop-job.xml</strong>
+ | `-- java
+ | `-- com
+ | `-- example
+ | |-- StopWordFilter.java
+ | |-- Tokenizer.java
+ | `-- <strong>WordCount.java</strong>
+ `-- test
+ `-- java
+ `-- com
+ `-- example
+ |-- StopWordFilterTest.java
+ `-- TokenizerTest.java
+</pre>
+
+<p>The <code>WordCount.java</code> file contains the main class that defines a
Crunch-based
+application which is referenced from <code>pom.xml</code>.</p>
+<p>Build the code:</p>
+<pre>
+$ <strong>mvn package</strong>
+</pre>
+
+<p>Your packaged application is created in the <code>target</code> directory.
The build
+process uses Maven's assembly plugin with some configuration in
+<code>hadoop-job.xml</code> to create a special JAR file (suffix
<code>-job.jar</code>).
+Depending on your Hadoop configuration, you can run it locally or on a
+cluster using Hadoop's launcher script:</p>
+<pre>
+$ <strong>hadoop jar target/hadoop-job-demo-1.0-SNAPSHOT-job.jar <in>
<out></strong>
+</pre>
+
+<p>The <code><in></code> parameter references a text file or a directory
containing text
+files, while <code><out></code> is a directory where Crunch writes the
final results to.</p>
+<p>Crunch also lets you run applications from within an IDE, either as
standalone
+Java applications or from unit tests. All required dependencies are on Maven's
+classpath so you can run the <code>WordCount</code> class directly without any
additional
+setup.</p>
+ </div> <!-- /span -->
+
+ </div> <!-- /row-fluid -->
+
+ </div>
+
+ <hr/>
+
+ <footer>
+ <div class="container-fluid">
+ <div class="row span12">Copyright © 2012
+ <a href="http://www.apache.org/">The Apache Software Foundation</a>,
+ licensed under the <a
href="http://www.apache.org/licenses/LICENSE-2.0">Apache License, Version
2.0</a>.
+ <p><small>Apache Incubator, Apache Hadoop, Hadoop, Apache, and the
+ Apache feather logo are trademarks of The Apache Software Foundation.
+ Other names appearing on the site may be trademarks of their
+ respective owners.</small></p>
+ </div>
+ </div>
+ </footer>
+
+ </body>
+</html>
Modified: websites/staging/crunch/trunk/content/crunch/index.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/index.html (original)
+++ websites/staging/crunch/trunk/content/crunch/index.html Sun Nov 18 20:45:33
2012
@@ -58,6 +58,12 @@
+ <li><a href="/crunch/getting-started.html">Getting
Started</a></li>
+
+
+
+
+
<li><a href="/crunch/download.html">Download</a></li>
Modified: websites/staging/crunch/trunk/content/crunch/intro.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/intro.html (original)
+++ websites/staging/crunch/trunk/content/crunch/intro.html Sun Nov 18 20:45:33
2012
@@ -58,6 +58,12 @@
+ <li><a href="/crunch/getting-started.html">Getting
Started</a></li>
+
+
+
+
+
<li><a href="/crunch/download.html">Download</a></li>
Modified: websites/staging/crunch/trunk/content/crunch/mailing-lists.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/mailing-lists.html (original)
+++ websites/staging/crunch/trunk/content/crunch/mailing-lists.html Sun Nov 18
20:45:33 2012
@@ -58,6 +58,12 @@
+ <li><a href="/crunch/getting-started.html">Getting
Started</a></li>
+
+
+
+
+
<li><a href="/crunch/download.html">Download</a></li>
Modified: websites/staging/crunch/trunk/content/crunch/pipelines.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/pipelines.html (original)
+++ websites/staging/crunch/trunk/content/crunch/pipelines.html Sun Nov 18
20:45:33 2012
@@ -58,6 +58,12 @@
+ <li><a href="/crunch/getting-started.html">Getting
Started</a></li>
+
+
+
+
+
<li><a href="/crunch/download.html">Download</a></li>
Modified: websites/staging/crunch/trunk/content/crunch/scrunch.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/scrunch.html (original)
+++ websites/staging/crunch/trunk/content/crunch/scrunch.html Sun Nov 18
20:45:33 2012
@@ -58,6 +58,12 @@
+ <li><a href="/crunch/getting-started.html">Getting
Started</a></li>
+
+
+
+
+
<li><a href="/crunch/download.html">Download</a></li>
Modified: websites/staging/crunch/trunk/content/crunch/source-repository.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/source-repository.html
(original)
+++ websites/staging/crunch/trunk/content/crunch/source-repository.html Sun Nov
18 20:45:33 2012
@@ -58,6 +58,12 @@
+ <li><a href="/crunch/getting-started.html">Getting
Started</a></li>
+
+
+
+
+
<li><a href="/crunch/download.html">Download</a></li>