Modified: samza/site/sitemap.xml URL: http://svn.apache.org/viewvc/samza/site/sitemap.xml?rev=1851293&r1=1851292&r2=1851293&view=diff ============================================================================== --- samza/site/sitemap.xml (original) +++ samza/site/sitemap.xml Mon Jan 14 20:19:31 2019 @@ -20,7 +20,7 @@ <url> <loc>http://samza.apache.org/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> <changefreq>daily</changefreq> <priority>1.0</priority> </url> @@ -30,658 +30,658 @@ <url> <loc>http://samza.apache.org/learn/documentation/versioned/yarn/application-master</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/architecture/architecture-overview</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/introduction/architecture</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/introduction/background</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/checkpointing</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/contribute/code</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/contribute/coding-guide</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/community/committers-old</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/community/committers</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/introduction/concepts</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/configuration</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/hadoop/consumer</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/community/contact-us</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/contribute/contributors-corner</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/coordinator-stream</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/core-concepts/core-concepts</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/deploy-samza-job-from-hdfs</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/deploy-samza-to-CDH</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/deployment/deployment-model</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/contribute/enhancement-proposal</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/event-loop</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/connectors/eventhubs</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/connectors/hdfs</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/hello-samza-high-level-code</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/hello-samza-high-level-yarn</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/hello-samza-high-level-zk</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/api/high-level-api</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/archive/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/powered-by/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/talks/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/meetups/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/startup/quick-start/versioned/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/startup/preview/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/startup/hello-samza/versioned/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/startup/code-examples/versioned/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/startup/download/</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/comparisons/introduction</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/community/irc</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/yarn/isolation</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/jmx</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/job-runner</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/rest/resources/jobs</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/operations/kafka</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/connectors/kafka</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/architecture/kinesis</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/connectors/kinesis</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/logging</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/api/low-level-api</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/metrics</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/operations/monitoring</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/rest/monitors</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/comparisons/mupd8</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/connectors/overview</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/hadoop/overview</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/rest/overview</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/packaging</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/hadoop/producer</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/api/programming-model</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/startup/releases/versioned/release-notes</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/remote-debugging-samza</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/reprocessing</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/rest/resource-directory</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/rest/resources</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/run-hello-samza-without-internet</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/run-in-multi-node-yarn</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/samza-async-user-guide</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/samza-configurations</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/samza-container</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/samza-event-hubs-standalone</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/samza-rest-getting-started</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/samza-sql</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/api/samza-sql</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/tutorials/versioned/samza-tools</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/operations/security</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/serialization</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/comparisons/spark-streaming</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/split-deployment</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/deployment/standalone</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/state-management</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/comparisons/storm</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/streams</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/api/table-api</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/rest/resources/tasks</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/contribute/tests</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/web-ui-rest-api</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/container/windowing</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/yarn/yarn-host-affinity</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/jobs/yarn-jobs</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/yarn/yarn-resource-localization</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/yarn/yarn-security</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url> <url> <loc>http://samza.apache.org/learn/documentation/versioned/deployment/yarn</loc> - <lastmod>2018-12-11</lastmod> + <lastmod>2019-01-14</lastmod> </url>
Modified: samza/site/startup/code-examples/1.0.0/index.html URL: http://svn.apache.org/viewvc/samza/site/startup/code-examples/1.0.0/index.html?rev=1851293&r1=1851292&r2=1851293&view=diff ============================================================================== --- samza/site/startup/code-examples/1.0.0/index.html (original) +++ samza/site/startup/code-examples/1.0.0/index.html Mon Jan 14 20:19:31 2019 @@ -546,6 +546,60 @@ These include:</p> <li><p><a href="https://github.com/apache/samza-hello-samza/tree/master/src/main/java/samza/examples/kinesis">Amazon Kinesis</a> and <a href="https://github.com/apache/samza-hello-samza/tree/latest/src/main/java/samza/examples/azure">Azure Eventhubs</a> examples that cover how to consume input data from the respective systems.</p></li> </ul> +<h4 id="apache-beam-api-examples">Apache Beam API examples</h4> + +<p>The easiest way to get a copy of the WordCount examples in Beam API is to use <a href="http://maven.apache.org/download.cgi">Apache Maven</a>. After installing Maven, please run the following command:</p> + +<figure class="highlight"><pre><code class="language-bash" data-lang="bash"><span></span>> mvn archetype:generate <span class="se">\</span> + -DarchetypeGroupId<span class="o">=</span>org.apache.beam <span class="se">\</span> + -DarchetypeArtifactId<span class="o">=</span>beam-sdks-java-maven-archetypes-examples <span class="se">\</span> + -DarchetypeVersion<span class="o">=</span><span class="m">2</span>.9.0 <span class="se">\</span> + -DgroupId<span class="o">=</span>org.example <span class="se">\</span> + -DartifactId<span class="o">=</span>word-count-beam <span class="se">\</span> + -Dversion<span class="o">=</span><span class="s2">"0.1"</span> <span class="se">\</span> + -Dpackage<span class="o">=</span>org.apache.beam.examples <span class="se">\</span> + -DinteractiveMode<span class="o">=</span><span class="nb">false</span></code></pre></figure> + +<p>This command creates a maven project <code>word-count-beam</code> which contains a series of example pipelines that count words in text files:</p> + +<figure class="highlight"><pre><code class="language-bash" data-lang="bash"><span></span>> <span class="nb">cd</span> word-count-beam/ + +> ls src/main/java/org/apache/beam/examples/ +DebuggingWordCount.java WindowedWordCount.java common +MinimalWordCount.java WordCount.java</code></pre></figure> + +<p>To use SamzaRunner, please add the following <code>samza-runner</code> profile to <code>pom.xml</code> under the “profiles” section, same as in <a href="https://github.com/apache/beam/blob/master/sdks/java/maven-archetypes/examples/src/main/resources/archetype-resources/pom.xml">here</a>.</p> + +<figure class="highlight"><pre><code class="language-xml" data-lang="xml"><span></span> ... + <span class="nt"><profile></span> + <span class="nt"><id></span>samza-runner<span class="nt"></id></span> + <span class="nt"><dependencies></span> + <span class="nt"><dependency></span> + <span class="nt"><groupId></span>org.apache.beam<span class="nt"></groupId></span> + <span class="nt"><artifactId></span>beam-runners-samza<span class="nt"></artifactId></span> + <span class="nt"><version></span>${beam.version}<span class="nt"></version></span> + <span class="nt"><scope></span>runtime<span class="nt"></scope></span> + <span class="nt"></dependency></span> + <span class="nt"></dependencies></span> + <span class="nt"></profile></span> + ....</code></pre></figure> + +<p>Now we can run the wordcount example with Samza using the following command:</p> + +<figure class="highlight"><pre><code class="language-bash" data-lang="bash"><span></span>>mvn compile exec:java -Dexec.mainClass<span class="o">=</span>org.apache.beam.examples.WordCount <span class="se">\</span> + -Dexec.args<span class="o">=</span><span class="s2">"--inputFile=pom.xml --output=/tmp/counts --runner=SamzaRunner"</span> -Psamza-runner</code></pre></figure> + +<p>After the pipeline finishes, you can check out the output counts files in /tmp folder. Note Beam generates multiple output files for parallel processing. If you prefer a single output, please update the code to use TextIO.write().withoutSharding().</p> + +<figure class="highlight"><pre><code class="language-bash" data-lang="bash"><span></span>>more /tmp/counts* +AS: <span class="m">1</span> +IO: <span class="m">2</span> +IS: <span class="m">1</span> +OF: <span class="m">1</span> +...</code></pre></figure> + +<p>A walkthrough of the example code can be found <a href="https://beam.apache.org/get-started/wordcount-example/">here</a>. Feel free to play with other examples in the project or write your own. Please don’t hesitate to <a href="https://samza.apache.org/community/contact-us.html">reach out</a> if you encounter any issues.</p> + </div> </div>