This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 6ec9c9583ae84f4735d7f44f4ef487e6c0b5b8a2
Author: Mergebot <merge...@apache.org>
AuthorDate: Thu Feb 22 11:12:39 2018 -0800

    Prepare repository for deployment.
---
 content/get-started/wordcount-example/index.html | 43 +++++++++++-------------
 1 file changed, 19 insertions(+), 24 deletions(-)

diff --git a/content/get-started/wordcount-example/index.html b/content/get-started/wordcount-example/index.html
index bbe6fad..eddefd4 100644
--- a/content/get-started/wordcount-example/index.html
+++ b/content/get-started/wordcount-example/index.html
@@ -298,7 +298,11 @@ often, command-line arguments are used to set <code class="highlighter-rouge">Pi
 example, your pipeline executes locally using the <code class="highlighter-rouge">DirectRunner</code>. In the next
 sections, we will specify the pipeline’s runner.</p>
 
-<div class="language-java highlighter-rouge"><pre class="highlight"><code> <span class="n">PipelineOptions</span> <span class="n">options</span> <span class="o">=</span> <span class="n">PipelineOptionsFactory</span><span class="o">.</span><span class="na">create</span><span class="o">();</span>
+<div class="language-java highlighter-rouge"><pre class="highlight"><code> <span class="c1">// Create a PipelineOptions object. This object lets us set various execution</span>
+ <span class="c1">// options for our pipeline, such as the runner you wish to use. This example</span>
+ <span class="c1">// will run with the DirectRunner by default, based on the class path configured</span>
+ <span class="c1">// in its dependencies.</span>
+ <span class="n">PipelineOptions</span> <span class="n">options</span> <span class="o">=</span> <span class="n">PipelineOptionsFactory</span><span class="o">.</span><span class="na">create</span><span class="o">();</span>
 
     <span class="c1">// In order to run your pipeline, you need to make following runner specific changes:</span>
     <span class="c1">//</span>
@@ -372,25 +376,19 @@ data stored in a publicly accessible Google Cloud Storage bucket (“gs://”).<
     </div>
   </li>
   <li>
-    <p>A <a href="/documentation/programming-guide/#pardo">ParDo</a>
-transform that invokes a <code class="highlighter-rouge">DoFn</code> (defined in-line as an anonymous class) on
+    <p>This transform splits the lines in PCollection<String>, where each element
+is an individual word in Shakespeare's collected texts.
+As an alternative, it would have been possible to use a [ParDo](/documentation/programming-guide/#pardo)
+transform that invokes a `DoFn` (defined in-line as an anonymous class) on
 each element that tokenizes the text lines into individual words. The input
-for this transform is the <code class="highlighter-rouge">PCollection</code> of text lines generated by the
-previous <code class="highlighter-rouge">TextIO.Read</code> transform. The <code class="highlighter-rouge">ParDo</code> transform outputs a new
-<code class="highlighter-rouge">PCollection</code>, where each element represents an individual word in the text.</p>
+for this transform is the `PCollection` of text lines generated by the
+previous `TextIO.Read` transform. The `ParDo` transform outputs a new
+`PCollection`, where each element represents an individual word in the text.</String></p>
 
-    <div class="language-java highlighter-rouge"><pre class="highlight"><code><span class="o">.</span><span class="na">apply</span><span class="o">(</span><span class="s">"ExtractWords"</span><span class="o">,</span> <span class="n">ParDo</span><span class="o">.</span><span class="na">of</span><span class="o">(</span><span class="k">new</span> <span class="n">DoFn</span><span class="o">&lt;</span><span class="n">String</span><span class="o">,</span> <span class="n">String</span><span cla [...]
-    <span class="nd">@ProcessElement</span>
-    <span class="kd">public</span> <span class="kt">void</span> <span class="nf">processElement</span><span class="o">(</span><span class="n">ProcessContext</span> <span class="n">c</span><span class="o">)</span> <span class="o">{</span>
-        <span class="c1">// \p{L} denotes the category of Unicode letters,</span>
-        <span class="c1">// so this pattern will match on everything that is not a letter.</span>
-        <span class="k">for</span> <span class="o">(</span><span class="n">String</span> <span class="n">word</span> <span class="o">:</span> <span class="n">c</span><span class="o">.</span><span class="na">element</span><span class="o">().</span><span class="na">split</span><span class="o">(</span><span class="s">"[^\\p{L}]+"</span><span class="o">))</span> <span class="o">{</span>
-            <span class="k">if</span> <span class="o">(!</span><span class="n">word</span><span class="o">.</span><span class="na">isEmpty</span><span class="o">())</span> <span class="o">{</span>
-                <span class="n">c</span><span class="o">.</span><span class="na">output</span><span class="o">(</span><span class="n">word</span><span class="o">);</span>
-            <span class="o">}</span>
-        <span class="o">}</span>
-    <span class="o">}</span>
-<span class="o">}))</span>
+    <div class="language-java highlighter-rouge"><pre class="highlight"><code>    <span class="o">.</span><span class="na">apply</span><span class="o">(</span><span class="s">"ExtractWords"</span><span class="o">,</span> <span class="n">FlatMapElements</span>
+        <span class="o">.</span><span class="na">into</span><span class="o">(</span><span class="n">TypeDescriptors</span><span class="o">.</span><span class="na">strings</span><span class="o">())</span>
+        <span class="o">.</span><span class="na">via</span><span class="o">((</span><span class="n">String</span> <span class="n">word</span><span class="o">)</span> <span class="o">-&gt;</span> <span class="n">Arrays</span><span class="o">.</span><span class="na">asList</span><span class="o">(</span><span class="n">word</span><span class="o">.</span><span class="na">split</span><span class="o">(</span><span class="s">"[^\\p{L}]+"</span><span class="o">))))</span>
 </code></pre>
     </div>
 
@@ -428,12 +426,9 @@ file.</p>
 simple <code class="highlighter-rouge">ParDo</code>. For each element in the input <code class="highlighter-rouge">PCollection</code>, the map
 transform applies a function that produces exactly one output element.</p>
 
-    <div class="language-java highlighter-rouge"><pre class="highlight"><code><span class="o">.</span><span class="na">apply</span><span class="o">(</span><span class="s">"FormatResults"</span><span class="o">,</span> <span class="n">MapElements</span><span class="o">.</span><span class="na">via</span><span class="o">(</span><span class="k">new</span> <span class="n">SimpleFunction</span><span class="o">&lt;</span><span class="n">KV</span><span class="o">&lt;</span><span class="n">String [...]
-    <span class="nd">@Override</span>
-    <span class="kd">public</span> <span class="n">String</span> <span class="nf">apply</span><span class="o">(</span><span class="n">KV</span><span class="o">&lt;</span><span class="n">String</span><span class="o">,</span> <span class="n">Long</span><span class="o">&gt;</span> <span class="n">input</span><span class="o">)</span> <span class="o">{</span>
-        <span class="k">return</span> <span class="n">input</span><span class="o">.</span><span class="na">getKey</span><span class="o">()</span> <span class="o">+</span> <span class="s">": "</span> <span class="o">+</span> <span class="n">input</span><span class="o">.</span><span class="na">getValue</span><span class="o">();</span>
-    <span class="o">}</span>
-<span class="o">}))</span>
+    <div class="language-java highlighter-rouge"><pre class="highlight"><code><span class="o">.</span><span class="na">apply</span><span class="o">(</span><span class="s">"FormatResults"</span><span class="o">,</span> <span class="n">MapElements</span>
+    <span class="o">.</span><span class="na">into</span><span class="o">(</span><span class="n">TypeDescriptors</span><span class="o">.</span><span class="na">strings</span><span class="o">())</span>
+    <span class="o">.</span><span class="na">via</span><span class="o">((</span><span class="n">KV</span><span class="o">&lt;</span><span class="n">String</span><span class="o">,</span> <span class="n">Long</span><span class="o">&gt;</span> <span class="n">wordCount</span><span class="o">)</span> <span class="o">-&gt;</span> <span class="n">wordCount</span><span class="o">.</span><span class="na">getKey</span><span class="o">()</span> <span class="o">+</span> <span class="s">": "</span>  [...]
 </code></pre>
     </div>
 
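Editorial note on the ExtractWords hunk above: the removed DoFn skipped empty tokens with an isEmpty() check, while the added FlatMapElements lambda, as far as this diff shows, returns the raw result of split(). The sketch below uses plain Java only (no Beam dependency) to illustrate how the two bodies could differ on a line that starts with non-letter characters. The class name ExtractWordsSketch and both helper method names are hypothetical and are not part of the commit. Whether the real pipeline filters empty words in a later step is not visible in this diff.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper class for illustration; not part of the Beam commit.
public class ExtractWordsSketch {
    // Same pattern as in the diff: one or more characters that are not Unicode letters.
    static final String NON_LETTERS = "[^\\p{L}]+";

    // Mirrors the body of the removed DoFn: split, then skip empty tokens.
    static List<String> splitAndFilter(String line) {
        List<String> words = new ArrayList<>();
        for (String word : line.split(NON_LETTERS)) {
            if (!word.isEmpty()) {
                words.add(word);
            }
        }
        return words;
    }

    // Mirrors the body of the added lambda: split only, no filtering.
    static List<String> splitOnly(String line) {
        return Arrays.asList(line.split(NON_LETTERS));
    }

    public static void main(String[] args) {
        String line = "  To be, or not to be";
        System.out.println(splitAndFilter(line));
        System.out.println(splitOnly(line));
    }
}
```

On Java 8 or later, String.split keeps a leading empty string when the input begins with a non-empty match of the pattern, so splitOnly should return one more element (an empty string at index 0) than splitAndFilter for the sample line.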

-- 
To stop receiving notification emails like this one, please contact
mergebot-r...@apache.org.
