This is an automated email from the ASF dual-hosted git repository. mergebot-role pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/beam-site.git
commit 6ec9c9583ae84f4735d7f44f4ef487e6c0b5b8a2 Author: Mergebot <merge...@apache.org> AuthorDate: Thu Feb 22 11:12:39 2018 -0800 Prepare repository for deployment. --- content/get-started/wordcount-example/index.html | 43 +++++++++++------------- 1 file changed, 19 insertions(+), 24 deletions(-) diff --git a/content/get-started/wordcount-example/index.html b/content/get-started/wordcount-example/index.html index bbe6fad..eddefd4 100644 --- a/content/get-started/wordcount-example/index.html +++ b/content/get-started/wordcount-example/index.html @@ -298,7 +298,11 @@ often, command-line arguments are used to set <code class="highlighter-rouge">Pi example, your pipeline executes locally using the <code class="highlighter-rouge">DirectRunner</code>. In the next sections, we will specify the pipeline’s runner.</p> -<div class="language-java highlighter-rouge"><pre class="highlight"><code> <span class="n">PipelineOptions</span> <span class="n">options</span> <span class="o">=</span> <span class="n">PipelineOptionsFactory</span><span class="o">.</span><span class="na">create</span><span class="o">();</span> +<div class="language-java highlighter-rouge"><pre class="highlight"><code> <span class="c1">// Create a PipelineOptions object. This object lets us set various execution</span> + <span class="c1">// options for our pipeline, such as the runner you wish to use. This example</span> + <span class="c1">// will run with the DirectRunner by default, based on the class path configured</span> + <span class="c1">// in its dependencies.</span> + <span class="n">PipelineOptions</span> <span class="n">options</span> <span class="o">=</span> <span class="n">PipelineOptionsFactory</span><span class="o">.</span><span class="na">create</span><span class="o">();</span> <span class="c1">// In order to run your pipeline, you need to make following runner specific changes:</span> <span class="c1">//</span> @@ -372,25 +376,19 @@ data stored in a publicly accessible Google Cloud Storage bucket (“gs://”).< </div> </li> <li> - <p>A <a href="/documentation/programming-guide/#pardo">ParDo</a> -transform that invokes a <code class="highlighter-rouge">DoFn</code> (defined in-line as an anonymous class) on + <p>This transform splits the lines in PCollection<String>, where each element +is an individual word in Shakespeare's collected texts. +As an alternative, it would have been possible to use a +[ParDo](/documentation/programming-guide/#pardo) +transform that invokes a `DoFn` (defined in-line as an anonymous class) on each element that tokenizes the text lines into individual words. The input -for this transform is the <code class="highlighter-rouge">PCollection</code> of text lines generated by the -previous <code class="highlighter-rouge">TextIO.Read</code> transform. The <code class="highlighter-rouge">ParDo</code> transform outputs a new -<code class="highlighter-rouge">PCollection</code>, where each element represents an individual word in the text.</p> +for this transform is the `PCollection` of text lines generated by the +previous `TextIO.Read` transform. The `ParDo` transform outputs a new +`PCollection`, where each element represents an individual word in the text.</String></p> - <div class="language-java highlighter-rouge"><pre class="highlight"><code><span class="o">.</span><span class="na">apply</span><span class="o">(</span><span class="s">"ExtractWords"</span><span class="o">,</span> <span class="n">ParDo</span><span class="o">.</span><span class="na">of</span><span class="o">(</span><span class="k">new</span> <span class="n">DoFn</span><span class="o"><</span><span class="n">String</span><span class="o">,</span> <span class="n">String</span><span cla [...] - <span class="nd">@ProcessElement</span> - <span class="kd">public</span> <span class="kt">void</span> <span class="nf">processElement</span><span class="o">(</span><span class="n">ProcessContext</span> <span class="n">c</span><span class="o">)</span> <span class="o">{</span> - <span class="c1">// \p{L} denotes the category of Unicode letters,</span> - <span class="c1">// so this pattern will match on everything that is not a letter.</span> - <span class="k">for</span> <span class="o">(</span><span class="n">String</span> <span class="n">word</span> <span class="o">:</span> <span class="n">c</span><span class="o">.</span><span class="na">element</span><span class="o">().</span><span class="na">split</span><span class="o">(</span><span class="s">"[^\\p{L}]+"</span><span class="o">))</span> <span class="o">{</span> - <span class="k">if</span> <span class="o">(!</span><span class="n">word</span><span class="o">.</span><span class="na">isEmpty</span><span class="o">())</span> <span class="o">{</span> - <span class="n">c</span><span class="o">.</span><span class="na">output</span><span class="o">(</span><span class="n">word</span><span class="o">);</span> - <span class="o">}</span> - <span class="o">}</span> - <span class="o">}</span> -<span class="o">}))</span> + <div class="language-java highlighter-rouge"><pre class="highlight"><code> <span class="o">.</span><span class="na">apply</span><span class="o">(</span><span class="s">"ExtractWords"</span><span class="o">,</span> <span class="n">FlatMapElements</span> + <span class="o">.</span><span class="na">into</span><span class="o">(</span><span class="n">TypeDescriptors</span><span class="o">.</span><span class="na">strings</span><span class="o">())</span> + <span class="o">.</span><span class="na">via</span><span class="o">((</span><span class="n">String</span> <span class="n">word</span><span class="o">)</span> <span class="o">-></span> <span class="n">Arrays</span><span class="o">.</span><span class="na">asList</span><span class="o">(</span><span class="n">word</span><span class="o">.</span><span class="na">split</span><span class="o">(</span><span class="s">"[^\\p{L}]+"</span><span class="o">))))</span> </code></pre> </div> @@ -428,12 +426,9 @@ file.</p> simple <code class="highlighter-rouge">ParDo</code>. For each element in the input <code class="highlighter-rouge">PCollection</code>, the map transform applies a function that produces exactly one output element.</p> - <div class="language-java highlighter-rouge"><pre class="highlight"><code><span class="o">.</span><span class="na">apply</span><span class="o">(</span><span class="s">"FormatResults"</span><span class="o">,</span> <span class="n">MapElements</span><span class="o">.</span><span class="na">via</span><span class="o">(</span><span class="k">new</span> <span class="n">SimpleFunction</span><span class="o"><</span><span class="n">KV</span><span class="o"><</span><span class="n">String [...] - <span class="nd">@Override</span> - <span class="kd">public</span> <span class="n">String</span> <span class="nf">apply</span><span class="o">(</span><span class="n">KV</span><span class="o"><</span><span class="n">String</span><span class="o">,</span> <span class="n">Long</span><span class="o">></span> <span class="n">input</span><span class="o">)</span> <span class="o">{</span> - <span class="k">return</span> <span class="n">input</span><span class="o">.</span><span class="na">getKey</span><span class="o">()</span> <span class="o">+</span> <span class="s">": "</span> <span class="o">+</span> <span class="n">input</span><span class="o">.</span><span class="na">getValue</span><span class="o">();</span> - <span class="o">}</span> -<span class="o">}))</span> + <div class="language-java highlighter-rouge"><pre class="highlight"><code><span class="o">.</span><span class="na">apply</span><span class="o">(</span><span class="s">"FormatResults"</span><span class="o">,</span> <span class="n">MapElements</span> + <span class="o">.</span><span class="na">into</span><span class="o">(</span><span class="n">TypeDescriptors</span><span class="o">.</span><span class="na">strings</span><span class="o">())</span> + <span class="o">.</span><span class="na">via</span><span class="o">((</span><span class="n">KV</span><span class="o"><</span><span class="n">String</span><span class="o">,</span> <span class="n">Long</span><span class="o">></span> <span class="n">wordCount</span><span class="o">)</span> <span class="o">-></span> <span class="n">wordCount</span><span class="o">.</span><span class="na">getKey</span><span class="o">()</span> <span class="o">+</span> <span class="s">": "</span> [...] </code></pre> </div> -- To stop receiving notification emails like this one, please contact mergebot-r...@apache.org.