This is an automated email from the ASF dual-hosted git repository.

git-site-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/asf-site by this push:
     new 1f44beb4c24 Publishing website 2022/08/18 22:16:53 at commit 062a0d2
1f44beb4c24 is described below

commit 1f44beb4c2405795ebf1e3546afd87ffd4e261fc
Author: jenkins <[email protected]>
AuthorDate: Thu Aug 18 22:16:53 2022 +0000

    Publishing website 2022/08/18 22:16:53 at commit 062a0d2
---
 .../get-started/from-spark/index.html              |  4 +-
 website/generated-content/get-started/index.xml    | 48 ++++++++++------------
 website/generated-content/sitemap.xml              |  2 +-
 3 files changed, 25 insertions(+), 29 deletions(-)

diff --git a/website/generated-content/get-started/from-spark/index.html 
b/website/generated-content/get-started/from-spark/index.html
index ee66d4b22aa..1acd6de5c8b 100644
--- a/website/generated-content/get-started/from-spark/index.html
+++ b/website/generated-content/get-started/from-spark/index.html
@@ -69,7 +69,7 @@ This serves both as comments and makes your pipeline easier 
to debug.</p><p>This
         <span class=o>|</span> <span class=s1>&#39;Multiply by two&#39;</span> 
<span class=o>&gt;&gt;</span> <span class=n>beam</span><span 
class=o>.</span><span class=n>Map</span><span class=p>(</span><span 
class=k>lambda</span> <span class=n>x</span><span class=p>:</span> <span 
class=n>x</span> <span class=o>*</span> <span class=mi>2</span><span 
class=p>)</span>
         <span class=o>|</span> <span class=s1>&#39;Sum everything&#39;</span> 
<span class=o>&gt;&gt;</span> <span class=n>beam</span><span 
class=o>.</span><span class=n>CombineGlobally</span><span class=p>(</span><span 
class=nb>sum</span><span class=p>)</span>
         <span class=o>|</span> <span class=s1>&#39;Print results&#39;</span> 
<span class=o>&gt;&gt;</span> <span class=n>beam</span><span 
class=o>.</span><span class=n>Map</span><span class=p>(</span><span 
class=k>print</span><span class=p>)</span>
-    <span class=p>)</span></code></pre></div></div></div><h2 
id=setup>Setup</h2><p>Here&rsquo;s a comparison on how to get started both in 
PySpark and Beam.</p><div 
class=table-wrapper><table><tr><th></th><th>PySpark</th><th>Beam</th></tr><tr><td><b>Install</b></td><td><code>$
 pip install pyspark</code></td><td><code>$ pip install 
apache-beam</code></td></tr><tr><td><b>Imports</b></td><td><code>import 
pyspark</code></td><td><code>import apache_beam as 
beam</code></td></tr><tr><td><b>Crea [...]
+    <span class=p>)</span></code></pre></div></div></div><h2 
id=setup>Setup</h2><p>Here&rsquo;s a comparison on how to get started both in 
PySpark and Beam.</p><div class=table-wrapper><table style=width:100%><tr><th 
style=width:20%></th><th style=width:40%>PySpark</th><th 
style=width:40%>Beam</th></tr><tr><td><b>Install</b></td><td><code>$ pip 
install pyspark</code></td><td><code>$ pip install 
apache-beam</code></td></tr><tr><td><b>Imports</b></td><td><code>import 
pyspark</code></td><td [...]
 <a href=/documentation/transforms/python/overview>Python transform 
gallery</a>.</p></blockquote><h2 id=using-calculated-values>Using calculated 
values</h2><p>Since we are working in potentially distributed environments,
 we can&rsquo;t guarantee that the results we&rsquo;ve calculated are available 
at any given machine.</p><p>In PySpark, we can get a result from a collection 
of elements (RDD) by using
 <code>data.collect()</code>, or other aggregations such as 
<code>reduce()</code>, <code>count()</code>, and more.</p><p>Here&rsquo;s an 
example to scale numbers into a range between zero and one.</p><div 
class="language-py snippet"><div class="notebook-skip code-snippet"><a 
class=copy type=button data-bs-toggle=tooltip data-bs-placement=bottom 
title="Copy to clipboard"><img src=/images/copy-icon.svg></a><div 
class=highlight><pre class=chroma><code class=language-py data-lang=py><span 
cla [...]
@@ -108,7 +108,7 @@ and access them as an <a 
href=https://docs.python.org/3/glossary.html#term-itera
     <span class=n>scaled_values</span> <span class=o>|</span> <span 
class=n>beam</span><span class=o>.</span><span class=n>Map</span><span 
class=p>(</span><span class=k>print</span><span 
class=p>)</span></code></pre></div></div></div><blockquote><p>ℹ️ In Beam we 
need to pass a side input explicitly, but we get the
 benefit that a reduction or aggregation does <em>not</em> have to fit into 
memory.
 Lazily computing side inputs also allows us to compute <code>values</code> 
only once,
-rather than for each distinct reduction (or requiring explicit caching of the 
RDD).</p></blockquote><h2 id=next-steps>Next Steps</h2><ul><li>Take a look at 
all the available transforms in the <a 
href=/documentation/transforms/python/overview>Python transform 
gallery</a>.</li><li>Learn how to read from and write to files in the <a 
href=/documentation/programming-guide/#pipeline-io><em>Pipeline I/O</em> 
section of the <em>Programming guide</em></a></li><li>Walk through additional 
WordCount [...]
+rather than for each distinct reduction (or requiring explicit caching of the 
RDD).</p></blockquote><h2 id=next-steps>Next Steps</h2><ul><li>Take a look at 
all the available transforms in the <a 
href=/documentation/transforms/python/overview>Python transform 
gallery</a>.</li><li>Learn how to read from and write to files in the <a 
href=/documentation/programming-guide/#pipeline-io><em>Pipeline I/O</em> 
section of the <em>Programming guide</em></a></li><li>Walk through additional 
WordCount [...]
 <a href=http://www.apache.org>The Apache Software Foundation</a>
 | <a href=/privacy_policy>Privacy Policy</a>
 | <a href=/feed.xml>RSS Feed</a><br><br>Apache Beam, Apache, Beam, the Beam 
logo, and the Apache feather logo are either registered trademarks or 
trademarks of The Apache Software Foundation. All other products or name brands 
are trademarks of their respective holders, including The Apache Software 
Foundation.</div></div><div class="footer__cols__col 
footer__cols__col__logos"><div class=footer__cols__col--group><div 
class=footer__cols__col__logo><a href=https://github.com/apache/beam><im [...]
\ No newline at end of file
diff --git a/website/generated-content/get-started/index.xml 
b/website/generated-content/get-started/index.xml
index 618f639b594..c9afd0482c5 100644
--- a/website/generated-content/get-started/index.xml
+++ b/website/generated-content/get-started/index.xml
@@ -4415,11 +4415,11 @@ This serves both as comments and makes your pipeline 
easier to debug.&lt;/p>
 &lt;/div>
 &lt;h2 id="setup">Setup&lt;/h2>
 &lt;p>Here&amp;rsquo;s a comparison on how to get started both in PySpark and 
Beam.&lt;/p>
-&lt;div class="table-wrapper">&lt;table>
+&lt;div class="table-wrapper">&lt;table style="width:100%">
 &lt;tr>
-&lt;th>&lt;/th>
-&lt;th>PySpark&lt;/th>
-&lt;th>Beam&lt;/th>
+&lt;th style="width:20%">&lt;/th>
+&lt;th style="width:40%">PySpark&lt;/th>
+&lt;th style="width:40%">Beam&lt;/th>
 &lt;/tr>
 &lt;tr>
 &lt;td>&lt;b>Install&lt;/b>&lt;/td>
@@ -4472,86 +4472,82 @@ This serves both as comments and makes your pipeline 
easier to debug.&lt;/p>
 &lt;/table>&lt;/div>
 &lt;h2 id="transforms">Transforms&lt;/h2>
 &lt;p>Here are the equivalents of some common transforms in both PySpark and 
Beam.&lt;/p>
-&lt;div class="table-wrapper">&lt;table>
-&lt;thead>
+&lt;div class="table-wrapper">&lt;table style="width:100%">
 &lt;tr>
-&lt;th>&lt;/th>
-&lt;th>PySpark&lt;/th>
-&lt;th>Beam&lt;/th>
+&lt;th style="width:20%">&lt;/th>
+&lt;th style="width:40%">PySpark&lt;/th>
+&lt;th style="width:40%">Beam&lt;/th>
 &lt;/tr>
-&lt;/thead>
-&lt;tbody>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/elementwise/map/">&lt;strong>Map&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/elementwise/map/">Map&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.map(lambda x: x * 2)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.Map(lambda x: x * 2)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/elementwise/filter/">&lt;strong>Filter&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/elementwise/filter/">Filter&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.filter(lambda x: x % 2 == 0)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.Filter(lambda x: x % 2 == 0)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/elementwise/flatmap/">&lt;strong>FlatMap&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/elementwise/flatmap/">FlatMap&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.flatMap(lambda x: range(x))&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.FlatMap(lambda x: range(x))&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/groupbykey/">&lt;strong>Group
 by key&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/groupbykey/">Group by 
key&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>pairs.groupByKey()&lt;/code>&lt;/td>
 &lt;td>&lt;code>pairs | beam.GroupByKey()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/combineglobally/">&lt;strong>Reduce&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/combineglobally/">Reduce&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.reduce(lambda x, y: x+y)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.CombineGlobally(sum)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/combineperkey/">&lt;strong>Reduce
 by key&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/combineperkey/">Reduce by 
key&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>pairs.reduceByKey(lambda x, y: x+y)&lt;/code>&lt;/td>
 &lt;td>&lt;code>pairs | beam.CombinePerKey(sum)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/distinct/">&lt;strong>Distinct&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/distinct/">Distinct&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.distinct()&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.Distinct()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/count/">&lt;strong>Count&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/count/">Count&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.count()&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.combiners.Count.Globally()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/count/">&lt;strong>Count by 
key&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/count/">Count by 
key&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>pairs.countByKey()&lt;/code>&lt;/td>
 &lt;td>&lt;code>pairs | beam.combiners.Count.PerKey()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/top/">&lt;strong>Take 
smallest&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/top/">Take 
smallest&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.takeOrdered(3)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.combiners.Top.Smallest(3)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/top/">&lt;strong>Take 
largest&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/top/">Take 
largest&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.takeOrdered(3, lambda x: -x)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.combiners.Top.Largest(3)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/sample/">&lt;strong>Random 
sample&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/sample/">Random 
sample&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.takeSample(False, 3)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | 
beam.combiners.Sample.FixedSizeGlobally(3)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/other/flatten/">&lt;strong>Union&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/other/flatten/">Union&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.union(otherValues)&lt;/code>&lt;/td>
 &lt;td>&lt;code>(values, otherValues) | beam.Flatten()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a 
href="/documentation/transforms/python/aggregation/cogroupbykey/">&lt;strong>Co-group&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a 
href="/documentation/transforms/python/aggregation/cogroupbykey/">Co-group&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>pairs.cogroup(otherPairs)&lt;/code>&lt;/td>
 &lt;td>&lt;code>{'Xs': pairs, 'Ys': otherPairs} | 
beam.CoGroupByKey()&lt;/code>&lt;/td>
 &lt;/tr>
-&lt;/tbody>
 &lt;/table>&lt;/div>
 &lt;blockquote>
 &lt;p>ℹ️ To learn more about the transforms available in Beam, check the
diff --git a/website/generated-content/sitemap.xml 
b/website/generated-content/sitemap.xml
index babe8bdfd75..93820975b1a 100644
--- a/website/generated-content/sitemap.xml
+++ b/website/generated-content/sitemap.xml
@@ -1 +1 @@
-<?xml version="1.0" encoding="utf-8" standalone="yes"?><urlset 
xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"; 
xmlns:xhtml="http://www.w3.org/1999/xhtml";><url><loc>/case-studies/intuit/</loc><lastmod>2022-08-18T01:27:08+06:00</lastmod></url><url><loc>/blog/go-2.40/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/categories/blog/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/blog/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/c
 [...]
\ No newline at end of file
+<?xml version="1.0" encoding="utf-8" standalone="yes"?><urlset 
xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"; 
xmlns:xhtml="http://www.w3.org/1999/xhtml";><url><loc>/case-studies/intuit/</loc><lastmod>2022-08-18T01:27:08+06:00</lastmod></url><url><loc>/blog/go-2.40/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/categories/blog/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/blog/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/c
 [...]
\ No newline at end of file

Reply via email to