Author: matei
Date: Tue Aug 26 01:53:10 2014
New Revision: 1620493
URL: http://svn.apache.org/r1620493
Log:
Updated screecast links to work over HTTPS too
Modified:
spark/screencasts/_posts/2013-04-10-1-first-steps-with-spark.md
spark/screencasts/_posts/2013-04-11-2-spark-documentation-overview.md
spark/screencasts/_posts/2013-04-16-3-transformations-and-caching.md
spark/screencasts/_posts/2013-08-26-4-a-standalone-job-in-spark.md
spark/site/downloads.html
spark/site/news/amp-camp-2013-registration-ope.html
spark/site/news/index.html
spark/site/news/run-spark-and-shark-on-amazon-emr.html
spark/site/news/spark-0-6-1-and-0-5-2-released.html
spark/site/news/spark-0-7-0-released.html
spark/site/news/spark-0-7-2-released.html
spark/site/news/spark-0-7-3-released.html
spark/site/news/spark-0-8-0-released.html
spark/site/news/spark-0-8-1-released.html
spark/site/news/spark-0-9-0-released.html
spark/site/news/spark-1-0-0-released.html
spark/site/news/spark-1-0-1-released.html
spark/site/news/spark-and-shark-in-the-news.html
spark/site/news/spark-becomes-tlp.html
spark/site/news/spark-meetups.html
spark/site/news/spark-user-survey-and-powered-by-page.html
spark/site/news/strata-exercises-now-available-online.html
spark/site/news/submit-talks-to-spark-summit-2014.html
spark/site/news/two-weeks-to-spark-summit-2014.html
spark/site/news/video-from-first-spark-development-meetup.html
spark/site/releases/spark-release-0-3.html
spark/site/releases/spark-release-0-5-0.html
spark/site/releases/spark-release-0-5-1.html
spark/site/releases/spark-release-0-6-0.html
spark/site/releases/spark-release-0-7-0.html
spark/site/releases/spark-release-0-8-0.html
spark/site/releases/spark-release-0-8-1.html
spark/site/releases/spark-release-0-9-0.html
spark/site/releases/spark-release-0-9-1.html
spark/site/releases/spark-release-0-9-2.html
spark/site/releases/spark-release-1-0-0.html
spark/site/releases/spark-release-1-0-1.html
spark/site/releases/spark-release-1-0-2.html
spark/site/screencasts/1-first-steps-with-spark.html
spark/site/screencasts/2-spark-documentation-overview.html
spark/site/screencasts/3-transformations-and-caching.html
spark/site/screencasts/4-a-standalone-job-in-spark.html
Modified: spark/screencasts/_posts/2013-04-10-1-first-steps-with-spark.md
URL:
http://svn.apache.org/viewvc/spark/screencasts/_posts/2013-04-10-1-first-steps-with-spark.md?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/screencasts/_posts/2013-04-10-1-first-steps-with-spark.md (original)
+++ spark/screencasts/_posts/2013-04-10-1-first-steps-with-spark.md Tue Aug 26
01:53:10 2014
@@ -18,7 +18,7 @@ This screencast marks the beginning of a
<li>Introduce the API using the Spark interactive shell to explore a
file.</li>
</ol>
-<div class="video-container video-square shadow"><iframe width="755"
height="705"
src="http://www.youtube.com/embed/bWorBGOFBWY?autohide=0&showinfo=0&list=PL-x35fyliRwhKT-NpTKprPW1bkbdDcTTW"
frameborder="0" allowfullscreen></iframe></div>
+<div class="video-container video-square shadow"><iframe width="755"
height="705"
src="//www.youtube.com/embed/bWorBGOFBWY?autohide=0&showinfo=0&list=PL-x35fyliRwhKT-NpTKprPW1bkbdDcTTW"
frameborder="0" allowfullscreen></iframe></div>
Check out the next spark screencast in the series, <a
href="{{site.url}}screencasts/2-spark-documentation-overview.html">Spark
Screencast #2 - Overview of Spark Documentation</a>.
Modified: spark/screencasts/_posts/2013-04-11-2-spark-documentation-overview.md
URL:
http://svn.apache.org/viewvc/spark/screencasts/_posts/2013-04-11-2-spark-documentation-overview.md?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/screencasts/_posts/2013-04-11-2-spark-documentation-overview.md
(original)
+++ spark/screencasts/_posts/2013-04-11-2-spark-documentation-overview.md Tue
Aug 26 01:53:10 2014
@@ -10,7 +10,7 @@ published: true
---
This is our 2nd Spark screencast. In it, we take a tour of the documentation
available for Spark users online.
-<div class="video-container video-square shadow"><iframe width="755"
height="705"
src="http://www.youtube.com/embed/Dbqe_rv-NJQ?autohide=0&showinfo=0&list=PL-x35fyliRwhKT-NpTKprPW1bkbdDcTTW"
frameborder="0" allowfullscreen></iframe></div>
+<div class="video-container video-square shadow"><iframe width="755"
height="705"
src="//www.youtube.com/embed/Dbqe_rv-NJQ?autohide=0&showinfo=0&list=PL-x35fyliRwhKT-NpTKprPW1bkbdDcTTW"
frameborder="0" allowfullscreen></iframe></div>
Check out the next spark screencast in the series, <a
href="{{site.url}}screencasts/3-transformations-and-caching.html">Spark
Screencast #3 - Transformations and Caching</a>.
Modified: spark/screencasts/_posts/2013-04-16-3-transformations-and-caching.md
URL:
http://svn.apache.org/viewvc/spark/screencasts/_posts/2013-04-16-3-transformations-and-caching.md?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/screencasts/_posts/2013-04-16-3-transformations-and-caching.md
(original)
+++ spark/screencasts/_posts/2013-04-16-3-transformations-and-caching.md Tue
Aug 26 01:53:10 2014
@@ -10,7 +10,7 @@ published: true
---
In this third Spark screencast, we demonstrate more advanced use of RDD
actions and transformations, as well as caching RDDs in memory.
-<div class="video-container video-square shadow"><iframe width="755"
height="705"
src="http://www.youtube.com/embed/TtvxKzO9jXE?autohide=0&showinfo=0&list=PL-x35fyliRwhKT-NpTKprPW1bkbdDcTTW"
frameborder="0" allowfullscreen></iframe></div>
+<div class="video-container video-square shadow"><iframe width="755"
height="705"
src="//www.youtube.com/embed/TtvxKzO9jXE?autohide=0&showinfo=0&list=PL-x35fyliRwhKT-NpTKprPW1bkbdDcTTW"
frameborder="0" allowfullscreen></iframe></div>
Check out the next spark screencast in the series, <a
href="{{site.url}}screencasts/4-a-standalone-job-in-spark.html">Spark
Screencast #4 - A Standalone Job in Scala</a>.
Modified: spark/screencasts/_posts/2013-08-26-4-a-standalone-job-in-spark.md
URL:
http://svn.apache.org/viewvc/spark/screencasts/_posts/2013-08-26-4-a-standalone-job-in-spark.md?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/screencasts/_posts/2013-08-26-4-a-standalone-job-in-spark.md
(original)
+++ spark/screencasts/_posts/2013-08-26-4-a-standalone-job-in-spark.md Tue Aug
26 01:53:10 2014
@@ -11,6 +11,6 @@ published: true
In this Spark screencast, we create a standalone Apache Spark job in Scala. In
the job, we create a spark context and read a file into an RDD of strings; then
apply transformations and actions to the RDD and print out the results.
-<div class="video-container video-16x9 shadow"><iframe width="755"
height="425"
src="http://www.youtube.com/embed/GaBn-YjlR8Q?autohide=0&showinfo=0&list=PL-x35fyliRwhKT-NpTKprPW1bkbdDcTTW"
frameborder="0" allowfullscreen></iframe></div>
+<div class="video-container video-16x9 shadow"><iframe width="755"
height="425"
src="//www.youtube.com/embed/GaBn-YjlR8Q?autohide=0&showinfo=0&list=PL-x35fyliRwhKT-NpTKprPW1bkbdDcTTW"
frameborder="0" allowfullscreen></iframe></div>
For more information and links to other Spark screencasts, check out the <a
href="{{site.url}}documentation.html">Spark documentation page</a>.
Modified: spark/site/downloads.html
URL:
http://svn.apache.org/viewvc/spark/site/downloads.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/downloads.html (original)
+++ spark/site/downloads.html Tue Aug 26 01:53:10 2014
@@ -197,7 +197,7 @@ version: 1.0.2
<h3 id="development-version">Development Version</h3>
<p>If you are interested in working with the newest under-development code or
contributing to Spark development, you can also check out the master branch
from Git: <tt>git clone git://github.com/apache/spark.git</tt>.</p>
-<p>Once you’ve downloaded Spark, you can find instructions for
installing and building it on the <a href="/documentation.html">documentation
page</a>.</p>
+<p>Once youâve downloaded Spark, you can find instructions for installing
and building it on the <a href="/documentation.html">documentation page</a>.</p>
<h3 id="all-releases">All Releases</h3>
<ul>
Modified: spark/site/news/amp-camp-2013-registration-ope.html
URL:
http://svn.apache.org/viewvc/spark/site/news/amp-camp-2013-registration-ope.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/amp-camp-2013-registration-ope.html (original)
+++ spark/site/news/amp-camp-2013-registration-ope.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Registration open for AMP Camp training camp in Berkeley</h2>
-<p>Want to learn how to use Spark, Shark, GraphX, and related technologies in
person? The AMP Lab is hosting a two-day training workshop for them on August
29th and 30th in Berkeley. The workshop will include tutorials, talks from
users, and over four hours of hands-on exercises. <a
href="http://ampcamp.berkeley.edu/amp-camp-three-berkeley-2013/">Registration
is now open on the AMP Camp website</a>, for a price of $250 per person. We
recommend signing up early because last year’s workshop was sold out.</p>
+<p>Want to learn how to use Spark, Shark, GraphX, and related technologies in
person? The AMP Lab is hosting a two-day training workshop for them on August
29th and 30th in Berkeley. The workshop will include tutorials, talks from
users, and over four hours of hands-on exercises. <a
href="http://ampcamp.berkeley.edu/amp-camp-three-berkeley-2013/">Registration
is now open on the AMP Camp website</a>, for a price of $250 per person. We
recommend signing up early because last yearâs workshop was sold out.</p>
<p>
Modified: spark/site/news/index.html
URL:
http://svn.apache.org/viewvc/spark/site/news/index.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/index.html (original)
+++ spark/site/news/index.html Tue Aug 26 01:53:10 2014
@@ -196,7 +196,7 @@ Contributions to this release came from
<h3 class="entry-title"><a href="/news/spark-1-0-1-released.html">Spark
1.0.1 released</a></h3>
<div class="entry-date">July 11, 2014</div>
</header>
- <div class="entry-content"><p>We are happy to announce the availability of
<a href="/releases/spark-release-1-0-1.html" title="Spark Release 1.0.1">Spark
1.0.1</a>! This release includes contributions from 70 developers. Spark 1.0.0
includes fixes across several areas of Spark, including the core API, PySpark,
and MLlib. It also includes new features in Spark’s (alpha) SQL library,
including support for JSON data and performance and stability fixes.</p>
+ <div class="entry-content"><p>We are happy to announce the availability of
<a href="/releases/spark-release-1-0-1.html" title="Spark Release 1.0.1">Spark
1.0.1</a>! This release includes contributions from 70 developers. Spark 1.0.0
includes fixes across several areas of Spark, including the core API, PySpark,
and MLlib. It also includes new features in Sparkâs (alpha) SQL library,
including support for JSON data and performance and stability fixes.</p>
</div>
</article>
@@ -219,8 +219,8 @@ organizations using Spark, focused on us
<h3 class="entry-title"><a href="/news/spark-1-0-0-released.html">Spark
1.0.0 released</a></h3>
<div class="entry-date">May 30, 2014</div>
</header>
- <div class="entry-content"><p>We are happy to announce the availability of
<a href="/releases/spark-release-1-0-0.html" title="Spark Release 1.0.0">Spark
1.0.0</a>! Spark 1.0.0 is the first in the 1.X line of releases, providing API
stability for Spark’s core interfaces. It is Spark’s largest
release ever, with contributions from 117 developers.
-This release expands Spark’s standard libraries, introducing a new SQL
package (Spark SQL) that lets users integrate SQL queries into existing Spark
workflows. MLlib, Spark’s machine learning library, is expanded with
sparse vector support and several new algorithms. The GraphX and Streaming
libraries also introduce new features and optimizations. Spark’s core
engine adds support for secured YARN clusters, a unified tool for submitting
Spark applications, and several performance and stability improvements.</p>
+ <div class="entry-content"><p>We are happy to announce the availability of
<a href="/releases/spark-release-1-0-0.html" title="Spark Release 1.0.0">Spark
1.0.0</a>! Spark 1.0.0 is the first in the 1.X line of releases, providing API
stability for Sparkâs core interfaces. It is Sparkâs largest release ever,
with contributions from 117 developers.
+This release expands Sparkâs standard libraries, introducing a new SQL
package (Spark SQL) that lets users integrate SQL queries into existing Spark
workflows. MLlib, Sparkâs machine learning library, is expanded with sparse
vector support and several new algorithms. The GraphX and Streaming libraries
also introduce new features and optimizations. Sparkâs core engine adds
support for secured YARN clusters, a unified tool for submitting Spark
applications, and several performance and stability improvements.</p>
</div>
</article>
@@ -257,7 +257,7 @@ Contributions to this release came from
<h3 class="entry-title"><a
href="/news/submit-talks-to-spark-summit-2014.html">Submissions and
registration open for Spark Summit 2014</a></h3>
<div class="entry-date">March 20, 2014</div>
</header>
- <div class="entry-content"><p>After last year’s successful <a
href="http://spark-summit.org/2013">first Spark Summit</a>, registrations
+ <div class="entry-content"><p>After last yearâs successful <a
href="http://spark-summit.org/2013">first Spark Summit</a>, registrations
and talk submissions are now open for <a
href="http://spark-summit.org/2014">Spark Summit 2014</a>.
This will be a 3-day event in San Francisco organized by multiple companies in
the Spark community.
The event will run <strong>June 30th to July 2nd</strong> in San Francisco,
CA.</p>
@@ -270,7 +270,7 @@ The event will run <strong>June 30th to
<h3 class="entry-title"><a href="/news/spark-becomes-tlp.html">Spark
becomes top-level Apache project</a></h3>
<div class="entry-date">February 27, 2014</div>
</header>
- <div class="entry-content"><p>The Apache Software Foundation <a
href="https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces50">announced</a>
today that Spark has graduated from the Apache Incubator to become a top-level
Apache project, signifying that the project’s community and products have
been well-governed under the ASF’s meritocratic process and principles.
This is a major step for the community and we are very proud to share this news
with users as we complete Spark’s move to Apache. Read more about
Spark’s growth during the past year and from contributors and users in
the ASF’s <a
href="https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces50">press
release</a>.</p>
+ <div class="entry-content"><p>The Apache Software Foundation <a
href="https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces50">announced</a>
today that Spark has graduated from the Apache Incubator to become a top-level
Apache project, signifying that the projectâs community and products have
been well-governed under the ASFâs meritocratic process and principles. This
is a major step for the community and we are very proud to share this news with
users as we complete Sparkâs move to Apache. Read more about Sparkâs growth
during the past year and from contributors and users in the ASFâs <a
href="https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces50">press
release</a>.</p>
</div>
</article>
@@ -281,8 +281,8 @@ The event will run <strong>June 30th to
<div class="entry-date">February 2, 2014</div>
</header>
<div class="entry-content"><p>We are happy to announce the availability of
<a href="/releases/spark-release-0-9-0.html" title="Spark Release 0.9.0">
-Spark 0.9.0</a>! Spark 0.9.0 is a major release and Spark’s largest
release ever, with contributions from 83 developers.
-This release expands Spark’s standard libraries, introducing a new graph
computation package (GraphX) and adding several new features to the machine
learning and stream-processing packages. It also makes major improvements to
the core engine,
+Spark 0.9.0</a>! Spark 0.9.0 is a major release and Sparkâs largest release
ever, with contributions from 83 developers.
+This release expands Sparkâs standard libraries, introducing a new graph
computation package (GraphX) and adding several new features to the machine
learning and stream-processing packages. It also makes major improvements to
the core engine,
including external aggregations, a simplified H/A mode for long lived
applications, and
hardened YARN support.</p>
@@ -294,7 +294,7 @@ hardened YARN support.</p>
<h3 class="entry-title"><a href="/news/spark-0-8-1-released.html">Spark
0.8.1 released</a></h3>
<div class="entry-date">December 19, 2013</div>
</header>
- <div class="entry-content"><p>We’ve just posted <a
href="/releases/spark-release-0-8-1.html" title="Spark Release 0.8.1">Spark
Release 0.8.1</a>, a maintenance and performance release for the Scala 2.9
version of Spark. 0.8.1 includes support for YARN 2.2, a high availability mode
for the standalone scheduler, optimizations to the shuffle, and many other
improvements. We recommend that all users update to this release. Visit the <a
href="/releases/spark-release-0-8-1.html" title="Spark Release 0.8.1">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
+ <div class="entry-content"><p>Weâve just posted <a
href="/releases/spark-release-0-8-1.html" title="Spark Release 0.8.1">Spark
Release 0.8.1</a>, a maintenance and performance release for the Scala 2.9
version of Spark. 0.8.1 includes support for YARN 2.2, a high availability mode
for the standalone scheduler, optimizations to the shuffle, and many other
improvements. We recommend that all users update to this release. Visit the <a
href="/releases/spark-release-0-8-1.html" title="Spark Release 0.8.1">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
</div>
</article>
@@ -325,7 +325,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a href="/news/spark-0-8-0-released.html">Spark
0.8.0 released</a></h3>
<div class="entry-date">September 25, 2013</div>
</header>
- <div class="entry-content"><p>We’re proud to announce the release of
<a href="/releases/spark-release-0-8-0.html" title="Spark Release 0.8.0">Apache
Spark 0.8.0</a>. Spark 0.8.0 is a major release that includes many new
capabilities and usability improvements. Itâs also our first release under
the Apache incubator. It is the largest Spark release yet, with contributions
from 67 developers and 24 companies. Major new features include an expanded
monitoring framework and UI, a machine learning library, and support for
running Spark inside of YARN.</p>
+ <div class="entry-content"><p>Weâre proud to announce the release of <a
href="/releases/spark-release-0-8-0.html" title="Spark Release 0.8.0">Apache
Spark 0.8.0</a>. Spark 0.8.0 is a major release that includes many new
capabilities and usability improvements. Itâs also our first release under
the Apache incubator. It is the largest Spark release yet, with contributions
from 67 developers and 24 companies. Major new features include an expanded
monitoring framework and UI, a machine learning library, and support for
running Spark inside of YARN.</p>
</div>
</article>
@@ -335,7 +335,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a
href="/news/spark-user-survey-and-powered-by-page.html">Spark user survey and
"Powered By" page</a></h3>
<div class="entry-date">September 5, 2013</div>
</header>
- <div class="entry-content"><p>As we continue developing Spark, we would
love to get feedback from users and hear what you’d like us to work on
next. We’ve decided that a good way to do that is a survey – we
hope to run this at regular intervals. If you have a few minutes to
participate, <a
href="https://docs.google.com/forms/d/1eMXp4GjcIXglxJe5vYYBzXKVm-6AiYt1KThJwhCjJiY/viewform">fill
in the survey here</a>. Your time is greatly appreciated.</p>
+ <div class="entry-content"><p>As we continue developing Spark, we would
love to get feedback from users and hear what youâd like us to work on next.
Weâve decided that a good way to do that is a survey â we hope to run this
at regular intervals. If you have a few minutes to participate, <a
href="https://docs.google.com/forms/d/1eMXp4GjcIXglxJe5vYYBzXKVm-6AiYt1KThJwhCjJiY/viewform">fill
in the survey here</a>. Your time is greatly appreciated.</p>
</div>
</article>
@@ -355,7 +355,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a
href="/news/amp-camp-2013-registration-ope.html">Registration open for AMP Camp
training camp in Berkeley</a></h3>
<div class="entry-date">July 23, 2013</div>
</header>
- <div class="entry-content"><p>Want to learn how to use Spark, Shark,
GraphX, and related technologies in person? The AMP Lab is hosting a two-day
training workshop for them on August 29th and 30th in Berkeley. The workshop
will include tutorials, talks from users, and over four hours of hands-on
exercises. <a
href="http://ampcamp.berkeley.edu/amp-camp-three-berkeley-2013/">Registration
is now open on the AMP Camp website</a>, for a price of $250 per person. We
recommend signing up early because last year’s workshop was sold out.</p>
+ <div class="entry-content"><p>Want to learn how to use Spark, Shark,
GraphX, and related technologies in person? The AMP Lab is hosting a two-day
training workshop for them on August 29th and 30th in Berkeley. The workshop
will include tutorials, talks from users, and over four hours of hands-on
exercises. <a
href="http://ampcamp.berkeley.edu/amp-camp-three-berkeley-2013/">Registration
is now open on the AMP Camp website</a>, for a price of $250 per person. We
recommend signing up early because last yearâs workshop was sold out.</p>
</div>
</article>
@@ -386,7 +386,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a href="/news/spark-0-7-3-released.html">Spark
0.7.3 released</a></h3>
<div class="entry-date">July 16, 2013</div>
</header>
- <div class="entry-content"><p>We’ve just posted <a
href="/releases/spark-release-0-7-3.html" title="Spark Release 0.7.3">Spark
Release 0.7.3</a>, a maintenance release that contains several fixes, including
streaming API updates and new functionality for adding JARs to a
<code>spark-shell</code> session. We recommend that all users update to this
release. Visit the <a href="/releases/spark-release-0-7-3.html" title="Spark
Release 0.7.3">release notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
+ <div class="entry-content"><p>Weâve just posted <a
href="/releases/spark-release-0-7-3.html" title="Spark Release 0.7.3">Spark
Release 0.7.3</a>, a maintenance release that contains several fixes, including
streaming API updates and new functionality for adding JARs to a
<code>spark-shell</code> session. We recommend that all users update to this
release. Visit the <a href="/releases/spark-release-0-7-3.html" title="Spark
Release 0.7.3">release notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
</div>
</article>
@@ -416,7 +416,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a href="/news/spark-0-7-2-released.html">Spark
0.7.2 released</a></h3>
<div class="entry-date">June 2, 2013</div>
</header>
- <div class="entry-content"><p>We’re happy to announce the release of
<a href="/releases/spark-release-0-7-2.html" title="Spark Release 0.7.2">Spark
0.7.2</a>, a new maintenance release that includes several bug fixes and
improvements, as well as new code examples and API features. We recommend that
all users update to this release. Head over to the <a
href="/releases/spark-release-0-7-2.html" title="Spark Release 0.7.2">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
+ <div class="entry-content"><p>Weâre happy to announce the release of <a
href="/releases/spark-release-0-7-2.html" title="Spark Release 0.7.2">Spark
0.7.2</a>, a new maintenance release that includes several bug fixes and
improvements, as well as new code examples and API features. We recommend that
all users update to this release. Head over to the <a
href="/releases/spark-release-0-7-2.html" title="Spark Release 0.7.2">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
</div>
</article>
@@ -442,7 +442,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a
href="/news/strata-exercises-now-available-online.html">Strata exercises now
available online</a></h3>
<div class="entry-date">March 17, 2013</div>
</header>
- <div class="entry-content"><p>At this year’s <a
href="http://strataconf.com/strata2013">Strata</a> conference, the AMP Lab
hosted a full day of tutorials on Spark, Shark, and Spark Streaming, including
online exercises on Amazon EC2. Those exercises are now <a
href="http://ampcamp.berkeley.edu/big-data-mini-course/">available online</a>,
letting you learn Spark and Shark at your own pace on an EC2 cluster with real
data. They are a great resource for learning the systems. You can also find <a
href="http://ampcamp.berkeley.edu/amp-camp-two-strata-2013/">slides</a> from
the Strata tutorials online, as well as <a
href="http://ampcamp.berkeley.edu/amp-camp-one-berkeley-2012/">videos</a> from
the AMP Camp workshop we held at Berkeley in August.</p>
+ <div class="entry-content"><p>At this yearâs <a
href="http://strataconf.com/strata2013">Strata</a> conference, the AMP Lab
hosted a full day of tutorials on Spark, Shark, and Spark Streaming, including
online exercises on Amazon EC2. Those exercises are now <a
href="http://ampcamp.berkeley.edu/big-data-mini-course/">available online</a>,
letting you learn Spark and Shark at your own pace on an EC2 cluster with real
data. They are a great resource for learning the systems. You can also find <a
href="http://ampcamp.berkeley.edu/amp-camp-two-strata-2013/">slides</a> from
the Strata tutorials online, as well as <a
href="http://ampcamp.berkeley.edu/amp-camp-one-berkeley-2012/">videos</a> from
the AMP Camp workshop we held at Berkeley in August.</p>
</div>
</article>
@@ -452,7 +452,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a href="/news/spark-0-7-0-released.html">Spark
0.7.0 released</a></h3>
<div class="entry-date">February 27, 2013</div>
</header>
- <div class="entry-content"><p>We’re proud to announce the release of
<a href="/releases/spark-release-0-7-0.html" title="Spark Release 0.7.0">Spark
0.7.0</a>, a new major version of Spark that adds several key features,
including a <a href="/docs/latest/python-programming-guide.html">Python API</a>
for Spark and an <a href="/docs/latest/streaming-programming-guide.html">alpha
of Spark Streaming</a>. This release is the result of the largest group of
contributors yet behind a Spark release – 31 contributors from inside and
outside Berkeley. Head over to the <a href="/releases/spark-release-0-7-0.html"
title="Spark Release 0.7.0">release notes</a> to read more about the new
features, or <a href="/downloads.html">download</a> the release today.</p>
+ <div class="entry-content"><p>Weâre proud to announce the release of <a
href="/releases/spark-release-0-7-0.html" title="Spark Release 0.7.0">Spark
0.7.0</a>, a new major version of Spark that adds several key features,
including a <a href="/docs/latest/python-programming-guide.html">Python API</a>
for Spark and an <a href="/docs/latest/streaming-programming-guide.html">alpha
of Spark Streaming</a>. This release is the result of the largest group of
contributors yet behind a Spark release â 31 contributors from inside and
outside Berkeley. Head over to the <a href="/releases/spark-release-0-7-0.html"
title="Spark Release 0.7.0">release notes</a> to read more about the new
features, or <a href="/downloads.html">download</a> the release today.</p>
</div>
</article>
@@ -462,7 +462,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a
href="/news/run-spark-and-shark-on-amazon-emr.html">Spark/Shark Tutorial for
Amazon EMR</a></h3>
<div class="entry-date">February 24, 2013</div>
</header>
- <div class="entry-content"><p>This weekend, Amazon posted an <a
href="http://aws.amazon.com/articles/Elastic-MapReduce/4926593393724923">article</a>
and code that make it easy to launch Spark and Shark on Elastic MapReduce. The
article includes examples of how to run both interactive Scala commands and SQL
queries from Shark on data in S3. Head over to the <a
href="http://aws.amazon.com/articles/Elastic-MapReduce/4926593393724923">Amazon
article</a> for details. We’re very excited because, to our knowledge,
this makes Spark the first non-Hadoop engine that you can launch with EMR.</p>
+ <div class="entry-content"><p>This weekend, Amazon posted an <a
href="http://aws.amazon.com/articles/Elastic-MapReduce/4926593393724923">article</a>
and code that make it easy to launch Spark and Shark on Elastic MapReduce. The
article includes examples of how to run both interactive Scala commands and SQL
queries from Shark on data in S3. Head over to the <a
href="http://aws.amazon.com/articles/Elastic-MapReduce/4926593393724923">Amazon
article</a> for details. Weâre very excited because, to our knowledge, this
makes Spark the first non-Hadoop engine that you can launch with EMR.</p>
</div>
</article>
@@ -497,7 +497,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a
href="/news/video-from-first-spark-development-meetup.html">Video up from first
Spark development meetup</a></h3>
<div class="entry-date">December 21, 2012</div>
</header>
- <div class="entry-content"><p>On December 18th, we held the first of a
series of Spark development meetups, for people interested in learning the
Spark codebase and contributing to the project. There was quite a bit more
demand than we anticipated, with over 80 people signing up and 64 attending.
The first meetup was an <a
href="http://www.meetup.com/spark-users/events/94101942/">introduction to Spark
internals</a>. Thanks to one of the attendees, there’s now a <a
href="http://www.youtube.com/watch?v=49Hr5xZyTEA">video of the meetup</a> on
YouTube. We’ve also posted the <a
href="http://files.meetup.com/3138542/dev-meetup-dec-2012.pptx">slides</a>.
Look to see more development meetups on Spark and Shark in the future.</p>
+ <div class="entry-content"><p>On December 18th, we held the first of a
series of Spark development meetups, for people interested in learning the
Spark codebase and contributing to the project. There was quite a bit more
demand than we anticipated, with over 80 people signing up and 64 attending.
The first meetup was an <a
href="http://www.meetup.com/spark-users/events/94101942/">introduction to Spark
internals</a>. Thanks to one of the attendees, thereâs now a <a
href="http://www.youtube.com/watch?v=49Hr5xZyTEA">video of the meetup</a> on
YouTube. Weâve also posted the <a
href="http://files.meetup.com/3138542/dev-meetup-dec-2012.pptx">slides</a>.
Look to see more development meetups on Spark and Shark in the future.</p>
</div>
</article>
@@ -507,7 +507,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a
href="/news/spark-and-shark-in-the-news.html">Spark in the news</a></h3>
<div class="entry-date">December 21, 2012</div>
</header>
- <div class="entry-content"><p>Recently, we’ve seen quite a bit of
coverage of Spark in the news. I wanted to list some of the more recent
articles, for readers interested in learning more.</p>
+ <div class="entry-content"><p>Recently, weâve seen quite a bit of
coverage of Spark in the news. I wanted to list some of the more recent
articles, for readers interested in learning more.</p>
<ul>
<li>Curt Monash, editor of the popular DBMS2 blog, wrote a great <a
href="http://www.dbms2.com/2012/12/13/introduction-to-spark-shark-bdas-and-amplab/">introduction
to Spark and Shark</a>, as well as a more detailed <a
href="http://www.dbms2.com/2012/12/13/spark-shark-and-rdds-technology-notes/">technical
overview</a>.</li>
@@ -517,7 +517,7 @@ Over 450 Spark developers and enthusiast
<li><a
href="http://data-informed.com/spark-an-open-source-engine-for-iterative-data-mining/">DataInformed</a>
interviewed two Spark users and wrote about their applications in anomaly
detection, predictive analytics and data mining.</li>
</ul>
-<p>In other news, there will be a full day of tutorials on Spark and Shark at
the <a href="http://strataconf.com/strata2013">O’Reilly Strata
conference</a> in February. They include a three-hour <a
href="http://strataconf.com/strata2013/public/schedule/detail/27438">introduction
to Spark, Shark and BDAS</a> Tuesday morning, and a three-hour <a
href="http://strataconf.com/strata2013/public/schedule/detail/27440">hands-on
exercise session</a>. </p>
+<p>In other news, there will be a full day of tutorials on Spark and Shark at
the <a href="http://strataconf.com/strata2013">OâReilly Strata conference</a>
in February. They include a three-hour <a
href="http://strataconf.com/strata2013/public/schedule/detail/27438">introduction
to Spark, Shark and BDAS</a> Tuesday morning, and a three-hour <a
href="http://strataconf.com/strata2013/public/schedule/detail/27440">hands-on
exercise session</a>. </p>
</div>
</article>
@@ -527,7 +527,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a
href="/news/spark-0-6-1-and-0-5-2-released.html">Spark 0.6.1 and 0.5.2
out</a></h3>
<div class="entry-date">November 22, 2012</div>
</header>
- <div class="entry-content"><p>Today we’ve made available two
maintenance releases for Spark: <a href="/releases/spark-release-0-6-1.html"
title="Spark Release 0.6.1">0.6.1</a> and <a
href="/releases/spark-release-0-5-2.html" title="Spark Release
0.5.2">0.5.2</a>. They both contain important bug fixes as well as some new
features, such as the ability to build against Hadoop 2 distributions. We
recommend that users update to the latest version for their branch; for new
users, we recommend <a href="/releases/spark-release-0-6-1.html" title="Spark
Release 0.6.1">0.6.1</a>.</p>
+ <div class="entry-content"><p>Today weâve made available two maintenance
releases for Spark: <a href="/releases/spark-release-0-6-1.html" title="Spark
Release 0.6.1">0.6.1</a> and <a href="/releases/spark-release-0-5-2.html"
title="Spark Release 0.5.2">0.5.2</a>. They both contain important bug fixes as
well as some new features, such as the ability to build against Hadoop 2
distributions. We recommend that users update to the latest version for their
branch; for new users, we recommend <a
href="/releases/spark-release-0-6-1.html" title="Spark Release
0.6.1">0.6.1</a>.</p>
</div>
</article>
@@ -557,7 +557,7 @@ Over 450 Spark developers and enthusiast
<h3 class="entry-title"><a href="/news/spark-meetups.html">We've started
hosting a Bay Area Spark User Meetup</a></h3>
<div class="entry-date">January 10, 2012</div>
</header>
- <div class="entry-content"><p>We’ve started hosting a regular <a
href="http://www.meetup.com/spark-users/">Bay Area Spark User Meetup</a>. Sign
up on the meetup.com page to be notified about events and meet other Spark
developers and users.</p>
+ <div class="entry-content"><p>Weâve started hosting a regular <a
href="http://www.meetup.com/spark-users/">Bay Area Spark User Meetup</a>. Sign
up on the meetup.com page to be notified about events and meet other Spark
developers and users.</p>
</div>
</article>
Modified: spark/site/news/run-spark-and-shark-on-amazon-emr.html
URL:
http://svn.apache.org/viewvc/spark/site/news/run-spark-and-shark-on-amazon-emr.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/run-spark-and-shark-on-amazon-emr.html (original)
+++ spark/site/news/run-spark-and-shark-on-amazon-emr.html Tue Aug 26 01:53:10
2014
@@ -160,7 +160,7 @@
<h2>Spark/Shark Tutorial for Amazon EMR</h2>
-<p>This weekend, Amazon posted an <a
href="http://aws.amazon.com/articles/Elastic-MapReduce/4926593393724923">article</a>
and code that make it easy to launch Spark and Shark on Elastic MapReduce. The
article includes examples of how to run both interactive Scala commands and SQL
queries from Shark on data in S3. Head over to the <a
href="http://aws.amazon.com/articles/Elastic-MapReduce/4926593393724923">Amazon
article</a> for details. We’re very excited because, to our knowledge,
this makes Spark the first non-Hadoop engine that you can launch with EMR.</p>
+<p>This weekend, Amazon posted an <a
href="http://aws.amazon.com/articles/Elastic-MapReduce/4926593393724923">article</a>
and code that make it easy to launch Spark and Shark on Elastic MapReduce. The
article includes examples of how to run both interactive Scala commands and SQL
queries from Shark on data in S3. Head over to the <a
href="http://aws.amazon.com/articles/Elastic-MapReduce/4926593393724923">Amazon
article</a> for details. Weâre very excited because, to our knowledge, this
makes Spark the first non-Hadoop engine that you can launch with EMR.</p>
<p>
Modified: spark/site/news/spark-0-6-1-and-0-5-2-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-0-6-1-and-0-5-2-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-0-6-1-and-0-5-2-released.html (original)
+++ spark/site/news/spark-0-6-1-and-0-5-2-released.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Spark 0.6.1 and 0.5.2 out</h2>
-<p>Today we’ve made available two maintenance releases for Spark: <a
href="/releases/spark-release-0-6-1.html" title="Spark Release 0.6.1">0.6.1</a>
and <a href="/releases/spark-release-0-5-2.html" title="Spark Release
0.5.2">0.5.2</a>. They both contain important bug fixes as well as some new
features, such as the ability to build against Hadoop 2 distributions. We
recommend that users update to the latest version for their branch; for new
users, we recommend <a href="/releases/spark-release-0-6-1.html" title="Spark
Release 0.6.1">0.6.1</a>.</p>
+<p>Today weâve made available two maintenance releases for Spark: <a
href="/releases/spark-release-0-6-1.html" title="Spark Release 0.6.1">0.6.1</a>
and <a href="/releases/spark-release-0-5-2.html" title="Spark Release
0.5.2">0.5.2</a>. They both contain important bug fixes as well as some new
features, such as the ability to build against Hadoop 2 distributions. We
recommend that users update to the latest version for their branch; for new
users, we recommend <a href="/releases/spark-release-0-6-1.html" title="Spark
Release 0.6.1">0.6.1</a>.</p>
<p>
Modified: spark/site/news/spark-0-7-0-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-0-7-0-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-0-7-0-released.html (original)
+++ spark/site/news/spark-0-7-0-released.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Spark 0.7.0 released</h2>
-<p>We’re proud to announce the release of <a
href="/releases/spark-release-0-7-0.html" title="Spark Release 0.7.0">Spark
0.7.0</a>, a new major version of Spark that adds several key features,
including a <a href="/docs/latest/python-programming-guide.html">Python API</a>
for Spark and an <a href="/docs/latest/streaming-programming-guide.html">alpha
of Spark Streaming</a>. This release is the result of the largest group of
contributors yet behind a Spark release – 31 contributors from inside and
outside Berkeley. Head over to the <a href="/releases/spark-release-0-7-0.html"
title="Spark Release 0.7.0">release notes</a> to read more about the new
features, or <a href="/downloads.html">download</a> the release today.</p>
+<p>Weâre proud to announce the release of <a
href="/releases/spark-release-0-7-0.html" title="Spark Release 0.7.0">Spark
0.7.0</a>, a new major version of Spark that adds several key features,
including a <a href="/docs/latest/python-programming-guide.html">Python API</a>
for Spark and an <a href="/docs/latest/streaming-programming-guide.html">alpha
of Spark Streaming</a>. This release is the result of the largest group of
contributors yet behind a Spark release â 31 contributors from inside and
outside Berkeley. Head over to the <a href="/releases/spark-release-0-7-0.html"
title="Spark Release 0.7.0">release notes</a> to read more about the new
features, or <a href="/downloads.html">download</a> the release today.</p>
<p>
Modified: spark/site/news/spark-0-7-2-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-0-7-2-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-0-7-2-released.html (original)
+++ spark/site/news/spark-0-7-2-released.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Spark 0.7.2 released</h2>
-<p>We’re happy to announce the release of <a
href="/releases/spark-release-0-7-2.html" title="Spark Release 0.7.2">Spark
0.7.2</a>, a new maintenance release that includes several bug fixes and
improvements, as well as new code examples and API features. We recommend that
all users update to this release. Head over to the <a
href="/releases/spark-release-0-7-2.html" title="Spark Release 0.7.2">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
+<p>Weâre happy to announce the release of <a
href="/releases/spark-release-0-7-2.html" title="Spark Release 0.7.2">Spark
0.7.2</a>, a new maintenance release that includes several bug fixes and
improvements, as well as new code examples and API features. We recommend that
all users update to this release. Head over to the <a
href="/releases/spark-release-0-7-2.html" title="Spark Release 0.7.2">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
<p>
Modified: spark/site/news/spark-0-7-3-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-0-7-3-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-0-7-3-released.html (original)
+++ spark/site/news/spark-0-7-3-released.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Spark 0.7.3 released</h2>
-<p>We’ve just posted <a href="/releases/spark-release-0-7-3.html"
title="Spark Release 0.7.3">Spark Release 0.7.3</a>, a maintenance release that
contains several fixes, including streaming API updates and new functionality
for adding JARs to a <code>spark-shell</code> session. We recommend that all
users update to this release. Visit the <a
href="/releases/spark-release-0-7-3.html" title="Spark Release 0.7.3">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
+<p>Weâve just posted <a href="/releases/spark-release-0-7-3.html"
title="Spark Release 0.7.3">Spark Release 0.7.3</a>, a maintenance release that
contains several fixes, including streaming API updates and new functionality
for adding JARs to a <code>spark-shell</code> session. We recommend that all
users update to this release. Visit the <a
href="/releases/spark-release-0-7-3.html" title="Spark Release 0.7.3">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
<p>
Modified: spark/site/news/spark-0-8-0-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-0-8-0-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-0-8-0-released.html (original)
+++ spark/site/news/spark-0-8-0-released.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Spark 0.8.0 released</h2>
-<p>We’re proud to announce the release of <a
href="/releases/spark-release-0-8-0.html" title="Spark Release 0.8.0">Apache
Spark 0.8.0</a>. Spark 0.8.0 is a major release that includes many new
capabilities and usability improvements. Itâs also our first release under
the Apache incubator. It is the largest Spark release yet, with contributions
from 67 developers and 24 companies. Major new features include an expanded
monitoring framework and UI, a machine learning library, and support for
running Spark inside of YARN.</p>
+<p>Weâre proud to announce the release of <a
href="/releases/spark-release-0-8-0.html" title="Spark Release 0.8.0">Apache
Spark 0.8.0</a>. Spark 0.8.0 is a major release that includes many new
capabilities and usability improvements. Itâs also our first release under
the Apache incubator. It is the largest Spark release yet, with contributions
from 67 developers and 24 companies. Major new features include an expanded
monitoring framework and UI, a machine learning library, and support for
running Spark inside of YARN.</p>
<p>
Modified: spark/site/news/spark-0-8-1-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-0-8-1-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-0-8-1-released.html (original)
+++ spark/site/news/spark-0-8-1-released.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Spark 0.8.1 released</h2>
-<p>We’ve just posted <a href="/releases/spark-release-0-8-1.html"
title="Spark Release 0.8.1">Spark Release 0.8.1</a>, a maintenance and
performance release for the Scala 2.9 version of Spark. 0.8.1 includes support
for YARN 2.2, a high availability mode for the standalone scheduler,
optimizations to the shuffle, and many other improvements. We recommend that
all users update to this release. Visit the <a
href="/releases/spark-release-0-8-1.html" title="Spark Release 0.8.1">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
+<p>Weâve just posted <a href="/releases/spark-release-0-8-1.html"
title="Spark Release 0.8.1">Spark Release 0.8.1</a>, a maintenance and
performance release for the Scala 2.9 version of Spark. 0.8.1 includes support
for YARN 2.2, a high availability mode for the standalone scheduler,
optimizations to the shuffle, and many other improvements. We recommend that
all users update to this release. Visit the <a
href="/releases/spark-release-0-8-1.html" title="Spark Release 0.8.1">release
notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
<p>
Modified: spark/site/news/spark-0-9-0-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-0-9-0-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-0-9-0-released.html (original)
+++ spark/site/news/spark-0-9-0-released.html Tue Aug 26 01:53:10 2014
@@ -161,8 +161,8 @@
<p>We are happy to announce the availability of <a
href="/releases/spark-release-0-9-0.html" title="Spark Release 0.9.0">
-Spark 0.9.0</a>! Spark 0.9.0 is a major release and Spark’s largest
release ever, with contributions from 83 developers.
-This release expands Spark’s standard libraries, introducing a new graph
computation package (GraphX) and adding several new features to the machine
learning and stream-processing packages. It also makes major improvements to
the core engine,
+Spark 0.9.0</a>! Spark 0.9.0 is a major release and Sparkâs largest release
ever, with contributions from 83 developers.
+This release expands Sparkâs standard libraries, introducing a new graph
computation package (GraphX) and adding several new features to the machine
learning and stream-processing packages. It also makes major improvements to
the core engine,
including external aggregations, a simplified H/A mode for long lived
applications, and
hardened YARN support.</p>
Modified: spark/site/news/spark-1-0-0-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-1-0-0-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-1-0-0-released.html (original)
+++ spark/site/news/spark-1-0-0-released.html Tue Aug 26 01:53:10 2014
@@ -160,8 +160,8 @@
<h2>Spark 1.0.0 released</h2>
-<p>We are happy to announce the availability of <a
href="/releases/spark-release-1-0-0.html" title="Spark Release 1.0.0">Spark
1.0.0</a>! Spark 1.0.0 is the first in the 1.X line of releases, providing API
stability for Spark’s core interfaces. It is Spark’s largest
release ever, with contributions from 117 developers.
-This release expands Spark’s standard libraries, introducing a new SQL
package (Spark SQL) that lets users integrate SQL queries into existing Spark
workflows. MLlib, Spark’s machine learning library, is expanded with
sparse vector support and several new algorithms. The GraphX and Streaming
libraries also introduce new features and optimizations. Spark’s core
engine adds support for secured YARN clusters, a unified tool for submitting
Spark applications, and several performance and stability improvements.</p>
+<p>We are happy to announce the availability of <a
href="/releases/spark-release-1-0-0.html" title="Spark Release 1.0.0">Spark
1.0.0</a>! Spark 1.0.0 is the first in the 1.X line of releases, providing API
stability for Sparkâs core interfaces. It is Sparkâs largest release ever,
with contributions from 117 developers.
+This release expands Sparkâs standard libraries, introducing a new SQL
package (Spark SQL) that lets users integrate SQL queries into existing Spark
workflows. MLlib, Sparkâs machine learning library, is expanded with sparse
vector support and several new algorithms. The GraphX and Streaming libraries
also introduce new features and optimizations. Sparkâs core engine adds
support for secured YARN clusters, a unified tool for submitting Spark
applications, and several performance and stability improvements.</p>
<p>Visit the <a href="/releases/spark-release-1-0-0.html" title="Spark Release
1.0.0">release notes</a> to read about the new features, or <a
href="/downloads.html">download</a> the release today.</p>
Modified: spark/site/news/spark-1-0-1-released.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-1-0-1-released.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-1-0-1-released.html (original)
+++ spark/site/news/spark-1-0-1-released.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Spark 1.0.1 released</h2>
-<p>We are happy to announce the availability of <a
href="/releases/spark-release-1-0-1.html" title="Spark Release 1.0.1">Spark
1.0.1</a>! This release includes contributions from 70 developers. Spark 1.0.0
includes fixes across several areas of Spark, including the core API, PySpark,
and MLlib. It also includes new features in Spark’s (alpha) SQL library,
including support for JSON data and performance and stability fixes.</p>
+<p>We are happy to announce the availability of <a
href="/releases/spark-release-1-0-1.html" title="Spark Release 1.0.1">Spark
1.0.1</a>! This release includes contributions from 70 developers. Spark 1.0.0
includes fixes across several areas of Spark, including the core API, PySpark,
and MLlib. It also includes new features in Sparkâs (alpha) SQL library,
including support for JSON data and performance and stability fixes.</p>
<p>Visit the <a href="/releases/spark-release-1-0-1.html" title="Spark Release
1.0.1">release notes</a> to read about this release or <a
href="/downloads.html">download</a> the release today.</p>
Modified: spark/site/news/spark-and-shark-in-the-news.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-and-shark-in-the-news.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-and-shark-in-the-news.html (original)
+++ spark/site/news/spark-and-shark-in-the-news.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Spark in the news</h2>
-<p>Recently, we’ve seen quite a bit of coverage of Spark in the news. I
wanted to list some of the more recent articles, for readers interested in
learning more.</p>
+<p>Recently, weâve seen quite a bit of coverage of Spark in the news. I
wanted to list some of the more recent articles, for readers interested in
learning more.</p>
<ul>
<li>Curt Monash, editor of the popular DBMS2 blog, wrote a great <a
href="http://www.dbms2.com/2012/12/13/introduction-to-spark-shark-bdas-and-amplab/">introduction
to Spark and Shark</a>, as well as a more detailed <a
href="http://www.dbms2.com/2012/12/13/spark-shark-and-rdds-technology-notes/">technical
overview</a>.</li>
@@ -170,7 +170,7 @@
<li><a
href="http://data-informed.com/spark-an-open-source-engine-for-iterative-data-mining/">DataInformed</a>
interviewed two Spark users and wrote about their applications in anomaly
detection, predictive analytics and data mining.</li>
</ul>
-<p>In other news, there will be a full day of tutorials on Spark and Shark at
the <a href="http://strataconf.com/strata2013">O’Reilly Strata
conference</a> in February. They include a three-hour <a
href="http://strataconf.com/strata2013/public/schedule/detail/27438">introduction
to Spark, Shark and BDAS</a> Tuesday morning, and a three-hour <a
href="http://strataconf.com/strata2013/public/schedule/detail/27440">hands-on
exercise session</a>. </p>
+<p>In other news, there will be a full day of tutorials on Spark and Shark at
the <a href="http://strataconf.com/strata2013">OâReilly Strata conference</a>
in February. They include a three-hour <a
href="http://strataconf.com/strata2013/public/schedule/detail/27438">introduction
to Spark, Shark and BDAS</a> Tuesday morning, and a three-hour <a
href="http://strataconf.com/strata2013/public/schedule/detail/27440">hands-on
exercise session</a>. </p>
<p>
Modified: spark/site/news/spark-becomes-tlp.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-becomes-tlp.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-becomes-tlp.html (original)
+++ spark/site/news/spark-becomes-tlp.html Tue Aug 26 01:53:10 2014
@@ -160,9 +160,9 @@
<h2>Spark becomes top-level Apache project</h2>
-<p>The Apache Software Foundation <a
href="https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces50">announced</a>
today that Spark has graduated from the Apache Incubator to become a top-level
Apache project, signifying that the project’s community and products have
been well-governed under the ASF’s meritocratic process and principles.
This is a major step for the community and we are very proud to share this news
with users as we complete Spark’s move to Apache. Read more about
Spark’s growth during the past year and from contributors and users in
the ASF’s <a
href="https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces50">press
release</a>.</p>
+<p>The Apache Software Foundation <a
href="https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces50">announced</a>
today that Spark has graduated from the Apache Incubator to become a top-level
Apache project, signifying that the projectâs community and products have
been well-governed under the ASFâs meritocratic process and principles. This
is a major step for the community and we are very proud to share this news with
users as we complete Sparkâs move to Apache. Read more about Sparkâs growth
during the past year and from contributors and users in the ASFâs <a
href="https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces50">press
release</a>.</p>
-<p>As part of this change, note that Spark’s <a
href="/community.html">mailing lists</a> have moved to
<tt>@spark.apache.org</tt> addresses, although the old
<tt>@spark.incubator.apache.org</tt> addresses also still work.</p>
+<p>As part of this change, note that Sparkâs <a
href="/community.html">mailing lists</a> have moved to
<tt>@spark.apache.org</tt> addresses, although the old
<tt>@spark.incubator.apache.org</tt> addresses also still work.</p>
<p>
Modified: spark/site/news/spark-meetups.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-meetups.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-meetups.html (original)
+++ spark/site/news/spark-meetups.html Tue Aug 26 01:53:10 2014
@@ -160,7 +160,7 @@
<h2>We've started hosting a Bay Area Spark User Meetup</h2>
-<p>We’ve started hosting a regular <a
href="http://www.meetup.com/spark-users/">Bay Area Spark User Meetup</a>. Sign
up on the meetup.com page to be notified about events and meet other Spark
developers and users.</p>
+<p>Weâve started hosting a regular <a
href="http://www.meetup.com/spark-users/">Bay Area Spark User Meetup</a>. Sign
up on the meetup.com page to be notified about events and meet other Spark
developers and users.</p>
<p>
Modified: spark/site/news/spark-user-survey-and-powered-by-page.html
URL:
http://svn.apache.org/viewvc/spark/site/news/spark-user-survey-and-powered-by-page.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/spark-user-survey-and-powered-by-page.html (original)
+++ spark/site/news/spark-user-survey-and-powered-by-page.html Tue Aug 26
01:53:10 2014
@@ -160,9 +160,9 @@
<h2>Spark user survey and "Powered By" page</h2>
-<p>As we continue developing Spark, we would love to get feedback from users
and hear what you’d like us to work on next. We’ve decided that a
good way to do that is a survey – we hope to run this at regular
intervals. If you have a few minutes to participate, <a
href="https://docs.google.com/forms/d/1eMXp4GjcIXglxJe5vYYBzXKVm-6AiYt1KThJwhCjJiY/viewform">fill
in the survey here</a>. Your time is greatly appreciated.</p>
+<p>As we continue developing Spark, we would love to get feedback from users
and hear what youâd like us to work on next. Weâve decided that a good way
to do that is a survey â we hope to run this at regular intervals. If you
have a few minutes to participate, <a
href="https://docs.google.com/forms/d/1eMXp4GjcIXglxJe5vYYBzXKVm-6AiYt1KThJwhCjJiY/viewform">fill
in the survey here</a>. Your time is greatly appreciated.</p>
-<p>In parallel, we are starting a <a
href="https://cwiki.apache.org/confluence/display/SPARK/Powered+By+Spark">“powered
by” page</a> on the Apache Spark wiki for organizations that are using,
or contributing to, Spark. Sign up if you’d like to support the project!
This is a great way to let the world know you’re using Spark, and can
also be helpful to generate leads for recruiting. You can also add yourself
when you fill the survey.</p>
+<p>In parallel, we are starting a <a
href="https://cwiki.apache.org/confluence/display/SPARK/Powered+By+Spark">âpowered
byâ page</a> on the Apache Spark wiki for organizations that are using, or
contributing to, Spark. Sign up if youâd like to support the project! This is
a great way to let the world know youâre using Spark, and can also be helpful
to generate leads for recruiting. You can also add yourself when you fill the
survey.</p>
<p>Thanks for taking the time to give feedback.</p>
Modified: spark/site/news/strata-exercises-now-available-online.html
URL:
http://svn.apache.org/viewvc/spark/site/news/strata-exercises-now-available-online.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/strata-exercises-now-available-online.html (original)
+++ spark/site/news/strata-exercises-now-available-online.html Tue Aug 26
01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Strata exercises now available online</h2>
-<p>At this year’s <a href="http://strataconf.com/strata2013">Strata</a>
conference, the AMP Lab hosted a full day of tutorials on Spark, Shark, and
Spark Streaming, including online exercises on Amazon EC2. Those exercises are
now <a href="http://ampcamp.berkeley.edu/big-data-mini-course/">available
online</a>, letting you learn Spark and Shark at your own pace on an EC2
cluster with real data. They are a great resource for learning the systems. You
can also find <a
href="http://ampcamp.berkeley.edu/amp-camp-two-strata-2013/">slides</a> from
the Strata tutorials online, as well as <a
href="http://ampcamp.berkeley.edu/amp-camp-one-berkeley-2012/">videos</a> from
the AMP Camp workshop we held at Berkeley in August.</p>
+<p>At this yearâs <a href="http://strataconf.com/strata2013">Strata</a>
conference, the AMP Lab hosted a full day of tutorials on Spark, Shark, and
Spark Streaming, including online exercises on Amazon EC2. Those exercises are
now <a href="http://ampcamp.berkeley.edu/big-data-mini-course/">available
online</a>, letting you learn Spark and Shark at your own pace on an EC2
cluster with real data. They are a great resource for learning the systems. You
can also find <a
href="http://ampcamp.berkeley.edu/amp-camp-two-strata-2013/">slides</a> from
the Strata tutorials online, as well as <a
href="http://ampcamp.berkeley.edu/amp-camp-one-berkeley-2012/">videos</a> from
the AMP Camp workshop we held at Berkeley in August.</p>
<p>
Modified: spark/site/news/submit-talks-to-spark-summit-2014.html
URL:
http://svn.apache.org/viewvc/spark/site/news/submit-talks-to-spark-summit-2014.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/submit-talks-to-spark-summit-2014.html (original)
+++ spark/site/news/submit-talks-to-spark-summit-2014.html Tue Aug 26 01:53:10
2014
@@ -160,12 +160,12 @@
<h2>Submissions and registration open for Spark Summit 2014</h2>
-<p>After last year’s successful <a
href="http://spark-summit.org/2013">first Spark Summit</a>, registrations
+<p>After last yearâs successful <a href="http://spark-summit.org/2013">first
Spark Summit</a>, registrations
and talk submissions are now open for <a
href="http://spark-summit.org/2014">Spark Summit 2014</a>.
This will be a 3-day event in San Francisco organized by multiple companies in
the Spark community.
The event will run <strong>June 30th to July 2nd</strong> in San Francisco,
CA.</p>
-<p>If you’d like to present at the Summit, <a
href="http://spark-summit.org/submit">submit a talk</a>
+<p>If youâd like to present at the Summit, <a
href="http://spark-summit.org/submit">submit a talk</a>
before April 11th, 2014. We welcome talks on use cases, open source
development, and applications built
on Spark.</p>
Modified: spark/site/news/two-weeks-to-spark-summit-2014.html
URL:
http://svn.apache.org/viewvc/spark/site/news/two-weeks-to-spark-summit-2014.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/two-weeks-to-spark-summit-2014.html (original)
+++ spark/site/news/two-weeks-to-spark-summit-2014.html Tue Aug 26 01:53:10 2014
@@ -165,7 +165,7 @@ will be held in San Francisco on June 30
The Summit will contain <a
href="http://spark-summit.org/2014/agenda">presentations</a> from over 50
organizations using Spark, focused on use cases and ongoing development.</p>
-<p>If you’d like to come to the Summit, you can still
+<p>If youâd like to come to the Summit, you can still
<a
href="http://www.eventbrite.com/e/2014-spark-summit-registration-registration-10381067051">register</a>
online to attend in person. Otherwise, the Summit will offer a free live video
stream;
<a
href="https://docs.google.com/forms/d/1Jv_l-m1PQ1CPwKVIpOrWIDVBaWsJlJv14ZYkbezVjTs/viewform">register
for the stream</a> in advance to view it.</p>
Modified: spark/site/news/video-from-first-spark-development-meetup.html
URL:
http://svn.apache.org/viewvc/spark/site/news/video-from-first-spark-development-meetup.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/news/video-from-first-spark-development-meetup.html (original)
+++ spark/site/news/video-from-first-spark-development-meetup.html Tue Aug 26
01:53:10 2014
@@ -160,7 +160,7 @@
<h2>Video up from first Spark development meetup</h2>
-<p>On December 18th, we held the first of a series of Spark development
meetups, for people interested in learning the Spark codebase and contributing
to the project. There was quite a bit more demand than we anticipated, with
over 80 people signing up and 64 attending. The first meetup was an <a
href="http://www.meetup.com/spark-users/events/94101942/">introduction to Spark
internals</a>. Thanks to one of the attendees, there’s now a <a
href="http://www.youtube.com/watch?v=49Hr5xZyTEA">video of the meetup</a> on
YouTube. We’ve also posted the <a
href="http://files.meetup.com/3138542/dev-meetup-dec-2012.pptx">slides</a>.
Look to see more development meetups on Spark and Shark in the future.</p>
+<p>On December 18th, we held the first of a series of Spark development
meetups, for people interested in learning the Spark codebase and contributing
to the project. There was quite a bit more demand than we anticipated, with
over 80 people signing up and 64 attending. The first meetup was an <a
href="http://www.meetup.com/spark-users/events/94101942/">introduction to Spark
internals</a>. Thanks to one of the attendees, thereâs now a <a
href="http://www.youtube.com/watch?v=49Hr5xZyTEA">video of the meetup</a> on
YouTube. Weâve also posted the <a
href="http://files.meetup.com/3138542/dev-meetup-dec-2012.pptx">slides</a>.
Look to see more development meetups on Spark and Shark in the future.</p>
<p>
Modified: spark/site/releases/spark-release-0-3.html
URL:
http://svn.apache.org/viewvc/spark/site/releases/spark-release-0-3.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/releases/spark-release-0-3.html (original)
+++ spark/site/releases/spark-release-0-3.html Tue Aug 26 01:53:10 2014
@@ -176,7 +176,7 @@
<h3>Native Types for SequenceFiles</h3>
-<p>In working with SequenceFiles, which store objects that implement
Hadoop’s Writable interface, Spark will now let you use native types for
certain common Writable types, like IntWritable and Text. For example:</p>
+<p>In working with SequenceFiles, which store objects that implement
Hadoopâs Writable interface, Spark will now let you use native types for
certain common Writable types, like IntWritable and Text. For example:</p>
<div class="code">
<span class="comment">// Will read a SequenceFile of (IntWritable,
Text)</span><br />
Modified: spark/site/releases/spark-release-0-5-0.html
URL:
http://svn.apache.org/viewvc/spark/site/releases/spark-release-0-5-0.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/releases/spark-release-0-5-0.html (original)
+++ spark/site/releases/spark-release-0-5-0.html Tue Aug 26 01:53:10 2014
@@ -164,10 +164,10 @@
<h3>Mesos 0.9 Support</h3>
-<p>This release runs on <a href="http://www.mesosproject.org/">Apache Mesos
0.9</a>, the first Apache Incubator release of Mesos, which contains
significant usability and stability improvements. Most notable are better
memory accounting for applications with long-term memory use, easier access of
old jobs’ traces and logs (by keeping a history of executed tasks on the
web UI), and simpler installation.</p>
+<p>This release runs on <a href="http://www.mesosproject.org/">Apache Mesos
0.9</a>, the first Apache Incubator release of Mesos, which contains
significant usability and stability improvements. Most notable are better
memory accounting for applications with long-term memory use, easier access of
old jobsâ traces and logs (by keeping a history of executed tasks on the web
UI), and simpler installation.</p>
<h3>Performance Improvements</h3>
-<p>Spark’s scheduling is more communication-efficient when sending out
operations on RDDs with large lineage graphs. In addition, the cache
replacement policy has been improved to more smartly replace data when an RDD
does not fit in the cache, shuffles are more efficient, and the serializer used
for shipping closures is now configurable, making it possible to use faster
libraries than Java serialization there.</p>
+<p>Sparkâs scheduling is more communication-efficient when sending out
operations on RDDs with large lineage graphs. In addition, the cache
replacement policy has been improved to more smartly replace data when an RDD
does not fit in the cache, shuffles are more efficient, and the serializer used
for shipping closures is now configurable, making it possible to use faster
libraries than Java serialization there.</p>
<h3>Debug Improvements</h3>
@@ -179,11 +179,11 @@
<h3>EC2 Launch Script Improvements</h3>
-<p>Spark’s EC2 launch scripts are now included in the main package, and
have the ability to discover and use the latest Spark AMI automatically instead
of launching a hardcoded machine image ID.</p>
+<p>Sparkâs EC2 launch scripts are now included in the main package, and have
the ability to discover and use the latest Spark AMI automatically instead of
launching a hardcoded machine image ID.</p>
<h3>New Hadoop API Support</h3>
-<p>You can now use Spark to read and write data to storage formats in the new
<tt>org.apache.mapreduce</tt> packages (the “new Hadoop” API). In
addition, this release fixes an issue caused by a HDFS initialization bug in
some recent versions of HDFS.</p>
+<p>You can now use Spark to read and write data to storage formats in the new
<tt>org.apache.mapreduce</tt> packages (the ânew Hadoopâ API). In addition,
this release fixes an issue caused by a HDFS initialization bug in some recent
versions of HDFS.</p>
<p>
Modified: spark/site/releases/spark-release-0-5-1.html
URL:
http://svn.apache.org/viewvc/spark/site/releases/spark-release-0-5-1.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/releases/spark-release-0-5-1.html (original)
+++ spark/site/releases/spark-release-0-5-1.html Tue Aug 26 01:53:10 2014
@@ -193,7 +193,7 @@
<h3>EC2 Improvements</h3>
-<p>Spark’s EC2 launch script now configures Spark’s memory limit
automatically based on the machine’s available RAM.</p>
+<p>Sparkâs EC2 launch script now configures Sparkâs memory limit
automatically based on the machineâs available RAM.</p>
<p>
Modified: spark/site/releases/spark-release-0-6-0.html
URL:
http://svn.apache.org/viewvc/spark/site/releases/spark-release-0-6-0.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/releases/spark-release-0-6-0.html (original)
+++ spark/site/releases/spark-release-0-6-0.html Tue Aug 26 01:53:10 2014
@@ -172,11 +172,11 @@
<h3>Java API</h3>
-<p>Java programmers can now use Spark through a new <a
href="/docs/0.6.0/java-programming-guide.html">Java API layer</a>. This layer
makes available all of Spark’s features, including parallel
transformations, distributed datasets, broadcast variables, and accumulators,
in a Java-friendly manner.</p>
+<p>Java programmers can now use Spark through a new <a
href="/docs/0.6.0/java-programming-guide.html">Java API layer</a>. This layer
makes available all of Sparkâs features, including parallel transformations,
distributed datasets, broadcast variables, and accumulators, in a Java-friendly
manner.</p>
<h3>Expanded Documentation</h3>
-<p>Spark’s <a href="/docs/0.6.0/">documentation</a> has been expanded
with a new <a href="/docs/0.6.0/quick-start.html">quick start guide</a>,
additional deployment instructions, configuration guide, tuning guide, and
improved <a href="/docs/0.6.0/api/core">Scaladoc</a> API documentation.</p>
+<p>Sparkâs <a href="/docs/0.6.0/">documentation</a> has been expanded with a
new <a href="/docs/0.6.0/quick-start.html">quick start guide</a>, additional
deployment instructions, configuration guide, tuning guide, and improved <a
href="/docs/0.6.0/api/core">Scaladoc</a> API documentation.</p>
<h3>Engine Changes</h3>
@@ -199,7 +199,7 @@
<h3>Enhanced Debugging</h3>
-<p>Spark’s log now prints which operation in your program each RDD and
job described in your logs belongs to, making it easier to tie back to which
parts of your code experience problems.</p>
+<p>Sparkâs log now prints which operation in your program each RDD and job
described in your logs belongs to, making it easier to tie back to which parts
of your code experience problems.</p>
<h3>Maven Artifacts</h3>
Modified: spark/site/releases/spark-release-0-7-0.html
URL:
http://svn.apache.org/viewvc/spark/site/releases/spark-release-0-7-0.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/releases/spark-release-0-7-0.html (original)
+++ spark/site/releases/spark-release-0-7-0.html Tue Aug 26 01:53:10 2014
@@ -186,7 +186,7 @@
<h3>New Operations</h3>
-<p>This release adds several RDD transformations, including <tt>keys</tt>,
<tt>values</tt>, <tt>keyBy</tt>, <tt>subtract</tt>, <tt>coalesce</tt>,
<tt>zip</tt>. It also adds <tt>SparkContext.hadoopConfiguration</tt> to allow
programs to configure Hadoop input/output settings globally across operations.
Finally, it adds the <tt>RDD.toDebugString()</tt> method, which can be used to
print an RDD’s lineage graph for troubleshooting.</p>
+<p>This release adds several RDD transformations, including <tt>keys</tt>,
<tt>values</tt>, <tt>keyBy</tt>, <tt>subtract</tt>, <tt>coalesce</tt>,
<tt>zip</tt>. It also adds <tt>SparkContext.hadoopConfiguration</tt> to allow
programs to configure Hadoop input/output settings globally across operations.
Finally, it adds the <tt>RDD.toDebugString()</tt> method, which can be used to
print an RDDâs lineage graph for troubleshooting.</p>
<h3>EC2 Improvements</h3>
@@ -223,7 +223,7 @@
<h3>Credits</h3>
-<p>Spark 0.7 was the work of many contributors from Berkeley and
outside—in total, 31 different contributors, of which 20 were from
outside Berkeley. Here are the people who contributed, along with areas they
worked on:</p>
+<p>Spark 0.7 was the work of many contributors from Berkeley and outsideâin
total, 31 different contributors, of which 20 were from outside Berkeley. Here
are the people who contributed, along with areas they worked on:</p>
<ul>
<li>Mikhail Bautin -- Maven build</li>
Modified: spark/site/releases/spark-release-0-8-0.html
URL:
http://svn.apache.org/viewvc/spark/site/releases/spark-release-0-8-0.html?rev=1620493&r1=1620492&r2=1620493&view=diff
==============================================================================
--- spark/site/releases/spark-release-0-8-0.html (original)
+++ spark/site/releases/spark-release-0-8-0.html Tue Aug 26 01:53:10 2014
@@ -204,13 +204,13 @@
<li>The examples build has been isolated from the core build, substantially
reducing the potential for dependency conflicts.</li>
<li>The Spark Streaming Twitter API has been updated to use OAuth
authentication instead of the deprecated username/password authentication in
Spark 0.7.0.</li>
<li>Several new example jobs have been added, including PageRank
implementations in Java, Scala and Python, examples for accessing HBase and
Cassandra, and MLlib examples.</li>
- <li>Support for running on Mesos has been improved – now you can
deploy a Spark assembly JAR as part of the Mesos job, instead of having Spark
pre-installed on each machine. The default Mesos version has also been updated
to 0.13.</li>
+ <li>Support for running on Mesos has been improved â now you can deploy a
Spark assembly JAR as part of the Mesos job, instead of having Spark
pre-installed on each machine. The default Mesos version has also been updated
to 0.13.</li>
<li>This release includes various optimizations to PySpark and to the job
scheduler.</li>
</ul>
<h3 id="compatibility">Compatibility</h3>
<ul>
- <li><strong>This release changes Sparkâs package name to
‘org.apache.spark’</strong>, so those upgrading from Spark 0.7 will
need to adjust their imports accordingly. In addition, weâve moved the
<code>RDD</code> class to the org.apache.spark.rdd package (it was previously
in the top-level package). The Spark artifacts published through Maven have
also changed to the new package name.</li>
+ <li><strong>This release changes Sparkâs package name to
âorg.apache.sparkâ</strong>, so those upgrading from Spark 0.7 will need to
adjust their imports accordingly. In addition, weâve moved the
<code>RDD</code> class to the org.apache.spark.rdd package (it was previously
in the top-level package). The Spark artifacts published through Maven have
also changed to the new package name.</li>
<li>In the Java API, use of Scalaâs <code>Option</code> class has been
replaced with <code>Optional</code> from the Guava library.</li>
<li>Linking against Spark for arbitrary Hadoop versions is now possible by
specifying a dependency on <code>hadoop-client</code>, instead of rebuilding
<code>spark-core</code> against your version of Hadoop. See the documentation
<a
href="http://spark.incubator.apache.org/docs/0.8.0/scala-programming-guide.html#linking-with-spark">here</a>
for details.</li>
<li>If you are building Spark, youâll now need to run <code>sbt/sbt
assembly</code> instead of <code>package</code>.</li>
@@ -220,73 +220,73 @@
<p>Spark 0.8.0 was the result of the largest team of contributors yet. The
following developers contributed to this release:</p>
<ul>
- <li>Andrew Ash – documentation, code cleanup and logging
improvements</li>
- <li>Mikhail Bautin – bug fix</li>
- <li>Konstantin Boudnik – Maven build, bug fixes, and documentation</li>
- <li>Ian Buss – sbt configuration improvement</li>
- <li>Evan Chan – API improvement, bug fix, and documentation</li>
- <li>Lian Cheng – bug fix</li>
- <li>Tathagata Das – performance improvement in streaming receiver and
streaming bug fix</li>
- <li>Aaron Davidson – Python improvements, bug fix, and unit tests</li>
- <li>Giovanni Delussu – coalesced RDD feature</li>
- <li>Joseph E. Gonzalez – improvement to zipPartitions</li>
- <li>Karen Feng – several improvements to web UI</li>
- <li>Andy Feng – HDFS metrics</li>
- <li>Ali Ghodsi – configuration improvements and locality-aware
coalesce</li>
- <li>Christoph Grothaus – bug fix</li>
- <li>Thomas Graves – support for secure YARN cluster and various
YARN-related improvements</li>
- <li>Stephen Haberman – bug fix, documentation, and code cleanup</li>
- <li>Mark Hamstra – bug fixes and Maven build</li>
- <li>Benjamin Hindman – Mesos compatibility and documentation</li>
- <li>Liang-Chi Hsieh – bug fixes in build and in YARN mode</li>
- <li>Shane Huang – shuffle improvements, bug fix</li>
- <li>Ethan Jewett – Spark/HBase example</li>
- <li>Holden Karau – bug fix and EC2 improvement</li>
- <li>Kody Koeniger – JDBV RDD implementation</li>
- <li>Andy Konwinski – documentation</li>
- <li>Jey Kottalam – PySpark optimizations, Hadoop agnostic build
(lead), and bug fixes</li>
- <li>Andrey Kouznetsov – Bug fix</li>
- <li>S. Kumar – Spark Streaming example</li>
- <li>Ryan LeCompte – topK method optimization and serialization
improvements</li>
- <li>Gavin Li – compression codecs and pipe support</li>
- <li>Harold Lim – fair scheduler</li>
- <li>Dmitriy Lyubimov – bug fix</li>
- <li>Chris Mattmann – Apache mentor</li>
- <li>David McCauley – JSON API improvement</li>
- <li>Sean McNamara – added <code>takeOrdered</code> function, bug
fixes, and a build fix</li>
- <li>Mridul Muralidharan – YARN integration (lead) and scheduler
improvements</li>
- <li>Marc Mercer – improvements to UI json output</li>
- <li>Christopher Nguyen – bug fixes</li>
- <li>Erik van Oosten – example fix</li>
- <li>Kay Ousterhout – fix for scheduler regression and bug fixes</li>
- <li>Xinghao Pan – MLLib contributions</li>
- <li>Hiral Patel – bug fix</li>
- <li>James Phillpotts – updated Twitter API for Spark streaming</li>
- <li>Nick Pentreath – scala pageRank example, bagel improvement, and
several Java examples</li>
- <li>Alexander Pivovarov – logging improvement and Maven build</li>
- <li>Mike Potts – configuration improvement</li>
- <li>Rohit Rai – Spark/Cassandra example</li>
- <li>Imran Rashid – bug fixes and UI improvement</li>
- <li>Charles Reiss – bug fixes, code cleanup, performance
improvements</li>
- <li>Josh Rosen – Python API improvements, Java API improvements, EC2
scripts and bug fixes</li>
- <li>Henry Saputra – Apache mentor</li>
- <li>Jerry Shao – bug fixes, metrics system</li>
- <li>Prashant Sharma – documentation</li>
- <li>Mingfei Shi – joblogger and bug fix</li>
- <li>Andre Schumacher – several PySpark features</li>
- <li>Ginger Smith – MLLib contribution</li>
- <li>Evan Sparks – contributions to MLLib</li>
- <li>Ram Sriharsha – bug fix and RDD removal feature</li>
- <li>Ameet Talwalkar – MLlib contributions</li>
- <li>Roman Tkalenko – code refactoring and cleanup</li>
- <li>Chu Tong – Java PageRank algorithm and bug fix in bash scripts</li>
- <li>Shivaram Venkataraman – bug fixes, contributions to MLLib, netty
shuffle fixes, and Java API additions</li>
- <li>Patrick Wendell – release manager, bug fixes, documentation,
metrics system, and web UI</li>
- <li>Andrew Xia – fair scheduler (lead), metrics system, and ui
improvements</li>
- <li>Reynold Xin – shuffle improvements, bug fixes, code refactoring,
usability improvements, MLLib contributions</li>
- <li>Matei Zaharia – MLLib contributions, documentation, examples, UI
improvements, PySpark improvements, and bug fixes</li>
- <li>Wu Zeming – bug fix in scheduler</li>
- <li>Bill Zhao – log message improvement</li>
+ <li>Andrew Ash â documentation, code cleanup and logging improvements</li>
+ <li>Mikhail Bautin â bug fix</li>
+ <li>Konstantin Boudnik â Maven build, bug fixes, and documentation</li>
+ <li>Ian Buss â sbt configuration improvement</li>
+ <li>Evan Chan â API improvement, bug fix, and documentation</li>
+ <li>Lian Cheng â bug fix</li>
+ <li>Tathagata Das â performance improvement in streaming receiver and
streaming bug fix</li>
+ <li>Aaron Davidson â Python improvements, bug fix, and unit tests</li>
+ <li>Giovanni Delussu â coalesced RDD feature</li>
+ <li>Joseph E. Gonzalez â improvement to zipPartitions</li>
+ <li>Karen Feng â several improvements to web UI</li>
+ <li>Andy Feng â HDFS metrics</li>
+ <li>Ali Ghodsi â configuration improvements and locality-aware
coalesce</li>
+ <li>Christoph Grothaus â bug fix</li>
+ <li>Thomas Graves â support for secure YARN cluster and various
YARN-related improvements</li>
+ <li>Stephen Haberman â bug fix, documentation, and code cleanup</li>
+ <li>Mark Hamstra â bug fixes and Maven build</li>
+ <li>Benjamin Hindman â Mesos compatibility and documentation</li>
+ <li>Liang-Chi Hsieh â bug fixes in build and in YARN mode</li>
+ <li>Shane Huang â shuffle improvements, bug fix</li>
+ <li>Ethan Jewett â Spark/HBase example</li>
+ <li>Holden Karau â bug fix and EC2 improvement</li>
+ <li>Kody Koeniger â JDBV RDD implementation</li>
+ <li>Andy Konwinski â documentation</li>
+ <li>Jey Kottalam â PySpark optimizations, Hadoop agnostic build (lead),
and bug fixes</li>
+ <li>Andrey Kouznetsov â Bug fix</li>
+ <li>S. Kumar â Spark Streaming example</li>
+ <li>Ryan LeCompte â topK method optimization and serialization
improvements</li>
+ <li>Gavin Li â compression codecs and pipe support</li>
+ <li>Harold Lim â fair scheduler</li>
+ <li>Dmitriy Lyubimov â bug fix</li>
+ <li>Chris Mattmann â Apache mentor</li>
+ <li>David McCauley â JSON API improvement</li>
+ <li>Sean McNamara â added <code>takeOrdered</code> function, bug fixes,
and a build fix</li>
+ <li>Mridul Muralidharan â YARN integration (lead) and scheduler
improvements</li>
+ <li>Marc Mercer â improvements to UI json output</li>
+ <li>Christopher Nguyen â bug fixes</li>
+ <li>Erik van Oosten â example fix</li>
+ <li>Kay Ousterhout â fix for scheduler regression and bug fixes</li>
+ <li>Xinghao Pan â MLLib contributions</li>
+ <li>Hiral Patel â bug fix</li>
+ <li>James Phillpotts â updated Twitter API for Spark streaming</li>
+ <li>Nick Pentreath â scala pageRank example, bagel improvement, and
several Java examples</li>
+ <li>Alexander Pivovarov â logging improvement and Maven build</li>
+ <li>Mike Potts â configuration improvement</li>
+ <li>Rohit Rai â Spark/Cassandra example</li>
+ <li>Imran Rashid â bug fixes and UI improvement</li>
+ <li>Charles Reiss â bug fixes, code cleanup, performance improvements</li>
+ <li>Josh Rosen â Python API improvements, Java API improvements, EC2
scripts and bug fixes</li>
+ <li>Henry Saputra â Apache mentor</li>
+ <li>Jerry Shao â bug fixes, metrics system</li>
+ <li>Prashant Sharma â documentation</li>
+ <li>Mingfei Shi â joblogger and bug fix</li>
+ <li>Andre Schumacher â several PySpark features</li>
+ <li>Ginger Smith â MLLib contribution</li>
+ <li>Evan Sparks â contributions to MLLib</li>
+ <li>Ram Sriharsha â bug fix and RDD removal feature</li>
+ <li>Ameet Talwalkar â MLlib contributions</li>
+ <li>Roman Tkalenko â code refactoring and cleanup</li>
+ <li>Chu Tong â Java PageRank algorithm and bug fix in bash scripts</li>
+ <li>Shivaram Venkataraman â bug fixes, contributions to MLLib, netty
shuffle fixes, and Java API additions</li>
+ <li>Patrick Wendell â release manager, bug fixes, documentation, metrics
system, and web UI</li>
+ <li>Andrew Xia â fair scheduler (lead), metrics system, and ui
improvements</li>
+ <li>Reynold Xin â shuffle improvements, bug fixes, code refactoring,
usability improvements, MLLib contributions</li>
+ <li>Matei Zaharia â MLLib contributions, documentation, examples, UI
improvements, PySpark improvements, and bug fixes</li>
+ <li>Wu Zeming â bug fix in scheduler</li>
+ <li>Bill Zhao â log message improvement</li>
</ul>
<p>Thanks to everyone who contributed!
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]