This is an automated email from the ASF dual-hosted git repository.
jnioche pushed a commit to branch asf-site
in repository
https://gitbox.apache.org/repos/asf/incubator-stormcrawler-site.git
The following commit(s) were added to refs/heads/asf-site by this push:
new b7b1461 Added refs to Incubating + minor fixes to main page. Fixed
links in header
b7b1461 is described below
commit b7b1461aa1ef7459456aed1cafd44842a17544aa
Author: Julien Nioche <[email protected]>
AuthorDate: Thu Apr 18 10:08:22 2024 +0100
Added refs to Incubating + minor fixes to main page. Fixed links in header
Signed-off-by: Julien Nioche <[email protected]>
---
faq/index.html | 8 ++++----
feed.xml | 4 ++--
getting-started/index.html | 8 ++++----
index.html | 23 +++++++++++------------
support/index.html | 8 ++++----
5 files changed, 25 insertions(+), 26 deletions(-)
diff --git a/faq/index.html b/faq/index.html
index adfb1fb..fb2dc6e 100644
--- a/faq/index.html
+++ b/faq/index.html
@@ -24,17 +24,17 @@
<header class="site-header">
<div class="site-header__wrap">
<div class="site-header__logo">
- <a href="/"><img src="/img/logo.png" alt="Storm Crawler"></a>
+ <a href="/"><img src="/img/logo.png" alt="Apache StormCrawler"></a>
</div>
</div>
</header>
<nav class="site-nav">
<ul>
<li><a href="/index.html">Home</a>
- <li><a
href="https://github.com/DigitalPebble/storm-crawler/releases/tag/2.11">Download</a></li>
- <li><a href="https://github.com/DigitalPebble/storm-crawler">Source
Code</a></li>
+ <li><a
href="https://github.com/apache/incubator-stormcrawler/releases/tag/2.11">Download</a></li>
+ <li><a href="https://github.com/apache/incubator-stormcrawler">Source
Code</a></li>
<li><a href="/getting-started/">Getting Started</a></li>
- <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">Docs</a>
+ <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">JavaDocs</a>
<li><a href="/faq/">FAQ</a></li>
<li><a href="/support/">Support</a></li>
</ul>
diff --git a/feed.xml b/feed.xml
index 1d3bb43..c17d989 100644
--- a/feed.xml
+++ b/feed.xml
@@ -6,8 +6,8 @@
</description>
<link>https://stormcrawler.apache.org/</link>
<atom:link href="https://stormcrawler.apache.org/feed.xml" rel="self"
type="application/rss+xml"/>
- <pubDate>Mon, 15 Apr 2024 14:28:22 -0500</pubDate>
- <lastBuildDate>Mon, 15 Apr 2024 14:28:22 -0500</lastBuildDate>
+ <pubDate>Thu, 18 Apr 2024 04:04:49 -0500</pubDate>
+ <lastBuildDate>Thu, 18 Apr 2024 04:04:49 -0500</lastBuildDate>
<generator>Jekyll v3.9.5</generator>
</channel>
diff --git a/getting-started/index.html b/getting-started/index.html
index 3e99aff..61ead66 100644
--- a/getting-started/index.html
+++ b/getting-started/index.html
@@ -24,17 +24,17 @@
<header class="site-header">
<div class="site-header__wrap">
<div class="site-header__logo">
- <a href="/"><img src="/img/logo.png" alt="Storm Crawler"></a>
+ <a href="/"><img src="/img/logo.png" alt="Apache StormCrawler"></a>
</div>
</div>
</header>
<nav class="site-nav">
<ul>
<li><a href="/index.html">Home</a>
- <li><a
href="https://github.com/DigitalPebble/storm-crawler/releases/tag/2.11">Download</a></li>
- <li><a href="https://github.com/DigitalPebble/storm-crawler">Source
Code</a></li>
+ <li><a
href="https://github.com/apache/incubator-stormcrawler/releases/tag/2.11">Download</a></li>
+ <li><a href="https://github.com/apache/incubator-stormcrawler">Source
Code</a></li>
<li><a href="/getting-started/">Getting Started</a></li>
- <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">Docs</a>
+ <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">JavaDocs</a>
<li><a href="/faq/">FAQ</a></li>
<li><a href="/support/">Support</a></li>
</ul>
diff --git a/index.html b/index.html
index 6b995f0..20f758f 100644
--- a/index.html
+++ b/index.html
@@ -24,17 +24,17 @@
<header class="site-header">
<div class="site-header__wrap">
<div class="site-header__logo">
- <a href="/"><img src="/img/logo.png" alt="Storm Crawler"></a>
+ <a href="/"><img src="/img/logo.png" alt="Apache StormCrawler"></a>
</div>
</div>
</header>
<nav class="site-nav">
<ul>
<li><a href="/index.html">Home</a>
- <li><a
href="https://github.com/DigitalPebble/storm-crawler/releases/tag/2.11">Download</a></li>
- <li><a href="https://github.com/DigitalPebble/storm-crawler">Source
Code</a></li>
+ <li><a
href="https://github.com/apache/incubator-stormcrawler/releases/tag/2.11">Download</a></li>
+ <li><a href="https://github.com/apache/incubator-stormcrawler">Source
Code</a></li>
<li><a href="/getting-started/">Getting Started</a></li>
- <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">Docs</a>
+ <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">JavaDocs</a>
<li><a href="/faq/">FAQ</a></li>
<li><a href="/support/">Support</a></li>
</ul>
@@ -47,7 +47,7 @@
</div>
</div>
<div class="row row-col">
- <p><strong>StormCrawler</strong> is an open source SDK for building
distributed web crawlers based on <a href="http://storm.apache.org">Apache
Storm®</a>. The project is under Apache license v2 and consists of a collection
of reusable resources and components, written mostly in Java.</p>
+ <p><strong>Apache StormCrawler (Incubating)</strong> is an open source SDK
for building distributed web crawlers based on <a
href="http://storm.apache.org">Apache Storm®</a>. The project is under Apache
license v2 and consists of a collection of reusable resources and components,
written mostly in Java.</p>
<p>The aim of StormCrawler is to help build web crawlers that are :</p>
<ul>
<li>scalable</li>
@@ -56,17 +56,16 @@
<li>easy to extend</li>
<li>polite yet efficient</li>
</ul>
- <p><strong>StormCrawler</strong> is a library and collection of resources
that developers can leverage to build their own crawlers. The good news is that
doing so can be pretty straightforward! Have a look at the <a
href="getting-started/">Getting Started</a> section for more details.</p>
- <p>Apart from the core components, we provide some <a
href="https://github.com/DigitalPebble/storm-crawler/tree/master/external">external
resources</a> that you can reuse in your project, like for instance our spout
and bolts for <a href="https://www.elastic.co/">ElasticSearch®</a> and <a
href="https://opensearch.org/">OpenSearch®</a> or a ParserBolt which uses <a
href="http://tika.apache.org">Apache Tika®</a> to parse various document
formats.</p>
- <p><strong>StormCrawler</strong> is perfectly suited to use cases where the
URL to fetch and parse come as streams but is also an appropriate solution for
large scale recursive crawls, particularly where low latency is required. The
project is used in production by <a
href="https://github.com/DigitalPebble/storm-crawler/wiki/Powered-By">many
organisations</a> and is actively developed and maintained.</p>
- <p>The <a
href="https://github.com/DigitalPebble/storm-crawler/wiki/Presentations">Presentations</a>
page contains links to some recent presentations made about this project.</p>
- <p>We are very grateful to our <a
href="https://github.com/DigitalPebble/storm-crawler/wiki/Sponsors">sponsors</a>
for their continued support.</p>
+ <p><strong>Apache StormCrawler (Incubating)</strong> is a library and
collection of resources that developers can leverage to build their own
crawlers. The good news is that doing so can be pretty straightforward! Have a
look at the <a href="getting-started/">Getting Started</a> section for more
details.</p>
+ <p>Apart from the core components, we provide some <a
href="https://github.com/apache/incubator-stormcrawler/tree/main/external">external
resources</a> that you can reuse in your project, like for instance our spout
and bolts for <a href="https://opensearch.org/">OpenSearch®</a> or a ParserBolt
which uses <a href="http://tika.apache.org">Apache Tika®</a> to parse various
document formats.</p>
+ <p><strong>Apache StormCrawler</strong> is perfectly suited to use cases
where the URL to fetch and parse come as streams but is also an appropriate
solution for large scale recursive crawls, particularly where low latency is
required. The project is used in production by <a
href="https://github.com/apache/incubator-stormcrawler/wiki/Powered-By">many
organisations</a> and is actively developed and maintained.</p>
+ <p>The <a
href="https://github.com/apache/incubator-stormcrawler/wiki/Presentations">Presentations</a>
page contains links to some recent presentations made about this project.</p>
</div>
<div class="row row-col">
<div class="used-by-panel">
<h2>Used by</h2>
- <a
href="https://digitalpebble.blogspot.com/2019/02/meet-stormcrawler-users-q-with-pixray.html"
target="_blank">
+ <a href="https://pixray.com/" target="_blank">
<img src="/img/pixray.png" alt="Pixray" height=80>
</a>
<a href="https://www.gov.nt.ca/" target="_blank">
@@ -79,7 +78,7 @@
<img src="/img/polecat.svg" alt="Polecat" height=70>
</a>
<br>
- <a
href="http://github.com/DigitalPebble/storm-crawler/wiki/Powered-By">and many
more...</a>
+ <a
href="http://github.com/apache/incubator-stormcrawler/wiki/Powered-By">and many
more...</a>
</div>
</div>
diff --git a/support/index.html b/support/index.html
index 04f1f37..207c060 100644
--- a/support/index.html
+++ b/support/index.html
@@ -24,17 +24,17 @@
<header class="site-header">
<div class="site-header__wrap">
<div class="site-header__logo">
- <a href="/"><img src="/img/logo.png" alt="Storm Crawler"></a>
+ <a href="/"><img src="/img/logo.png" alt="Apache StormCrawler"></a>
</div>
</div>
</header>
<nav class="site-nav">
<ul>
<li><a href="/index.html">Home</a>
- <li><a
href="https://github.com/DigitalPebble/storm-crawler/releases/tag/2.11">Download</a></li>
- <li><a href="https://github.com/DigitalPebble/storm-crawler">Source
Code</a></li>
+ <li><a
href="https://github.com/apache/incubator-stormcrawler/releases/tag/2.11">Download</a></li>
+ <li><a href="https://github.com/apache/incubator-stormcrawler">Source
Code</a></li>
<li><a href="/getting-started/">Getting Started</a></li>
- <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">Docs</a>
+ <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">JavaDocs</a>
<li><a href="/faq/">FAQ</a></li>
<li><a href="/support/">Support</a></li>
</ul>