This is an automated email from the ASF dual-hosted git repository.
jnioche pushed a commit to branch main
in repository
https://gitbox.apache.org/repos/asf/incubator-stormcrawler-site.git
The following commit(s) were added to refs/heads/main by this push:
new d989ed1 Added refs to Incubating + minor fixes to main page. Fixed
links in header
d989ed1 is described below
commit d989ed169a83d2e87a24238caa80172b8b2caf5a
Author: Julien Nioche <[email protected]>
AuthorDate: Thu Apr 18 09:55:46 2024 +0100
Added refs to Incubating + minor fixes to main page. Fixed links in header
Signed-off-by: Julien Nioche <[email protected]>
---
readme.md => README.md | 9 +++++----
_includes/header.html | 8 ++++----
index.html | 15 +++++++--------
3 files changed, 16 insertions(+), 16 deletions(-)
diff --git a/readme.md b/README.md
similarity index 72%
rename from readme.md
rename to README.md
index e0f3a7b..708f97e 100644
--- a/readme.md
+++ b/README.md
@@ -10,17 +10,18 @@ To build and run server locally type 'jekyll serve'. The
site can then be access
You can build the website for local testing by running
```bash
-docker compose up local
+docker compose up local -d
```
## Deployment
-This website uses the typical ASF layout. The actual website is found on
`asf-site`. Any changes need to be pushed to this branch.
+This website uses the typical ASF layout. The actual website is found on the
branch `asf-site`, any changes need to be pushed there.
+
You can run
```bash
-docker compose up build
+docker compose up build -d
```
-which will create a folder `_site`. This is the content, which needs to be
copied ofter to the `asf-site` branch.
\ No newline at end of file
+which will create a folder `_site`. This is the content, which needs to be
copied afterwards to the `asf-site` branch.
diff --git a/_includes/header.html b/_includes/header.html
index 0def1f5..1878c54 100644
--- a/_includes/header.html
+++ b/_includes/header.html
@@ -1,17 +1,17 @@
<header class="site-header">
<div class="site-header__wrap">
<div class="site-header__logo">
- <a href="{{ site.baseurl }}/"><img src="{{ site.baseurl }}/img/logo.png"
alt="Storm Crawler"></a>
+ <a href="{{ site.baseurl }}/"><img src="{{ site.baseurl }}/img/logo.png"
alt="Apache StormCrawler"></a>
</div>
</div>
</header>
<nav class="site-nav">
<ul>
<li><a href="{{ site.baseurl }}/index.html">Home</a>
- <li><a
href="https://github.com/DigitalPebble/storm-crawler/releases/tag/2.11">Download</a></li>
- <li><a href="https://github.com/DigitalPebble/storm-crawler">Source
Code</a></li>
+ <li><a
href="https://github.com/apache/incubator-stormcrawler/releases/tag/2.11">Download</a></li>
+ <li><a href="https://github.com/apache/incubator-stormcrawler">Source
Code</a></li>
<li><a href="{{ site.baseurl }}/getting-started/">Getting Started</a></li>
- <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">Docs</a>
+ <li><a
href="https://javadoc.io/doc/com.digitalpebble.stormcrawler/storm-crawler-core/latest/index.html">JavaDocs</a>
<li><a href="{{ site.baseurl }}/faq/">FAQ</a></li>
<li><a href="{{ site.baseurl }}/support/">Support</a></li>
</ul>
diff --git a/index.html b/index.html
index b99c064..8f86388 100644
--- a/index.html
+++ b/index.html
@@ -8,7 +8,7 @@ slug: home
</div>
</div>
<div class="row row-col">
- <p><strong>StormCrawler</strong> is an open source SDK for building
distributed web crawlers based on <a href="http://storm.apache.org">Apache
Storm®</a>. The project is under Apache license v2 and consists of a collection
of reusable resources and components, written mostly in Java.</p>
+ <p><strong>Apache StormCrawler (Incubating)</strong> is an open source SDK
for building distributed web crawlers based on <a
href="http://storm.apache.org">Apache Storm®</a>. The project is under Apache
license v2 and consists of a collection of reusable resources and components,
written mostly in Java.</p>
<p>The aim of StormCrawler is to help build web crawlers that are :</p>
<ul>
<li>scalable</li>
@@ -17,17 +17,16 @@ slug: home
<li>easy to extend</li>
<li>polite yet efficient</li>
</ul>
- <p><strong>StormCrawler</strong> is a library and collection of resources
that developers can leverage to build their own crawlers. The good news is that
doing so can be pretty straightforward! Have a look at the <a
href="getting-started/">Getting Started</a> section for more details.</p>
- <p>Apart from the core components, we provide some <a
href="https://github.com/DigitalPebble/storm-crawler/tree/master/external">external
resources</a> that you can reuse in your project, like for instance our spout
and bolts for <a href="https://www.elastic.co/">ElasticSearch®</a> and <a
href="https://opensearch.org/">OpenSearch®</a> or a ParserBolt which uses <a
href="http://tika.apache.org">Apache Tika®</a> to parse various document
formats.</p>
- <p><strong>StormCrawler</strong> is perfectly suited to use cases where the
URL to fetch and parse come as streams but is also an appropriate solution for
large scale recursive crawls, particularly where low latency is required. The
project is used in production by <a
href="https://github.com/DigitalPebble/storm-crawler/wiki/Powered-By">many
organisations</a> and is actively developed and maintained.</p>
- <p>The <a
href="https://github.com/DigitalPebble/storm-crawler/wiki/Presentations">Presentations</a>
page contains links to some recent presentations made about this project.</p>
- <p>We are very grateful to our <a
href="https://github.com/DigitalPebble/storm-crawler/wiki/Sponsors">sponsors</a>
for their continued support.</p>
+ <p><strong>Apache StormCrawler (Incubating)</strong> is a library and
collection of resources that developers can leverage to build their own
crawlers. The good news is that doing so can be pretty straightforward! Have a
look at the <a href="getting-started/">Getting Started</a> section for more
details.</p>
+ <p>Apart from the core components, we provide some <a
href="https://github.com/apache/incubator-stormcrawler/tree/main/external">external
resources</a> that you can reuse in your project, like for instance our spout
and bolts for <a href="https://opensearch.org/">OpenSearch®</a> or a ParserBolt
which uses <a href="http://tika.apache.org">Apache Tika®</a> to parse various
document formats.</p>
+ <p><strong>Apache StormCrawler</strong> is perfectly suited to use cases
where the URL to fetch and parse come as streams but is also an appropriate
solution for large scale recursive crawls, particularly where low latency is
required. The project is used in production by <a
href="https://github.com/apache/incubator-stormcrawler/wiki/Powered-By">many
organisations</a> and is actively developed and maintained.</p>
+ <p>The <a
href="https://github.com/apache/incubator-stormcrawler/wiki/Presentations">Presentations</a>
page contains links to some recent presentations made about this project.</p>
</div>
<div class="row row-col">
<div class="used-by-panel">
<h2>Used by</h2>
- <a
href="https://digitalpebble.blogspot.com/2019/02/meet-stormcrawler-users-q-with-pixray.html"
target="_blank">
+ <a href="https://pixray.com/" target="_blank">
<img src="{{ site.baseurl }}/img/pixray.png" alt="Pixray" height=80>
</a>
<a href="https://www.gov.nt.ca/" target="_blank">
@@ -40,6 +39,6 @@ slug: home
<img src="{{ site.baseurl }}/img/polecat.svg" alt="Polecat" height=70>
</a>
<br>
- <a
href="http://github.com/DigitalPebble/storm-crawler/wiki/Powered-By">and many
more...</a>
+ <a
href="http://github.com/apache/incubator-stormcrawler/wiki/Powered-By">and many
more...</a>
</div>
</div>