Repository: any23
Updated Branches:
  refs/heads/master 7a7db2006 -> 616bdf892


Update poweredby page to account for Nutch usage


Project: http://git-wip-us.apache.org/repos/asf/any23/repo
Commit: http://git-wip-us.apache.org/repos/asf/any23/commit/616bdf89
Tree: http://git-wip-us.apache.org/repos/asf/any23/tree/616bdf89
Diff: http://git-wip-us.apache.org/repos/asf/any23/diff/616bdf89

Branch: refs/heads/master
Commit: 616bdf8929bce01eded77a0d65c6fbd3d15dd201
Parents: 7a7db20
Author: Lewis John McGibbney <lewis.mcgibb...@gmail.com>
Authored: Fri Feb 9 12:35:10 2018 -0800
Committer: Lewis John McGibbney <lewis.mcgibb...@gmail.com>
Committed: Fri Feb 9 12:35:10 2018 -0800

----------------------------------------------------------------------
 src/site/apt/poweredby.apt                  |  22 ++++++++++++++++------
 src/site/resources/images/nutch_logo_tm.png | Bin 0 -> 2275 bytes
 2 files changed, 16 insertions(+), 6 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/any23/blob/616bdf89/src/site/apt/poweredby.apt
----------------------------------------------------------------------
diff --git a/src/site/apt/poweredby.apt b/src/site/apt/poweredby.apt
index 4d39adf..eb47572 100644
--- a/src/site/apt/poweredby.apt
+++ b/src/site/apt/poweredby.apt
@@ -26,11 +26,9 @@ PoweredBy
 
 [./images/logo-sindice-90x30.png]  
 
-  {{{http://sindice.com/}Sindice}}
-
-  Sindice is a platform to build applications on top of this data.
-  Sindice collects Web Data in many ways, following existing web standards,
-  and offers Search and Querying across this data, updated live every few 
minutes.
+  Sindice was a platform to build applications on top of structured data.
+  Sindice collected Web Data in many ways, following existing web standards,
+  and offered Search and Querying across this data, updated live every few 
minutes.
 
 [./images/fu-logo-90x25.png][./images/kit-logo-90x40.png]
 
@@ -41,4 +39,16 @@ PoweredBy
   people, organizations, places, events into their HTML pages. The Web Data 
Commons
   project extracts this data from several billion web pages and provides the 
extracted 
   data for download. Web Data Commons thus enables you to use the data without 
needing 
-  to crawl the Web yourself.
\ No newline at end of file
+  to crawl the Web yourself.
+
+[./images/nutch_logo_tm.png]
+
+  {{{http://nutch.apache.org}Apache Nutch}}
+
+  Apache Nutch is a well matured, production ready Web crawler. Nutch enables 
fine 
+  grained configuration, relying on Apache Hadoop data structures, which are 
+  great for batch processing.
+  Nutch uses Any23 within it's plugin infrastructure to extract structured 
data markup
+  from Webpages. This data can then be indexed into one of the Nutch supported 
storage
+  mechanisms.
+

http://git-wip-us.apache.org/repos/asf/any23/blob/616bdf89/src/site/resources/images/nutch_logo_tm.png
----------------------------------------------------------------------
diff --git a/src/site/resources/images/nutch_logo_tm.png 
b/src/site/resources/images/nutch_logo_tm.png
new file mode 100644
index 0000000..325ca1e
Binary files /dev/null and b/src/site/resources/images/nutch_logo_tm.png differ

Reply via email to