Author: rwesten
Date: Tue Jun 26 09:49:12 2012
New Revision: 1353900
URL: http://svn.apache.org/viewvc?rev=1353900&view=rev
Log:
suggested new start page for Apache Stanbol - work in progres
Modified:
incubator/stanbol/site/trunk/content/stanbol/index.mdtext
Modified: incubator/stanbol/site/trunk/content/stanbol/index.mdtext
URL:
http://svn.apache.org/viewvc/incubator/stanbol/site/trunk/content/stanbol/index.mdtext?rev=1353900&r1=1353899&r2=1353900&view=diff
==============================================================================
--- incubator/stanbol/site/trunk/content/stanbol/index.mdtext (original)
+++ incubator/stanbol/site/trunk/content/stanbol/index.mdtext Tue Jun 26
09:49:12 2012
@@ -1,19 +1,52 @@
Title: Welcome to Apache Stanbol (incubating)
-Apache Stanbol (currently in [incubation](http://incubator.apache.org)) is an
open source modular software stack and reusable set of components for semantic
content management.
+Apache Stanbol ([incubating](http://incubator.apache.org)) provides a set of
RESTful services that aim to extend traditional content management systems with
semantic services. bring semantic technologies to Content Management Systems.
-Apache Stanbol [components](docs/trunk/components.html) are meant to be
accessed over RESTful interfaces to provide semantic services for content
management. Thus, one application is to extend traditional content management
systems with (internal or external) semantic services. Additionally, Apache
Stanbol let's you create new types of content management systems with semantics
at their core. The current code is written in Java and based on the
[OSGi](http://www2.osgi.org/Specifications/HomePage) component framework.
+
+
+the main **Features** are
+
+### Content Enhancement
+
+<span style="float:right"> </span>
+
+The Stanbol Enhancer provides a [RESTful
API](docs/trunk/enhancer/enhancerrest.html) that allows to [extract semantic
information](docs/trunk/contentenhancement.html) from parsed Content. Results
are represented as RDF graph and can be used on the front end e.g. to improve
[Tagging](docs/trunk/enhancementusage.html#entity-tagging-use-tags-to-relate-you-content-to-persons-places-events)
or even [inline Content
Annotation](docs/trunk/enhancementusage.html#entity-checker-inline-editing-of-content-enhancements).
In the backend extracted information can be used to improve search and content
processing workflows.
+
+In more details the Stanbol Enhancer can enhancer Content in [multiple
languages](docs/trunk/multilingual.html). It can be configured to use [custom
vocabularies](docs/trunk/customvocabulary.html). This [List of Enhancement
Engines](docs/trunk/enhancer/engines/list.html) provides a good overview about
the different option. However users with special needs can also extend the
Stanbol Enhancers by implementing their own [Enhancement
Engine](docs/trunk/enhancer/engines/).
+
+### Manage your Domain Data
+
+<span style="float:right"> </span>
+
+Information (Data/Knowledge) about relevant Entities are key for any semantic
System. But those information may come from a variety of source. The Stanbol
Entityhub provides tools and services that allow you and the other Stanbol
components to access those data via RESTful services.
+
+The Entityhub allows you to create/manage your own local Entities. It also
provides a tool for [create local indexes](docs/trunk/customvocabulary.html) of
datasets. This is important for supporting fast local queries as required by
the Stanbol Enhancer. The Entityhub can also reference and cache remotely
hosted datasets. It directly supports [Linked Data](http://linkeddata.org)
protocols but can also be extended to work with services that use special
protocols and formats.
+
+The Apache Entityhub also implements the [Google
Refine](http://code.google.com/p/google-refine/) [Reconciliation
API](http://code.google.com/p/google-refine/wiki/ReconciliationServiceApi).
Google Refine is a powerful tool for working with messy data. The
reconciliation step allows to link those data with information of other data
sets - similar to the Stanbol Enhancer suggesting links within parsed content.
-Apache Stanbol's main features are:
+### Knowledge Models and Reasoning
+
+<span style="float:right"> </span>
+
+TODO: Services that are used to define and manipulate the data models (e.g.
ontologies) that are used to store the semantic information.
+
+
+### Semantic Indexing and Search
+
+<span style="float:right"> </span>
+
+TODO: Services that store (or cache) semantic information, i.e. enhanced
content, entities, facts, and make it searchable
+
+
+### Used Technologies
+
+- Features provided as **RESTful Services* returning results as
**[RDF](http://www.w3.org/RDF/)**(Resource Description Language) and
[JSON](http://www.json.org/). Stanbol also supports the use of
**[JSON-LD](http://json-ld.org/)**.
+- Available as **Standalone application** (runable Jar) or **Web Archive**(WAR
file) deployable on any Servlet Container such as Apache Tomcat.
+- Written in **Java** based on
**[OSGi](http://www2.osgi.org/Specifications/HomePage)** as component framework
+- Implemented using frameworks such as [Apache
Solr](http://lucene.apache.org/solr/) - for semantic search; [Apache
Tika](http://tika.apache.org/) - for plain text and metadata extraction;
[Apache OpenNLP](http://opennlp.apache.org/) - for natural language processing;
[Apache Clerezza](http://incubator.apache.org/clerezza/) and [Apache
Jena](http://jena.apache.org/) - as RDF framework and Storage; [Apache
Felix](http://felix.apache.org) as default OSGI framework and [Apache
Sling](http://sling.apache.org) for deployment.
+
+Apache Stanbol [components](docs/trunk/components.html) are meant to be
accessed over RESTful interfaces to provide semantic services for content
management. Thus, one application is to extend traditional content management
systems with (internal or external) semantic services. Additionally, Apache
Stanbol let's you create new types of content management systems with semantics
at their core. The current code is written in Java and based on the
[OSGi](http://www2.osgi.org/Specifications/HomePage) component framework.
-- **[Content Enhancement](docs/trunk/enhancer)**<br>
- Services that add semantic information to ânon-semanticâ pieces of
content.
-- **[Reasoning](docs/trunk/reasoners.html)**<br>
- Services that are able to retrieve additional semantic information about the
content based on the semantic information retrieved via content enhancement.
-- **[Knowledge Models](docs/trunk/ontologymanager)**<br>
- Services that are used to define and manipulate the data models (e.g.
ontologies) that are used to store the semantic information.
-- **[Persistence](docs/trunk/contenthub)**<br>
- Services that store (or cache) semantic information, i.e. enhanced content,
entities, facts, and make it searchable.
Apache Stanbol features provide the basics to create content management
systems with semantically advanced user interfaces. Those user interfaces
benefit from the semantic information that can be handled by Apache Stanbol.
See the [documentation](docs/trunk/) pages for more details.