Hi Staboler,
during ApacheCon in Sinsheim I had some interesting conversations with
Fabian, Rupert and Anil as result I want to summarize one of the
discussions as an user story telling a typical requirement for us as CMS
provider.
Talking about traditional Content Management Systems and assuming that
they don't store semantic informations is not correct. For example CMS
Systems already deliver RDFa annotated HTML, nearly all systems are
providing some tagging/categorizing mechanism. Specially OpenCms
provides a generic approach to define a structured content and therefore
we have the information that a specific field/item of a content has a
specified type and a defined label. E.g. A technology event named
ApacheCon takes place in Sinsheim from 05. Nov until 08. Nov 2012 is the
information that is already stored in OpenCms. More over OpenCms is able
to connect that event with all speakers/persons that will make a
presentation on that event, ...
What we would like to achieve is not only a plain text enhancement more
over we are interested in telling Stanbol all informations and
associations we already know. In other words we absolutely don't want to
lose the semantic information that is already existent in OpenCms.
A good starting point would be a REST endpoint providing the ability to
retrieve a RDFa annotated HTML document and than extracts the RDFa in
order to store those inside the semantic-index/entity-hub/... as I
previously suggested on the list under the subject "Extend stanbol
content hub for RDFa support". Maybe the content hub is not the right
component, but the requirement of RDFa extraction is still existent.
--
Kind Regards,
Rüdiger.
-------------------
Rüdiger Kurz
Alkacon Software GmbH - The OpenCms Experts
http://www.alkacon.com- http://www.opencms.org