Hi Staboler,

during ApacheCon in Sinsheim I had some interesting conversations with Fabian, Rupert and Anil as result I want to summarize one of the discussions as an user story telling a typical requirement for us as CMS provider.

Talking about traditional Content Management Systems and assuming that they don't store semantic informations is not correct. For example CMS Systems already deliver RDFa annotated HTML, nearly all systems are providing some tagging/categorizing mechanism. Specially OpenCms provides a generic approach to define a structured content and therefore we have the information that a specific field/item of a content has a specified type and a defined label. E.g. A technology event named ApacheCon takes place in Sinsheim from 05. Nov until 08. Nov 2012 is the information that is already stored in OpenCms. More over OpenCms is able to connect that event with all speakers/persons that will make a presentation on that event, ...

What we would like to achieve is not only a plain text enhancement more over we are interested in telling Stanbol all informations and associations we already know. In other words we absolutely don't want to lose the semantic information that is already existent in OpenCms.

A good starting point would be a REST endpoint providing the ability to retrieve a RDFa annotated HTML document and than extracts the RDFa in order to store those inside the semantic-index/entity-hub/... as I previously suggested on the list under the subject "Extend stanbol content hub for RDFa support". Maybe the content hub is not the right component, but the requirement of RDFa extraction is still existent.

--
Kind Regards,
Rüdiger.

-------------------

Rüdiger Kurz

Alkacon Software GmbH  - The OpenCms Experts
http://www.alkacon.com- http://www.opencms.org

Reply via email to