Re: How to extract entities from documents using Microdata?

Rüdiger Kurz Sat, 16 Mar 2013 08:56:57 -0700

Hi Walter,

thanks for the quick reply. Are the extracted entities from thehtmlextractor enhancement engine automatically stored into the entity hub?

What I want to reach is to get an index that stores the extractedentities and also the document itself with references on the entitiesrelated to this document. It would be great if that could be done byconfiguration only. Maybe someone could lend me a hand with building theright enhancement chain as a first step.

In my mind is building up a Solr Search UI offering entity basedautosuggestion including spellchecker and faceted search.


Thanks again.

Am 16.03.2013 16:29, schrieb Walter Kasper:

Dear Rüdiger,

The htmlextractor enhancement engine provides a microdata extractor that
should work well for schema.org annotations. Just test it with your data.

Best regards,

Walter

Rüdiger Kurz wrote:

Hello Stanbolers,

I want to extract and then store entities from HTML documents that are
using Microdata annotations based on the type hierarchy of schema.org
as Ontology. I appreciate any kind of approach including the use of VIE.

Many thanks in advance
Rüdiger


--
Rüdiger Kurz

-------------------

Alkacon Software GmbH - The OpenCms Experts
An der Wachsfabrik 13
50996 Koeln, DE

http://www.alkacon.com
http://www.opencms.org

Geschäftsführer: Alexander Kandzior, Amtsgericht Köln, HRB 54613

Re: How to extract entities from documents using Microdata?

Reply via email to