Hi Dileepa,

Why not just enhance the content before indexing it in your Solr. If you are using a Solr DataImportHandler, you could customize it for making the proper REST requests to the Enhancer.

WDYT?

Cheers,
Rafa

El 25/10/13 11:37, Dileepa Jayakody escribió:
Hi Antonio,

Thanks for your valuable input.

My idea of the best data flow will be;

mysqlDB >> Solr contenthub >> enhancement-chain (to extract
Person/Organization) >> Solr contenthub (so facet search on the content can
be done now)

It will be even better to run the entity linking enhancement flow (to tag
Person and Organization) when importing data from mysql db straight to
Stanbol.

What will be the best way to achieve my requirement? Appreciate all the
ideas I can get.. :)

Thanks,
Dileepa


On Fri, Oct 25, 2013 at 3:02 PM, Antonio David Perez Morales <
ape...@zaizi.com> wrote:

Hi

I don't know if what you want is possible, but importing directly data into
the Solr ContentHub will not extract entities neither store them in Solr
because it's done using the REST interface of ContentHub component.

Regards


On Fri, Oct 25, 2013 at 11:26 AM, Dileepa Jayakody <
dileepajayak...@gmail.com> wrote:

Hi Antonio and all,

My requirement is not extracting entities from the external Solr
instance.
It contains indexed content imported from a mysql database periodically
using a Solr DataImportHandler.
I need to enhance those content using Stanbol to identify Person and
Organization entities, so that they can be used as
facets<http://searchhub.org/2009/09/02/faceted-search-with-solr/> for
Solr search.

So I was thinking, this will be easier if I can configure Stanbol
contenthub to use the external Solr instance as a contenthub-store.
Another option is to import data straight from the mysql database to the
Solr index in Stanbol contenthub store..Since Solr has the data import
handler functionality does Stanbol support it by default?

Open for ideas.

Thanks,
Dileepa




On Fri, Oct 25, 2013 at 2:31 PM, Antonio David Perez Morales <
ape...@zaizi.com> wrote:

Hi Dileepa.

I don't know if you can change the Solr instance of the ContentHub (I
think
it is not possible) but thinking in your problem, you could do the
following:

- Create a new SolrYard pointing to your Solr instance (instead of
creating
a new core in the the Embedded Solr instance by default)
- Create a new Site for your entities using this SolrYard
- Create an Enhancement Chain configuring the linking engine pointing
to
your site and the specific field of your site to perform the linking
- Create a new ContentHub using this EnhancementChain and defining a
LDPath
program to extract the properties of the extracted entities you want to
store in the Solr Content Hub.
- Use the ContentHub to perform queries over the content and the
extracted
entities of it (only the fields extracted by the LDPath program)

I would do so but I hope Rupert or another Stanbol guy can give more
information about this problem.

Regards


On Thu, Oct 24, 2013 at 8:03 PM, Dileepa Jayakody <
dileepajayak...@gmail.com
wrote:
Hi All,

Is $subject possible?
I have a Solr server which imports data from a mysql database and
index
the
content periodically. I need to identify Person and Organization
mentions
in those content and make them searchable in the Solr index.


Can I configure Stanbol contenthub to use my existing Solr core and
enhance
content by adding Person and Organization metadata to it?

Thanks,
Dileepa

--

------------------------------
This message should be regarded as confidential. If you have received
this
email in error please notify the sender and destroy it immediately.
Statements of intent shall only become binding when confirmed in hard
copy
by an authorised signatory.

Zaizi Ltd is registered in England and Wales with the registration
number
6440931. The Registered Office is Brook House, 229 Shepherds Bush Road,
London W6 7AN.

--

------------------------------
This message should be regarded as confidential. If you have received this
email in error please notify the sender and destroy it immediately.
Statements of intent shall only become binding when confirmed in hard copy
by an authorised signatory.

Zaizi Ltd is registered in England and Wales with the registration number
6440931. The Registered Office is Brook House, 229 Shepherds Bush Road,
London W6 7AN.



--

------------------------------
This message should be regarded as confidential. If you have received this email in error please notify the sender and destroy it immediately. Statements of intent shall only become binding when confirmed in hard copy by an authorised signatory.

Zaizi Ltd is registered in England and Wales with the registration number 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road, London W6 7AN.

Reply via email to