Thanks Rafa for the idea.

Are you suggesting that I should customize the data handler of the external
Solr instance which extracts data from the mysql datasource and send the
content to Stanbol for enhancement prior to indexing?

That means I may not need any modifications to Stanbol.. I will check the
possibilities from Solr perspective ..I just started learning Solr.. :)

Thanks,
Dileepa



On Fri, Oct 25, 2013 at 4:14 PM, Rafa Haro <rh...@zaizi.com> wrote:

> Hi Dileepa,
>
> Why not just enhance the content before indexing it in your Solr. If you
> are using a Solr DataImportHandler, you could customize it for making the
> proper REST requests to the Enhancer.
>
> WDYT?
>
> Cheers,
> Rafa
>
> El 25/10/13 11:37, Dileepa Jayakody escribió:
>
>  Hi Antonio,
>>
>> Thanks for your valuable input.
>>
>> My idea of the best data flow will be;
>>
>> mysqlDB >> Solr contenthub >> enhancement-chain (to extract
>> Person/Organization) >> Solr contenthub (so facet search on the content
>> can
>> be done now)
>>
>> It will be even better to run the entity linking enhancement flow (to tag
>> Person and Organization) when importing data from mysql db straight to
>> Stanbol.
>>
>> What will be the best way to achieve my requirement? Appreciate all the
>> ideas I can get.. :)
>>
>> Thanks,
>> Dileepa
>>
>>
>> On Fri, Oct 25, 2013 at 3:02 PM, Antonio David Perez Morales <
>> ape...@zaizi.com> wrote:
>>
>>  Hi
>>>
>>> I don't know if what you want is possible, but importing directly data
>>> into
>>> the Solr ContentHub will not extract entities neither store them in Solr
>>> because it's done using the REST interface of ContentHub component.
>>>
>>> Regards
>>>
>>>
>>> On Fri, Oct 25, 2013 at 11:26 AM, Dileepa Jayakody <
>>> dileepajayak...@gmail.com> wrote:
>>>
>>>  Hi Antonio and all,
>>>>
>>>> My requirement is not extracting entities from the external Solr
>>>>
>>> instance.
>>>
>>>> It contains indexed content imported from a mysql database periodically
>>>> using a Solr DataImportHandler.
>>>> I need to enhance those content using Stanbol to identify Person and
>>>> Organization entities, so that they can be used as
>>>> facets<http://searchhub.org/**2009/09/02/faceted-search-**with-solr/<http://searchhub.org/2009/09/02/faceted-search-with-solr/>>
>>>> for
>>>> Solr search.
>>>>
>>>> So I was thinking, this will be easier if I can configure Stanbol
>>>> contenthub to use the external Solr instance as a contenthub-store.
>>>> Another option is to import data straight from the mysql database to the
>>>> Solr index in Stanbol contenthub store..Since Solr has the data import
>>>> handler functionality does Stanbol support it by default?
>>>>
>>>> Open for ideas.
>>>>
>>>> Thanks,
>>>> Dileepa
>>>>
>>>>
>>>>
>>>>
>>>> On Fri, Oct 25, 2013 at 2:31 PM, Antonio David Perez Morales <
>>>> ape...@zaizi.com> wrote:
>>>>
>>>>  Hi Dileepa.
>>>>>
>>>>> I don't know if you can change the Solr instance of the ContentHub (I
>>>>>
>>>> think
>>>>
>>>>> it is not possible) but thinking in your problem, you could do the
>>>>> following:
>>>>>
>>>>> - Create a new SolrYard pointing to your Solr instance (instead of
>>>>>
>>>> creating
>>>>
>>>>> a new core in the the Embedded Solr instance by default)
>>>>> - Create a new Site for your entities using this SolrYard
>>>>> - Create an Enhancement Chain configuring the linking engine pointing
>>>>>
>>>> to
>>>
>>>> your site and the specific field of your site to perform the linking
>>>>> - Create a new ContentHub using this EnhancementChain and defining a
>>>>>
>>>> LDPath
>>>>
>>>>> program to extract the properties of the extracted entities you want to
>>>>> store in the Solr Content Hub.
>>>>> - Use the ContentHub to perform queries over the content and the
>>>>>
>>>> extracted
>>>>
>>>>> entities of it (only the fields extracted by the LDPath program)
>>>>>
>>>>> I would do so but I hope Rupert or another Stanbol guy can give more
>>>>> information about this problem.
>>>>>
>>>>> Regards
>>>>>
>>>>>
>>>>> On Thu, Oct 24, 2013 at 8:03 PM, Dileepa Jayakody <
>>>>> dileepajayak...@gmail.com
>>>>>
>>>>>> wrote:
>>>>>> Hi All,
>>>>>>
>>>>>> Is $subject possible?
>>>>>> I have a Solr server which imports data from a mysql database and
>>>>>>
>>>>> index
>>>
>>>> the
>>>>>
>>>>>> content periodically. I need to identify Person and Organization
>>>>>>
>>>>> mentions
>>>>
>>>>> in those content and make them searchable in the Solr index.
>>>>>>
>>>>>>
>>>>>> Can I configure Stanbol contenthub to use my existing Solr core and
>>>>>>
>>>>> enhance
>>>>>
>>>>>> content by adding Person and Organization metadata to it?
>>>>>>
>>>>>> Thanks,
>>>>>> Dileepa
>>>>>>
>>>>>>  --
>>>>>
>>>>> ------------------------------
>>>>> This message should be regarded as confidential. If you have received
>>>>>
>>>> this
>>>>
>>>>> email in error please notify the sender and destroy it immediately.
>>>>> Statements of intent shall only become binding when confirmed in hard
>>>>>
>>>> copy
>>>>
>>>>> by an authorised signatory.
>>>>>
>>>>> Zaizi Ltd is registered in England and Wales with the registration
>>>>>
>>>> number
>>>
>>>> 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road,
>>>>> London W6 7AN.
>>>>>
>>>>>  --
>>>
>>> ------------------------------
>>> This message should be regarded as confidential. If you have received
>>> this
>>> email in error please notify the sender and destroy it immediately.
>>> Statements of intent shall only become binding when confirmed in hard
>>> copy
>>> by an authorised signatory.
>>>
>>> Zaizi Ltd is registered in England and Wales with the registration number
>>> 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road,
>>> London W6 7AN.
>>>
>>>
>
> --
>
> ------------------------------
> This message should be regarded as confidential. If you have received this
> email in error please notify the sender and destroy it immediately.
> Statements of intent shall only become binding when confirmed in hard copy
> by an authorised signatory.
>
> Zaizi Ltd is registered in England and Wales with the registration number
> 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road,
> London W6 7AN.
>

Reply via email to