Hi Dileepa,
I agree with my colleagues, i don't think that trying to modify contehub
concept is the right way.
An easy solution can be to move the Data Import handler logic for your use
case to a specific Solrj index application.
Using java will be really easy to write a custom indexing application that
extract the info from your database, enrich the content and finally it
indexes the content in your Solr instance.

Customizing the Data import handler can be another good solution, in that
case you have to write a customised transformer[1] to enrich your rows and
add new Solr fields.
The you can simply include that in your db-config for the DIH.
Another strategy is eventually customise the Sql Processor. But i prefer
the first strategy. it's more polite.

Cheers

[1] http://wiki.apache.org/solr/DIHCustomTransformer


2013/10/25 Rafa Haro <rh...@zaizi.com>

> Hi Dileepa,
>
> Why not just enhance the content before indexing it in your Solr. If you
> are using a Solr DataImportHandler, you could customize it for making the
> proper REST requests to the Enhancer.
>
> WDYT?
>
> Cheers,
> Rafa
>
> El 25/10/13 11:37, Dileepa Jayakody escribió:
>
>  Hi Antonio,
>>
>> Thanks for your valuable input.
>>
>> My idea of the best data flow will be;
>>
>> mysqlDB >> Solr contenthub >> enhancement-chain (to extract
>> Person/Organization) >> Solr contenthub (so facet search on the content
>> can
>> be done now)
>>
>> It will be even better to run the entity linking enhancement flow (to tag
>> Person and Organization) when importing data from mysql db straight to
>> Stanbol.
>>
>> What will be the best way to achieve my requirement? Appreciate all the
>> ideas I can get.. :)
>>
>> Thanks,
>> Dileepa
>>
>>
>> On Fri, Oct 25, 2013 at 3:02 PM, Antonio David Perez Morales <
>> ape...@zaizi.com> wrote:
>>
>>  Hi
>>>
>>> I don't know if what you want is possible, but importing directly data
>>> into
>>> the Solr ContentHub will not extract entities neither store them in Solr
>>> because it's done using the REST interface of ContentHub component.
>>>
>>> Regards
>>>
>>>
>>> On Fri, Oct 25, 2013 at 11:26 AM, Dileepa Jayakody <
>>> dileepajayak...@gmail.com> wrote:
>>>
>>>  Hi Antonio and all,
>>>>
>>>> My requirement is not extracting entities from the external Solr
>>>>
>>> instance.
>>>
>>>> It contains indexed content imported from a mysql database periodically
>>>> using a Solr DataImportHandler.
>>>> I need to enhance those content using Stanbol to identify Person and
>>>> Organization entities, so that they can be used as
>>>> facets<http://searchhub.org/**2009/09/02/faceted-search-**with-solr/<http://searchhub.org/2009/09/02/faceted-search-with-solr/>>
>>>> for
>>>> Solr search.
>>>>
>>>> So I was thinking, this will be easier if I can configure Stanbol
>>>> contenthub to use the external Solr instance as a contenthub-store.
>>>> Another option is to import data straight from the mysql database to the
>>>> Solr index in Stanbol contenthub store..Since Solr has the data import
>>>> handler functionality does Stanbol support it by default?
>>>>
>>>> Open for ideas.
>>>>
>>>> Thanks,
>>>> Dileepa
>>>>
>>>>
>>>>
>>>>
>>>> On Fri, Oct 25, 2013 at 2:31 PM, Antonio David Perez Morales <
>>>> ape...@zaizi.com> wrote:
>>>>
>>>>  Hi Dileepa.
>>>>>
>>>>> I don't know if you can change the Solr instance of the ContentHub (I
>>>>>
>>>> think
>>>>
>>>>> it is not possible) but thinking in your problem, you could do the
>>>>> following:
>>>>>
>>>>> - Create a new SolrYard pointing to your Solr instance (instead of
>>>>>
>>>> creating
>>>>
>>>>> a new core in the the Embedded Solr instance by default)
>>>>> - Create a new Site for your entities using this SolrYard
>>>>> - Create an Enhancement Chain configuring the linking engine pointing
>>>>>
>>>> to
>>>
>>>> your site and the specific field of your site to perform the linking
>>>>> - Create a new ContentHub using this EnhancementChain and defining a
>>>>>
>>>> LDPath
>>>>
>>>>> program to extract the properties of the extracted entities you want to
>>>>> store in the Solr Content Hub.
>>>>> - Use the ContentHub to perform queries over the content and the
>>>>>
>>>> extracted
>>>>
>>>>> entities of it (only the fields extracted by the LDPath program)
>>>>>
>>>>> I would do so but I hope Rupert or another Stanbol guy can give more
>>>>> information about this problem.
>>>>>
>>>>> Regards
>>>>>
>>>>>
>>>>> On Thu, Oct 24, 2013 at 8:03 PM, Dileepa Jayakody <
>>>>> dileepajayak...@gmail.com
>>>>>
>>>>>> wrote:
>>>>>> Hi All,
>>>>>>
>>>>>> Is $subject possible?
>>>>>> I have a Solr server which imports data from a mysql database and
>>>>>>
>>>>> index
>>>
>>>> the
>>>>>
>>>>>> content periodically. I need to identify Person and Organization
>>>>>>
>>>>> mentions
>>>>
>>>>> in those content and make them searchable in the Solr index.
>>>>>>
>>>>>>
>>>>>> Can I configure Stanbol contenthub to use my existing Solr core and
>>>>>>
>>>>> enhance
>>>>>
>>>>>> content by adding Person and Organization metadata to it?
>>>>>>
>>>>>> Thanks,
>>>>>> Dileepa
>>>>>>
>>>>>>  --
>>>>>
>>>>> ------------------------------
>>>>> This message should be regarded as confidential. If you have received
>>>>>
>>>> this
>>>>
>>>>> email in error please notify the sender and destroy it immediately.
>>>>> Statements of intent shall only become binding when confirmed in hard
>>>>>
>>>> copy
>>>>
>>>>> by an authorised signatory.
>>>>>
>>>>> Zaizi Ltd is registered in England and Wales with the registration
>>>>>
>>>> number
>>>
>>>> 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road,
>>>>> London W6 7AN.
>>>>>
>>>>>  --
>>>
>>> ------------------------------
>>> This message should be regarded as confidential. If you have received
>>> this
>>> email in error please notify the sender and destroy it immediately.
>>> Statements of intent shall only become binding when confirmed in hard
>>> copy
>>> by an authorised signatory.
>>>
>>> Zaizi Ltd is registered in England and Wales with the registration number
>>> 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road,
>>> London W6 7AN.
>>>
>>>
>
> --
>
> ------------------------------
> This message should be regarded as confidential. If you have received this
> email in error please notify the sender and destroy it immediately.
> Statements of intent shall only become binding when confirmed in hard copy
> by an authorised signatory.
>
> Zaizi Ltd is registered in England and Wales with the registration number
> 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road,
> London W6 7AN.
>



-- 
--------------------------

Benedetti Alessandro
Visiting card : http://about.me/alessandro_benedetti

"Tyger, tyger burning bright
In the forests of the night,
What immortal hand or eye
Could frame thy fearful symmetry?"

William Blake - Songs of Experience -1794 England

Reply via email to