[ 
https://issues.apache.org/jira/browse/SOLR-469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12570591#action_12570591
 ] 

Shalin Shekhar Mangar commented on SOLR-469:
--------------------------------------------

Hi Otis,

Thanks for showing interest in this issue and your feedback.

Originally we started developing this to be a pure DB Import tool. But our own 
requirements led to us to keep this general enough to be used with other kinds 
of data sources. For example, we're using this internally for reading from REST 
API's (including RSS/ATOM feeds). Therefore, we kept the name as 
DataImportHandler on purpose. Previously, our data source was JdbcDataSource 
and EntityProcessor was called SqlEntityProcessor. We later extracted 
interfaces out of them as DataSource and EntityProcessor to make them as 
generic as possible. Also note that the DataImportHandler does not care about 
the name of data-config.xml. It could be called anything, all we need is that 
it should be specified in solrconfig.xml

We're developing our generic REST datasources and entity processors and plan to 
contribute them as well. We too are looking forward to see this in Solr and 
we're committed to do whatever it takes to make sure it becomes a part of Solr.

> DB Import RequestHandler
> ------------------------
>
>                 Key: SOLR-469
>                 URL: https://issues.apache.org/jira/browse/SOLR-469
>             Project: Solr
>          Issue Type: New Feature
>          Components: update
>    Affects Versions: 1.3
>            Reporter: Noble Paul
>            Priority: Minor
>             Fix For: 1.3
>
>         Attachments: SOLR-469.patch, SOLR-469.patch, SOLR-469.patch
>
>
> We need a RequestHandler Which can import data from a DB or other dataSources 
> into the Solr index .Think of it as an advanced form of SqlUpload Plugin 
> (SOLR-103).
> The way it works is as follows.
>     * Provide a configuration file (xml) to the Handler which takes in the 
> necessary SQL queries and mappings to a solr schema
>           - It also takes in a properties file for the data source 
> configuraution
>     * Given the configuration it can also generate the solr schema.xml
>     * It is registered as a RequestHandler which can take two commands 
> do-full-import, do-delta-import
>           -  do-full-import - dumps all the data from the Database into the 
> index (based on the SQL query in configuration)
>           - do-delta-import - dumps all the data that has changed since last 
> import. (We assume a modified-timestamp column in tables)
>     * It provides a admin page
>           - where we can schedule it to be run automatically at regular 
> intervals
>           - It shows the status of the Handler (idle, full-import, 
> delta-import)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to