[ https://issues.apache.org/jira/browse/SOLR-469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12591273#action_12591273 ]
Grant Ingersoll commented on SOLR-469: -------------------------------------- This is some really cool stuff and should be added at some point soon. Some high level questions/comments, as I haven't looked in depth into the patch yet: Is it possible to just pass in SQL statements, etc. via a request? Or do they have to be configured ahead of time? What about connections? On the one hand, having to configure it ahead of time can lock things down and be a little more secure, on the other hand, having to configure it ahead of time can lock things down and take away flexibility. I hope to combine some of the stuff I've written to do this, with your patch. Not sure how to say it, but all the configuration starts to have the feel of Hibernate and/or the other ORMs. Would there be a way to leverage something that already exists? Although I do see from the comment the other day that you have reduced some of the verbosity How is scheduling handled? Finally, I'm not totally sure where this should live. Solr doesn't currently have a "contrib" area, but this feels like a (major) contrib and may warrant adding it under a contrib area. > Data Import RequestHandler > -------------------------- > > Key: SOLR-469 > URL: https://issues.apache.org/jira/browse/SOLR-469 > Project: Solr > Issue Type: New Feature > Components: update > Affects Versions: 1.3 > Reporter: Noble Paul > Assignee: Grant Ingersoll > Fix For: 1.3 > > Attachments: SOLR-469.patch, SOLR-469.patch, SOLR-469.patch, > SOLR-469.patch, SOLR-469.patch, SOLR-469.patch, SOLR-469.patch, SOLR-469.patch > > > We need a RequestHandler Which can import data from a DB or other dataSources > into the Solr index .Think of it as an advanced form of SqlUpload Plugin > (SOLR-103). > The way it works is as follows. > * Provide a configuration file (xml) to the Handler which takes in the > necessary SQL queries and mappings to a solr schema > - It also takes in a properties file for the data source > configuraution > * Given the configuration it can also generate the solr schema.xml > * It is registered as a RequestHandler which can take two commands > do-full-import, do-delta-import > - do-full-import - dumps all the data from the Database into the > index (based on the SQL query in configuration) > - do-delta-import - dumps all the data that has changed since last > import. (We assume a modified-timestamp column in tables) > * It provides a admin page > - where we can schedule it to be run automatically at regular > intervals > - It shows the status of the Handler (idle, full-import, > delta-import) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.