[ 
https://issues.apache.org/jira/browse/SOLR-1613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Inger updated SOLR-1613:
-----------------------------

    Attachment: SOLR-1613.patch

> Segmentation of data imports (not just full or single record imports)
> ---------------------------------------------------------------------
>
>                 Key: SOLR-1613
>                 URL: https://issues.apache.org/jira/browse/SOLR-1613
>             Project: Solr
>          Issue Type: New Feature
>          Components: contrib - DataImportHandler
>    Affects Versions: 1.4
>            Reporter: Matt Inger
>         Attachments: SOLR-1613.patch
>
>
> It is desirable to able to segment imports by a particular field in the root 
> entity record so that you can update a particular segment of your database 
> when bulk updates occur on the backend database.  For instance, if a bulk 
> update occurs for a particular customer, it would be more efficient to be 
> able to update a full segment of your index for that customer rather than 
> issuing updates for every single user in your index for that customer, or 
> updating the entire index.  That would be a waste of processing power.
> Instead, it would be more efficient to specify that a particular document 
> field in the root entity was a segmentation field, and define an additional 
> query on the root entity (i'm basing my example on a jdbc based datasource):
> <entity name="user" pk="userid" segment="customerid" ... 
>              query="..." segmentQuery="select ... where 
> customerid=${dataimporter.request.segment}" />
> Then, when you request a segment update, you specify the segment as a 
> parameter to your request
>     /solr/db/dataimport?command=segment-import&segment=1000
> I've worked out the code segments required to do this for the JdbcDataSource, 
> though I'm not sure what additional changes would be necessary for other 
> datasource types, and am attaching a patch which includes these changes.
>              

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to