SolrEntityProcessor - DIH EntityProcessor that queries an external Solr via 
SolrJ
---------------------------------------------------------------------------------

                 Key: SOLR-1499
                 URL: https://issues.apache.org/jira/browse/SOLR-1499
             Project: Solr
          Issue Type: New Feature
          Components: contrib - DataImportHandler
            Reporter: Lance Norskog


The SolrEntityProcessor queries an external Solr instance. The Solr documents 
returned are unpacked and emitted as DIH fields.

The SolrEntityProcessor uses the following attributes:

* solr='http://localhost:8983/solr/sms'
** This gives the URL of the target Solr instance.
*** Note: the connection to the target Solr uses the binary SolrJ format.

* query='Jefferson&sort=id+asc'
** This gives the base query string use with Solr. It can include any standard 
Solr request parameter. This attribute is processed under the variable 
resolution rules and can be driven in an inner stage of the indexing pipeline.

* rows='10'
** This gives the number of rows to fetch per request..
** The SolrEntityProcessor always fetches every document that matches the 
request..

* fields='id,tag'
** This selects the fields to be returned from the Solr request.
** These must also be declared as <field> elements.
** As with all fields, template processors can be used to alter the contents to 
be passed downwards.

* timeout='30'
** This limits the query to 5 seconds. This can be used as a fail-safe to 
prevent the indexing session from freezing up. By default the timeout is 5 
minutes.

Limitations:
* Solr errors are not handled correctly.
* Loop control constructs have not been tested.
* Multi-valued returned fields have not been tested.

The unit tests give examples of how to use it as the root entity and an inner 
entity.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to