Mikhail Khludnev created SOLR-4799:
--------------------------------------

             Summary: SQLEntityProcessor for zip joins
                 Key: SOLR-4799
                 URL: https://issues.apache.org/jira/browse/SOLR-4799
             Project: Solr
          Issue Type: New Feature
          Components: contrib - DataImportHandler
            Reporter: Mikhail Khludnev
            Priority: Minor


DIH is mostly considered a playground tool, and real-world usage usually ends up 
with SolrJ. I want to contribute a few improvements targeting DIH performance.

This one provides a performant approach for joining SQL entities with a very 
small memory footprint, in contrast to 
http://wiki.apache.org/solr/DataImportHandler#CachedSqlEntityProcessor

The idea is:

* the parent table is explicitly ordered by its PK in SQL
* the child table is explicitly ordered by its parent_id FK in SQL
* the child entity processor joins the two ordered result sets with a 'zipper' 
  (merge-join) algorithm.
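
To illustrate the idea, here is a minimal sketch of such a zipper join. It is not the proposed patch: it assumes integer keys, and plain iterators stand in for the two ordered JDBC result sets. The names Row, zipJoin and ZipperJoinSketch are hypothetical and are not DIH classes.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class ZipperJoinSketch {

    /** Hypothetical row: a join key (parent PK or child parent_id FK) plus a payload. */
    static final class Row {
        final int key;
        final String value;
        Row(int key, String value) { this.key = key; this.value = value; }
    }

    /**
     * Joins two streams that are BOTH sorted ascending by key.
     * Only one child row is buffered at a time. The result map exists only to
     * make the sketch easy to inspect; a real processor would emit each parent
     * with its children as soon as they are matched instead of collecting them.
     */
    static Map<Integer, List<String>> zipJoin(Iterator<Row> parents, Iterator<Row> children) {
        Map<Integer, List<String>> joined = new LinkedHashMap<>();
        Row child = children.hasNext() ? children.next() : null;
        while (parents.hasNext()) {
            Row parent = parents.next();
            List<String> matches = new ArrayList<>();
            // Skip orphan children whose parent_id is below the current parent PK.
            while (child != null && child.key < parent.key) {
                child = children.hasNext() ? children.next() : null;
            }
            // Consume every child that belongs to the current parent.
            while (child != null && child.key == parent.key) {
                matches.add(child.value);
                child = children.hasNext() ? children.next() : null;
            }
            joined.put(parent.key, matches);
        }
        return joined;
    }

    public static void main(String[] args) {
        List<Row> parents = Arrays.asList(new Row(1, "p1"), new Row(2, "p2"), new Row(3, "p3"));
        List<Row> children = Arrays.asList(new Row(1, "a"), new Row(1, "b"), new Row(3, "c"));
        // prints {1=[a, b], 2=[], 3=[c]}
        System.out.println(zipJoin(parents.iterator(), children.iterator()));
    }
}
```

Each stream is read exactly once and the child cursor never moves backwards, which is why both queries must carry an explicit ORDER BY on the join key.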

Do you think it's worth contributing this to DIH?

cc: [~goksron] [~jdyer]

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
