[ https://issues.apache.org/jira/browse/SOLR-7584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14555348#comment-14555348 ]
Dennis Gove edited comment on SOLR-7584 at 5/22/15 12:27 AM: ------------------------------------------------------------- Adds abstract JoinStream to support joins of N sub-streams. Adds abstract BiJoinStream to limit JoinStream to 2 sub-streams, left and right. Adds concrete InnerJoinStream with support for merge join. Does not handle hash joins. Uses aliasing concept already available in CloudSolrStream. Still work to be done. was (Author: dpgove): Adds JoinStream to support joins of N sub-streams. Adds BiJoinStream to limit JoinStream to 2 sub-streams, left and right. Adds InnerJoinStream with support for merge join. Does not handle hash joins. Uses aliasing concept already available in CloudSolrStream. Still work to be done. > Add Joins to the Streaming API and Streaming Expressions > -------------------------------------------------------- > > Key: SOLR-7584 > URL: https://issues.apache.org/jira/browse/SOLR-7584 > Project: Solr > Issue Type: Improvement > Components: SolrJ > Reporter: Dennis Gove > Priority: Minor > Labels: Streaming > Attachments: SOLR-7584.patch > > > Add InnerJoinStream, LeftOuterJoinStream, and supporting classes to the > Streaming API to allow for joining between sub-streams. > At its basic, it would look something like this > {code} > innerJoin( > search(collection1, q=*:*, fl="fieldA, fieldB, fieldC", ...), > search(collection2, q=*:*, fl="fieldA, fieldD, fieldE", ...), > on="fieldA=fieldA" > ) > {code} > or with multi-field on clauses > {code} > innerJoin( > search(collection1, q=*:*, fl="fieldA, fieldB, fieldC", ...), > search(collection2, q=*:*, fl="fieldA, fieldD, fieldE", ...), > on="fieldA=fieldA, fieldB=fieldD" > ) > {code} > I'd also like to support the option of doing a hash join instead of the > default merge join but I haven't yet figured out the best way to express > that. I'd like to let the user tell us which sub-stream should be hashed (the > least-cost one). > Also, I've been thinking about field aliasing and might want to add a > SelectStream which serves the purpose of allowing us to limit the fields > coming out and rename fields. > Depends on SOLR-7554 -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org