[jira] [Commented] (SOLR-12630) Query children without using block join

David Smiley (JIRA) Wed, 08 Aug 2018 08:46:14 -0700


    [ 
https://issues.apache.org/jira/browse/SOLR-12630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16573404#comment-16573404
 ]


David Smiley commented on SOLR-12630:
-------------------------------------

Although not a strict alternative, I've been interested in exploring ways to 
make updates faster. If only updates could be cheaper, choices like the above 
would tip in favor of simply sticking with block join.  Admittedly these ideas 
are wildly out of scope for this issue (and broadly beneficial to many 
use-cases unrelated to nested docs!). But what the heck; I'll share them here 
anyway. Some ideas:
 * For full-text fields with an expensive analysis configuration, the update 
process could be made to leverage term vectors to avoid re-analysis.  Given 
how heavy term vectors are, though, I'm unsure how fruitful this would be. I 
have a pretty good idea of what's involved internally, and have always wanted 
to try a POC/experiment.
 * We have numeric in-place updates but not yet String in-place updates for 
docValues-only fields. I recall there was WIP for that by Shai, but for 
whatever reason it stalled.
 * Lucene's ParallelLeafReader is a hidden gem, allowing you to partition 
updates of fields to different internal indexes (and other use-cases). PLR 
itself is actually rather simple, but it would take a lot of other (complex) 
code to completely implement the strategy. If you only need to update certain 
fields, typically small ones (while leaving lots of other data alone), then 
this is an interesting possibility. Implementing this would likely result in a 
new schema attribute on fields called "fieldGroup" or some-such, to set aside 
certain fields that can be updated faster on their own, since updating them 
wouldn't involve re-indexing fields in other field groups.
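
To make the "fieldGroup" idea in the last bullet a bit more concrete, here is 
a toy model. It is plain Java, not Lucene or Solr code: the class 
FieldGroupIndexSketch, its methods, and the group names are all hypothetical, 
and each in-memory map merely stands in for one of the parallel internal 
indexes that something like ParallelLeafReader would present as a single view. 
It only illustrates why updating a field in a small group avoids rewriting 
fields that live in other groups.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

/** Toy model of field groups; hypothetical, NOT Lucene/Solr API. */
final class FieldGroupIndexSketch {
    // field name -> name of the (hypothetical) field group that owns it
    private final Map<String, String> groupOfField = new HashMap<>();
    // group name -> (docId -> field values); each inner map stands in for one internal index
    private final Map<String, Map<Integer, Map<String, String>>> groups = new LinkedHashMap<>();
    // crude cost counter: how many field values have been (re)written so far
    private int fieldsWritten = 0;

    /** Declares which group a field belongs to, like a schema attribute would. */
    void assignField(String field, String group) {
        groupOfField.put(field, group);
        groups.computeIfAbsent(group, g -> new HashMap<>());
    }

    /** Initial indexing: every field is written once, into its own group. */
    void addDocument(int docId, Map<String, String> fields) {
        for (Map.Entry<String, String> e : fields.entrySet()) {
            String group = groupOf(e.getKey());
            groups.get(group)
                  .computeIfAbsent(docId, d -> new HashMap<>())
                  .put(e.getKey(), e.getValue());
            fieldsWritten++;
        }
    }

    /** Update: rewrites only the document's entry in the group owning the field. */
    void updateField(int docId, String field, String value) {
        Map<Integer, Map<String, String>> index = groups.get(groupOf(field));
        Map<String, String> rewritten = new HashMap<>(index.getOrDefault(docId, new HashMap<>()));
        rewritten.put(field, value);
        index.put(docId, rewritten);
        fieldsWritten += rewritten.size(); // cost is the size of this one group only
    }

    /** Merged view across all groups, aligned by docId. */
    Map<String, String> document(int docId) {
        Map<String, String> merged = new HashMap<>();
        for (Map<Integer, Map<String, String>> index : groups.values()) {
            merged.putAll(index.getOrDefault(docId, new HashMap<>()));
        }
        return merged;
    }

    int fieldsWritten() {
        return fieldsWritten;
    }

    private String groupOf(String field) {
        String group = groupOfField.get(field);
        if (group == null) {
            throw new IllegalArgumentException("field has no group: " + field);
        }
        return group;
    }
}
```

In this sketch, a document with "title" and "body" in one group and "price" in 
another costs three field writes to add, and a later update to "price" costs 
just one more, because the group holding "title" and "body" is left alone. A 
real implementation would also have to keep document ids aligned across the 
parallel indexes through merges and deletes, which is where the complex code 
mentioned above comes in.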

> Query children without using block join
> ---------------------------------------
>
>                 Key: SOLR-12630
>                 URL: https://issues.apache.org/jira/browse/SOLR-12630
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: mosh
>            Priority: Major
>
> Currently work is being put into Solr to support nested document hierarchies, 
> keeping the original relation between parent and child docs 
> ([SOLR-12298|https://issues.apache.org/jira/browse/SOLR-12298], 
> [SOLR-12441|https://issues.apache.org/jira/browse/SOLR-12441]). Currently the 
> only way to query child documents is using BlockJoin. I was thinking perhaps 
> we could introduce a way to query documents that are not indexed in the same 
> block with a different transformer, or perhaps add this feature to the 
> ChildDocTransformer.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
