[ 
https://issues.apache.org/jira/browse/LUCENE-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12866128#action_12866128
 ] 

Mark Harwood commented on LUCENE-2454:
--------------------------------------

Robust use of this feature is dependent on careful management of segments i.e. 
that all compound documents are held in the same segment.

Michael Busch suggested the introduction of a new "FlushPolicy" on IndexWriter 
to offer the required control. (see 
http://mail-archives.apache.org/mod_mbox/lucene-dev/201005.mbox/%3c4be5a14c.6040...@gmail.com%3e
 )
Sounds sensible to me given that IndexWriter currently manages to muddle 2 
alternative policies in the one implementation and it looks like we now need a 
third.

Is this the place to start the debate on "FlushPolicy" ?
My guess is this change would involve :
* Deprecating/removing IndexWriter's setMaxBufferedDocs and setRAMBufferSizeMB.
* Providing a new "FlushPolicy" abstract class that is called with a 
"BufferContext " class to hold number buffered docs + ram usage. FlushPolicy is 
asked if flushing of various structures should be triggered given the context
* Provide default implementations of FlushPolicy that are 
number-of-documents-based and RAM-based.
* Provide a special "NestedDocumentFlushPolicy" that can wrap any other policy 
(ram/num docs) but only triggers flushes when application code has primed it to 
say a batch of related documents is completed.

Let me know where it's best to continue the thinking on these IndexWriter 
changes.

> Nested Document query support
> -----------------------------
>
>                 Key: LUCENE-2454
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2454
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Search
>    Affects Versions: 3.0.2
>            Reporter: Mark Harwood
>            Assignee: Mark Harwood
>            Priority: Minor
>         Attachments: LuceneNestedDocumentSupport-1.zip
>
>
> A facility for querying nested documents in a Lucene index as outlined in 
> http://www.slideshare.net/MarkHarwood/proposal-for-nested-document-support-in-lucene

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to