[jira] [Commented] (LUCENE-4462) Publishing flushed segments is single threaded and too costly

2013-03-22 Thread Commit Tag Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13610688#comment-13610688
 ] 

Commit Tag Bot commented on LUCENE-4462:


[branch_4x commit] Simon Willnauer
http://svn.apache.org/viewvc?view=revisionrevision=1397237

LUCENE-4462: Flush Deletes, SegmentInfos and build CFS concurrently in DWPT


 Publishing flushed segments is single threaded and too costly
 -

 Key: LUCENE-4462
 URL: https://issues.apache.org/jira/browse/LUCENE-4462
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index
Affects Versions: 4.0-ALPHA, 4.0-BETA, 4.0
Reporter: Michael McCandless
Assignee: Simon Willnauer
 Fix For: 4.1, 5.0

 Attachments: LUCENE-4462.patch, LUCENE-4462.patch


 Spinoff from http://lucene.markmail.org/thread/4li6bbomru35qn7w
 The new TestBagOfPostings failed the build because it timed out after 2 hours 
 ... but in digging I found that it was a starvation issue: the 4 threads were 
 flushing segments much faster than the 1 thread could publish them.
 I think this is because publishing segments 
 (DocumentsWriter.publishFlushedSegment) is actually rather costly (creates 
 CFS file if necessary, writes .si, etc.).
 I committed a workaround for now, to prevent starvation (see svn diff -c 
 1394704 https://svn.apache.org/repos/asf/lucene/dev/trunk), but we really 
 should address the root cause by moving these costly ops into flush() so that 
 publishing is a low cost operation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-4462) Publishing flushed segments is single threaded and too costly

2012-10-11 Thread Simon Willnauer (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13474445#comment-13474445
 ] 

Simon Willnauer commented on LUCENE-4462:
-

backported to 4x in revision 1397237.


 Publishing flushed segments is single threaded and too costly
 -

 Key: LUCENE-4462
 URL: https://issues.apache.org/jira/browse/LUCENE-4462
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index
Affects Versions: 4.0-ALPHA, 4.0-BETA, 4.0
Reporter: Michael McCandless
Assignee: Simon Willnauer
 Fix For: 4.1, 5.0

 Attachments: LUCENE-4462.patch, LUCENE-4462.patch


 Spinoff from http://lucene.markmail.org/thread/4li6bbomru35qn7w
 The new TestBagOfPostings failed the build because it timed out after 2 hours 
 ... but in digging I found that it was a starvation issue: the 4 threads were 
 flushing segments much faster than the 1 thread could publish them.
 I think this is because publishing segments 
 (DocumentsWriter.publishFlushedSegment) is actually rather costly (creates 
 CFS file if necessary, writes .si, etc.).
 I committed a workaround for now, to prevent starvation (see svn diff -c 
 1394704 https://svn.apache.org/repos/asf/lucene/dev/trunk), but we really 
 should address the root cause by moving these costly ops into flush() so that 
 publishing is a low cost operation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-4462) Publishing flushed segments is single threaded and too costly

2012-10-10 Thread Simon Willnauer (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13473069#comment-13473069
 ] 

Simon Willnauer commented on LUCENE-4462:
-

Committed to trunk in revision 1396500


 Publishing flushed segments is single threaded and too costly
 -

 Key: LUCENE-4462
 URL: https://issues.apache.org/jira/browse/LUCENE-4462
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index
Affects Versions: 4.0-ALPHA, 4.0-BETA, 4.0
Reporter: Michael McCandless
Assignee: Simon Willnauer
 Fix For: 4.1, 5.0

 Attachments: LUCENE-4462.patch, LUCENE-4462.patch


 Spinoff from http://lucene.markmail.org/thread/4li6bbomru35qn7w
 The new TestBagOfPostings failed the build because it timed out after 2 hours 
 ... but in digging I found that it was a starvation issue: the 4 threads were 
 flushing segments much faster than the 1 thread could publish them.
 I think this is because publishing segments 
 (DocumentsWriter.publishFlushedSegment) is actually rather costly (creates 
 CFS file if necessary, writes .si, etc.).
 I committed a workaround for now, to prevent starvation (see svn diff -c 
 1394704 https://svn.apache.org/repos/asf/lucene/dev/trunk), but we really 
 should address the root cause by moving these costly ops into flush() so that 
 publishing is a low cost operation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-4462) Publishing flushed segments is single threaded and too costly

2012-10-08 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13471662#comment-13471662
 ] 

Michael McCandless commented on LUCENE-4462:


Patch looks good, thanks Simon!

I think we should keep the safety in there (the fallback to forcePurge if too 
many segments are backlogged)...?  Hopefully it never needs to run... but just 
in case.

 Publishing flushed segments is single threaded and too costly
 -

 Key: LUCENE-4462
 URL: https://issues.apache.org/jira/browse/LUCENE-4462
 Project: Lucene - Core
  Issue Type: Improvement
Reporter: Michael McCandless
Assignee: Simon Willnauer
 Attachments: LUCENE-4462.patch


 Spinoff from http://lucene.markmail.org/thread/4li6bbomru35qn7w
 The new TestBagOfPostings failed the build because it timed out after 2 hours 
 ... but in digging I found that it was a starvation issue: the 4 threads were 
 flushing segments much faster than the 1 thread could publish them.
 I think this is because publishing segments 
 (DocumentsWriter.publishFlushedSegment) is actually rather costly (creates 
 CFS file if necessary, writes .si, etc.).
 I committed a workaround for now, to prevent starvation (see svn diff -c 
 1394704 https://svn.apache.org/repos/asf/lucene/dev/trunk), but we really 
 should address the root cause by moving these costly ops into flush() so that 
 publishing is a low cost operation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-4462) Publishing flushed segments is single threaded and too costly

2012-10-08 Thread Simon Willnauer (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13471882#comment-13471882
 ] 

Simon Willnauer commented on LUCENE-4462:
-

bq. I think we should keep the safety in there (the fallback to forcePurge if 
too many segments are backlogged)...? Hopefully it never needs to run... but 
just in case.

I agree, I remove it for beasting. I will add back and commit. I will let this 
bake in a bit and then port to 4.x

 Publishing flushed segments is single threaded and too costly
 -

 Key: LUCENE-4462
 URL: https://issues.apache.org/jira/browse/LUCENE-4462
 Project: Lucene - Core
  Issue Type: Improvement
Reporter: Michael McCandless
Assignee: Simon Willnauer
 Attachments: LUCENE-4462.patch


 Spinoff from http://lucene.markmail.org/thread/4li6bbomru35qn7w
 The new TestBagOfPostings failed the build because it timed out after 2 hours 
 ... but in digging I found that it was a starvation issue: the 4 threads were 
 flushing segments much faster than the 1 thread could publish them.
 I think this is because publishing segments 
 (DocumentsWriter.publishFlushedSegment) is actually rather costly (creates 
 CFS file if necessary, writes .si, etc.).
 I committed a workaround for now, to prevent starvation (see svn diff -c 
 1394704 https://svn.apache.org/repos/asf/lucene/dev/trunk), but we really 
 should address the root cause by moving these costly ops into flush() so that 
 publishing is a low cost operation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org