[
https://issues.apache.org/jira/browse/LUCENE-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12790506#action_12790506
]
John Wang commented on LUCENE-2120:
-----------------------------------
Hi Michael:
bq: Why does Zoie even retain 3 readers? Why not keep only the current one?
1 mem reader for when the disk batch, 1 mem reader for the time disk reader
indexes, 1 disk reader
bq: It looks like the test uses both Wikipedia & Medline for document sources?
Do I really need both?
By default, it only runs with Medline data. You don't need both.
perf/settings/index.properties->data.type dictates which to use, file->medline,
wiki->wikipedia
Also,
You should use the branch: BR_DELETE_OPT
It has the optimization you suggested on handling deleted docs, e.g. should not
check for each hit candidate with IntSetAccelerator.
Also, I have added a DataConsumer to handle delayed reopen for NRT case. You
see the file handle leakage quickly with it: see perf/conf/zoie.properties to
turn on ThrottledLuceneNRTDataConsumer.
On my mac, I use lsof to see the file handle count.
-John
> Possible file handle leak in near real-time reader
> --------------------------------------------------
>
> Key: LUCENE-2120
> URL: https://issues.apache.org/jira/browse/LUCENE-2120
> Project: Lucene - Java
> Issue Type: Bug
> Components: Index
> Affects Versions: 3.1
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Fix For: 3.1
>
>
> Spinoff of LUCENE-1526: Jake/John hit file descriptor exhaustion when testing
> NRT.
> I've tried to repro this, stress testing NRT, saturating reopens, indexing,
> searching, but haven't found any issue.
> Let's try to get to the bottom of it, here...
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]