[ 
https://issues.apache.org/jira/browse/LUCENE-8267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16448272#comment-16448272
 ] 

Robert Muir commented on LUCENE-8267:
-------------------------------------

There are a lot of other alternatives to putting data in heap memory directly 
in the postings format.

The best (IMO) is for the user to use MMapDirectory.preload with the standard 
index format. This way it doesn't impact their java heap and they use supported 
index format. Users can also use RAMDirectory/FileSwitchDirectory to load 
specified files into heap. 

Finally, users can use FSTPostingsFormat which will load *term dictionary only* 
into heap fst. This is way different than Memory/Direct which load not only 
terms but also postings lists and positions and stuff all into heap RAM.

So i don't really see any technical merit for your objection: there are many 
other ways to have a ram-resident terms dictionary, many of them better than 
the inefficient Memory/Direct formats.

> Remove memory codecs from the codebase
> --------------------------------------
>
>                 Key: LUCENE-8267
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8267
>             Project: Lucene - Core
>          Issue Type: Task
>            Reporter: Dawid Weiss
>            Priority: Major
>
> Memory codecs (MemoryPostings*, MemoryDocValues*) are part of random 
> selection of codecs for tests and cause occasional OOMs when a test with huge 
> data is selected. We don't use those memory codecs anywhere outside of tests, 
> it has been suggested to just remove them to avoid maintenance costs and OOMs 
> in tests. [1]
> [1] https://apache.markmail.org/thread/mj53os2ekyldsoy3



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to