[jira] [Updated] (SOLR-3443) Optimize hunspell dictionary loading with multiple cores

Gus Heck (JIRA) Sun, 29 Nov 2015 09:36:43 -0800

     [ https://issues.apache.org/jira/browse/SOLR-3443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]


Gus Heck updated SOLR-3443:
---------------------------
    Attachment: Screen Shot 2015-11-29 at 9.52.06 AM.png

Screen shot of profiler showing single dictionary for multiple factories.

> Optimize hunspell dictionary loading with multiple cores
> --------------------------------------------------------
>
>                 Key: SOLR-3443
>                 URL: https://issues.apache.org/jira/browse/SOLR-3443
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Luca Cavanna
>         Attachments: SOLR-3443.patch, Screen Shot 2015-11-29 at 9.52.06 AM.png
>
>
> The Hunspell dictionary is loaded into memory. Each core that uses
> hunspell loads its own copy, even when all the cores use the same
> dictionary files. As a result, the same dictionary is loaded into memory
> multiple times, once per core. I think we should share those
> dictionaries between all cores to reduce memory usage. For example, if a
> dictionary takes 20MB in memory (this is what I measured), 20 cores
> will use 400MB for dictionaries alone, which doesn't seem like a good
> idea to me.
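
The sharing idea in the description could be sketched roughly as follows. This is only an illustration, not the attached patch: the class name SharedDictionaryCache, its methods, and the loader function are all hypothetical, and the dictionary type is left generic instead of using any Lucene class. It assumes that two cores pointing at the same affix and dictionary file paths can safely share one immutable in-memory dictionary.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

/**
 * Hypothetical process-wide cache: one loaded dictionary per distinct
 * set of file paths, shared by every core that asks for it.
 * D stands in for whatever object holds the parsed dictionary.
 */
final class SharedDictionaryCache<D> {
    // Key: affix path followed by the dictionary paths, in order.
    private final Map<List<String>, D> cache = new ConcurrentHashMap<>();
    // Assumed loader that parses the files named by the key.
    private final Function<List<String>, D> loader;

    SharedDictionaryCache(Function<List<String>, D> loader) {
        this.loader = loader;
    }

    /** Returns the shared instance, loading it only on the first request. */
    D get(String affixPath, String... dictionaryPaths) {
        List<String> key = new ArrayList<>();
        key.add(affixPath);
        key.addAll(Arrays.asList(dictionaryPaths));
        // computeIfAbsent runs the loader at most once per key,
        // even if several cores start up concurrently.
        return cache.computeIfAbsent(Collections.unmodifiableList(key), loader);
    }

    /** Number of distinct dictionaries currently held in memory. */
    int size() {
        return cache.size();
    }
}
```

With a cache along these lines, the 20 cores in the example above would hold one 20MB dictionary between them rather than 20 copies. A real implementation would also need to decide when an entry can be released (for example when the last core using it is closed) and whether file modification times should be part of the key; neither is handled in this sketch.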



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
