>> I'm having big troubles with nutch 0.9 that I hadn't with 0.8. It seems
>> that the plugin repository initializes itself all the timem until I get
>> an out of memory exception. I've been seeing the code... the plugin
>> repository mantains a map from Configuration to plugin repositories, but
>> the Configuration object does not have an equals or hashCode method...
>> wouldn't it be nice to add such a method (comparing property values)?
>> Wouldn't that help prevent initializing many plugin repositories? What
>> could be the cause to may problem? (Aaah.. so many questions... =) )
>
> Which job causes the problem? Perhaps, we can find out what keeps
> creating a conf object over and over.
>
> Also, I have tried what you have suggested (better caching for plugin
> repository) and it really seems to make a difference. Can you try with
> this patch(*) to see if it solves your problem?
>
> (*) http://www.ceng.metu.edu.tr/~e1345172/plugin_repository_cache.patch

I'm running it. So far it's working ok, and I haven't seen all those 
plugin loadings...

I've modified your patch though to define CACHE like this:

  private static final Map<PluginProperty, PluginRepository> CACHE =
      new LinkedHashMap<PluginProperty, PluginRepository>() {
    @Override
    protected boolean removeEldestEntry(
        Entry<PluginProperty, PluginRepository> eldest) {
          return size() > 10;
    }
  };

...which means an LRU cache with a fixed size of 10.


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to