>> I'm having big troubles with nutch 0.9 that I hadn't with 0.8. It seems
>> that the plugin repository initializes itself all the timem until I get
>> an out of memory exception. I've been seeing the code... the plugin
>> repository mantains a map from Configuration to plugin repositories, but
>> the Configuration object does not have an equals or hashCode method...
>> wouldn't it be nice to add such a method (comparing property values)?
>> Wouldn't that help prevent initializing many plugin repositories? What
>> could be the cause to may problem? (Aaah.. so many questions... =) )
>
> Which job causes the problem? Perhaps, we can find out what keeps
> creating a conf object over and over.
>
> Also, I have tried what you have suggested (better caching for plugin
> repository) and it really seems to make a difference. Can you try with
> this patch(*) to see if it solves your problem?
>
> (*) http://www.ceng.metu.edu.tr/~e1345172/plugin_repository_cache.patch
I'm running it. So far it's working ok, and I haven't seen all those
plugin loadings...
I've modified your patch though to define CACHE like this:
private static final Map<PluginProperty, PluginRepository> CACHE =
new LinkedHashMap<PluginProperty, PluginRepository>() {
@Override
protected boolean removeEldestEntry(
Entry<PluginProperty, PluginRepository> eldest) {
return size() > 10;
}
};
...which means an LRU cache with a fixed size of 10.
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers