hi,all

i got some error when generate product similarity according to rating file, and 
there is about 250,000 recordes in rating file.

it works when there is only 10,000 recordes in rating file.


do you have some suggestion? any help is appreciated

thanks in advance.


following is code and log:


 File file = new File(ratingFile);
 logger.log(Level.INFO, "begin to load rating file...");
 FileDataModel model = new FileDataModel(file);
 logger.log(Level.INFO, "load rating file OK.");
ItemSimilarity pearson = new LogLikelihoodSimilarity(model);
GenericItemSimilarity gif = new GenericItemSimilarity(pearson,model);



INFO: load rating file OK.
- Reading file info...
- Processed 100000 lines
- Processed 200000 lines
Exception in thread "Thread-9" java.lang.OutOfMemoryError: GC overhead limit 
exceeded
at org.apache.mahout.cf.taste.impl.common.FastSet.<init>(FastSet.java:74)
at 
org.apache.mahout.cf.taste.impl.model.GenericDataModel.getNumUsersWithPreferenceFor(GenericDataModel.java:195)
at 
org.apache.mahout.cf.taste.impl.model.file.FileDataModel.getNumUsersWithPreferenceFor(FileDataModel.java:314)
at 
org.apache.mahout.cf.taste.impl.similarity.LogLikelihoodSimilarity.itemSimilarity(LogLikelihoodSimilarity.java:48)
at 
org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity$DataModelSimilaritiesIterator.next(GenericItemSimilarity.java:291)
at 
org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity$DataModelSimilaritiesIterator.next(GenericItemSimilarity.java:260)
at 
org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity.initSimilarityMaps(GenericItemSimilarity.java:128)
at 
org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity.<init>(GenericItemSimilarity.java:103)

2009-11-09 



cumtyjh 

Reply via email to