hi,all i got some error when generate product similarity according to rating file, and there is about 250,000 recordes in rating file.
it works when there is only 10,000 recordes in rating file. do you have some suggestion? any help is appreciated thanks in advance. following is code and log: File file = new File(ratingFile); logger.log(Level.INFO, "begin to load rating file..."); FileDataModel model = new FileDataModel(file); logger.log(Level.INFO, "load rating file OK."); ItemSimilarity pearson = new LogLikelihoodSimilarity(model); GenericItemSimilarity gif = new GenericItemSimilarity(pearson,model); INFO: load rating file OK. - Reading file info... - Processed 100000 lines - Processed 200000 lines Exception in thread "Thread-9" java.lang.OutOfMemoryError: GC overhead limit exceeded at org.apache.mahout.cf.taste.impl.common.FastSet.<init>(FastSet.java:74) at org.apache.mahout.cf.taste.impl.model.GenericDataModel.getNumUsersWithPreferenceFor(GenericDataModel.java:195) at org.apache.mahout.cf.taste.impl.model.file.FileDataModel.getNumUsersWithPreferenceFor(FileDataModel.java:314) at org.apache.mahout.cf.taste.impl.similarity.LogLikelihoodSimilarity.itemSimilarity(LogLikelihoodSimilarity.java:48) at org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity$DataModelSimilaritiesIterator.next(GenericItemSimilarity.java:291) at org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity$DataModelSimilaritiesIterator.next(GenericItemSimilarity.java:260) at org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity.initSimilarityMaps(GenericItemSimilarity.java:128) at org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity.<init>(GenericItemSimilarity.java:103) 2009-11-09 cumtyjh
