Re: Commercializing Mahout: the Myrrix recommender platform

2012-04-06 Thread Grant Ingersoll
It's Apache licensed. People can use it pretty much however they want per the terms in the license. There is no obligation for anyone to contribute back anything unless they so choose and no one should expect anyone else to do so. It would be great if they do contribute back, but not

[jira] [Updated] (MAHOUT-997) Make splitData smart enough to not consider a CSV header to be part of the data

2012-04-06 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-997: - Fix Version/s: (was: 0.6) Make splitData smart enough to not consider a CSV header to be part

[jira] [Resolved] (MAHOUT-973) SparseVectorsFromSequenceFiles will not create a proper TFIDF (bug in TFIDFPartialVectorReducer)

2012-04-06 Thread Sean Owen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-973. -- Resolution: Fixed Assignee: Sean Owen (was: Grant Ingersoll) Grant never heard back here but I

Build failed in Jenkins: Mahout-Quality #1426

2012-04-06 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/1426/changes Changes: [srowen] MAHOUT-973 fix treatment of value as percentage -- [...truncated 34367 lines...] Running org.apache.mahout.cf.taste.impl.similarity.EuclideanDistanceSimilarityTest Tests run:

[jira] [Commented] (MAHOUT-973) SparseVectorsFromSequenceFiles will not create a proper TFIDF (bug in TFIDFPartialVectorReducer)

2012-04-06 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13248382#comment-13248382 ] Hudson commented on MAHOUT-973: --- Integrated in Mahout-Quality #1426 (See

Jenkins build is unstable: Mahout-Quality #1427

2012-04-06 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/1427/changes

[jira] [Commented] (MAHOUT-973) SparseVectorsFromSequenceFiles will not create a proper TFIDF (bug in TFIDFPartialVectorReducer)

2012-04-06 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13248501#comment-13248501 ] Hudson commented on MAHOUT-973: --- Integrated in Mahout-Quality #1427 (See

Build failed in Jenkins: Mahout-Examples-Cluster-Reuters #94

2012-04-06 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Examples-Cluster-Reuters/94/changes Changes: [srowen] Allow choice of load factor in custom maps [srowen] MAHOUT-973 one more file needed for fix to compute maxDF as a percent of total count [srowen] MAHOUT-973 fix treatment of value as percentage

Re: some new clustering code

2012-04-06 Thread Ted Dunning
On Fri, Apr 6, 2012 at 12:48 PM, Federico Castanedo castanedof...@gmail.com wrote: ... The only difference I notice between Kmeans and StreamingKmeans class is the dynamic increment of maxClusters and the distanceCutoff test. So, i execute the KMeans class against a subset of the BigCross

Jenkins build is still unstable: Mahout-Quality #1428

2012-04-06 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/changes