These are all complete with unit tests and are ready to go: Implement Boosting <https://issues.apache.org/jira/browse/MAHOUT-716> Implement Gradient machine<https://issues.apache.org/jira/browse/MAHOUT-703> Implement Online Passive Aggressive learner< https://issues.apache.org/jira/browse/MAHOUT-702>
On Fri, Jun 3, 2011 at 8:41 AM, Shannon Quinn <[email protected]> wrote: > 537 and 524/712 here, along with any others I can help with. > > On Fri, Jun 3, 2011 at 11:37 AM, Dhruv Kumar <[email protected]> wrote: > > > Actively working on this and should be ready for 0.6: > > > > Baum-Welch Algorithm on Map-Reduce for Parallel Hidden Markov Model > > Training. <https://issues.apache.org/jira/browse/MAHOUT-627> > > > > > > > > On Fri, Jun 3, 2011 at 5:20 AM, Sean Owen <[email protected]> wrote: > > > > > 0.5 is out the door, and we already have 40 issues tagged for 0.6 -- > > quite > > > a > > > bit of work to do! I see Sebastian and I are already cracking through > > > several of them. > > > Committer activity is still not quite keeping up with the backlog. It's > > an > > > important time, and a time for those who can move issues to resolution > or > > > close them to do so. Stay involved, stay relevant, especially those of > > you > > > who have been quiet. Because we need more "bandwidth" to begin dealing > > with > > > the meta-issues: docs, packaging, roadmap, etc. > > > > > > Here is the current list, to inspire you: > > > > > > > > > KeySummaryStatusCreated < > > https://issues.apache.org/jira/browse/MAHOUT-609> > > > MAHOUT-609 <https://issues.apache.org/jira/browse/MAHOUT-609> > > > > > > Add an option to make RecommenderJob write out it's computed item > > > similarities <https://issues.apache.org/jira/browse/MAHOUT-609> > > > [image: Open] Open08/Feb/11< > > > https://issues.apache.org/jira/browse/MAHOUT-627> > > > MAHOUT-627 <https://issues.apache.org/jira/browse/MAHOUT-627> > > > > > > Baum-Welch Algorithm on Map-Reduce for Parallel Hidden Markov Model > > > Training. <https://issues.apache.org/jira/browse/MAHOUT-627> > > > [image: Open] Open17/Mar/11< > > > https://issues.apache.org/jira/browse/MAHOUT-537> > > > MAHOUT-537 <https://issues.apache.org/jira/browse/MAHOUT-537> > > > > > > Bring DistributedRowMatrix into compliance with Hadoop > > > 0.20.2<https://issues.apache.org/jira/browse/MAHOUT-537> > > > [image: Open] Open03/Nov/10< > > > https://issues.apache.org/jira/browse/MAHOUT-546> > > > MAHOUT-546 <https://issues.apache.org/jira/browse/MAHOUT-546> > > > > > > Bug creating vector from Solr index with > > > TrieFields<https://issues.apache.org/jira/browse/MAHOUT-546> > > > [image: Open] Open16/Nov/10< > > > https://issues.apache.org/jira/browse/MAHOUT-714> > > > MAHOUT-714 <https://issues.apache.org/jira/browse/MAHOUT-714> > > > > > > CollocDriver not runnable with ToolRunner due to private > > > Constructor<https://issues.apache.org/jira/browse/MAHOUT-714> > > > [image: Patch Available] Patch > > > Available29/May/11<https://issues.apache.org/jira/browse/MAHOUT-696> > > > MAHOUT-696 <https://issues.apache.org/jira/browse/MAHOUT-696> > > > > > > Command line program for > > > AdaptiveLogiscticRegression< > > > https://issues.apache.org/jira/browse/MAHOUT-696> > > > [image: Open] Open15/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-712> > > > MAHOUT-712 <https://issues.apache.org/jira/browse/MAHOUT-712> > > > > > > DisplaySpectralKMeans Example Surfaces FileNotFoundException in > > > DistributedRowMatrix.times() > > > Usage/Implementation<https://issues.apache.org/jira/browse/MAHOUT-712> > > > [image: Open] Open25/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-524> > > > MAHOUT-524 <https://issues.apache.org/jira/browse/MAHOUT-524> > > > > > > DisplaySpectralKMeans example > > > fails<https://issues.apache.org/jira/browse/MAHOUT-524> > > > [image: Open] Open12/Oct/10< > > > https://issues.apache.org/jira/browse/MAHOUT-598> > > > MAHOUT-598 <https://issues.apache.org/jira/browse/MAHOUT-598> > > > > > > Downstream steps in the seq2sparse job flow looking in wrong location > for > > > output from previous steps when running in Elastic MapReduce (EMR) > > > cluster<https://issues.apache.org/jira/browse/MAHOUT-598> > > > [image: Open] Open27/Jan/11< > > > https://issues.apache.org/jira/browse/MAHOUT-629> > > > MAHOUT-629 <https://issues.apache.org/jira/browse/MAHOUT-629> > > > > > > FP Growth performance > > > improvement<https://issues.apache.org/jira/browse/MAHOUT-629> > > > [image: Open] Open21/Mar/11< > > > https://issues.apache.org/jira/browse/MAHOUT-709> > > > MAHOUT-709 <https://issues.apache.org/jira/browse/MAHOUT-709> > > > > > > FP-Growth Redundant patterns< > > > https://issues.apache.org/jira/browse/MAHOUT-709> > > > [image: Open] Open22/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-688> > > > MAHOUT-688 <https://issues.apache.org/jira/browse/MAHOUT-688> > > > > > > High Document Frequency pruning for > > > seq2sparse<https://issues.apache.org/jira/browse/MAHOUT-688> > > > [image: Open] Open05/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-716> > > > MAHOUT-716 <https://issues.apache.org/jira/browse/MAHOUT-716> > > > > > > Implement Boosting <https://issues.apache.org/jira/browse/MAHOUT-716> > > > [image: Patch Available] Patch > > > Available01/Jun/11<https://issues.apache.org/jira/browse/MAHOUT-703> > > > MAHOUT-703 <https://issues.apache.org/jira/browse/MAHOUT-703> > > > > > > Implement Gradient machine< > > > https://issues.apache.org/jira/browse/MAHOUT-703> > > > [image: Patch Available] Patch > > > Available19/May/11<https://issues.apache.org/jira/browse/MAHOUT-499> > > > MAHOUT-499 <https://issues.apache.org/jira/browse/MAHOUT-499> > > > > > > Implement LSMR in-memory < > > https://issues.apache.org/jira/browse/MAHOUT-499 > > > > > > > [image: Open] Open09/Sep/10< > > > https://issues.apache.org/jira/browse/MAHOUT-525> > > > MAHOUT-525 <https://issues.apache.org/jira/browse/MAHOUT-525> > > > > > > Implement LatentFactorLogLinear > > > models<https://issues.apache.org/jira/browse/MAHOUT-525> > > > [image: Open] Open14/Oct/10< > > > https://issues.apache.org/jira/browse/MAHOUT-702> > > > MAHOUT-702 <https://issues.apache.org/jira/browse/MAHOUT-702> > > > > > > Implement Online Passive Aggressive > > > learner<https://issues.apache.org/jira/browse/MAHOUT-702> > > > [image: Patch Available] Patch > > > Available18/May/11<https://issues.apache.org/jira/browse/MAHOUT-384> > > > MAHOUT-384 <https://issues.apache.org/jira/browse/MAHOUT-384> > > > > > > Implement of AVF algorithm< > > > https://issues.apache.org/jira/browse/MAHOUT-384> > > > [image: Open] Open22/Apr/10< > > > https://issues.apache.org/jira/browse/MAHOUT-672> > > > MAHOUT-672 <https://issues.apache.org/jira/browse/MAHOUT-672> > > > > > > Implementation of Conjugate Gradient for solving large linear > > > systems<https://issues.apache.org/jira/browse/MAHOUT-672> > > > [image: Patch Available] Patch > > > Available16/Apr/11<https://issues.apache.org/jira/browse/MAHOUT-487> > > > MAHOUT-487 <https://issues.apache.org/jira/browse/MAHOUT-487> > > > > > > Issues with memory use and inconsistent or state-influenced results > when > > > using CBayesAlgorithm < > https://issues.apache.org/jira/browse/MAHOUT-487> > > > [image: Open] Open24/Aug/10< > > > https://issues.apache.org/jira/browse/MAHOUT-597> > > > MAHOUT-597 <https://issues.apache.org/jira/browse/MAHOUT-597> > > > > > > Kernels in Mean Shift < > https://issues.apache.org/jira/browse/MAHOUT-597> > > > [image: Open] Open27/Jan/11< > > > https://issues.apache.org/jira/browse/MAHOUT-399> > > > MAHOUT-399 <https://issues.apache.org/jira/browse/MAHOUT-399> > > > > > > LDA on Mahout 0.3 does not converge to correct solution for overlapping > > > pyramids toy problem. < > https://issues.apache.org/jira/browse/MAHOUT-399> > > > [image: Open] Open24/May/10< > > > https://issues.apache.org/jira/browse/MAHOUT-690> > > > MAHOUT-690 <https://issues.apache.org/jira/browse/MAHOUT-690> > > > > > > LanczosSolver tests take forever. No > > > fun.<https://issues.apache.org/jira/browse/MAHOUT-690> > > > [image: Open] Open06/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-415> > > > MAHOUT-415 <https://issues.apache.org/jira/browse/MAHOUT-415> > > > > > > Lucene filter for Collocations< > > > https://issues.apache.org/jira/browse/MAHOUT-415> > > > [image: Open] Open14/Jun/10< > > > https://issues.apache.org/jira/browse/MAHOUT-705> > > > MAHOUT-705 <https://issues.apache.org/jira/browse/MAHOUT-705> > > > > > > MongoDB DataModel support < > > > https://issues.apache.org/jira/browse/MAHOUT-705> > > > [image: Open] Open20/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-678> > > > MAHOUT-678 <https://issues.apache.org/jira/browse/MAHOUT-678> > > > > > > NullPointerException while using MixedGradient with SGD > > > algorithm<https://issues.apache.org/jira/browse/MAHOUT-678> > > > [image: Open] Open22/Apr/11< > > > https://issues.apache.org/jira/browse/MAHOUT-692> > > > MAHOUT-692 <https://issues.apache.org/jira/browse/MAHOUT-692> > > > > > > OnlineSummarizer does not tolerate fewer than 100 > > > samples<https://issues.apache.org/jira/browse/MAHOUT-692> > > > [image: Open] Open10/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-695> > > > MAHOUT-695 <https://issues.apache.org/jira/browse/MAHOUT-695> > > > > > > Option to determine number of words for LDADriver from a specified > > > dictionary <https://issues.apache.org/jira/browse/MAHOUT-695> > > > [image: Open] Open13/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-632> > > > MAHOUT-632 <https://issues.apache.org/jira/browse/MAHOUT-632> > > > > > > PFPGrowth : Exceeded max jobconf > > > size<https://issues.apache.org/jira/browse/MAHOUT-632> > > > [image: Patch Available] Patch > > > Available22/Mar/11<https://issues.apache.org/jira/browse/MAHOUT-663> > > > MAHOUT-663 <https://issues.apache.org/jira/browse/MAHOUT-663> > > > > > > Rationalize hadoop job creation with respect to > > > setJarByClass<https://issues.apache.org/jira/browse/MAHOUT-663> > > > [image: Open] Open08/Apr/11< > > > https://issues.apache.org/jira/browse/MAHOUT-664> > > > MAHOUT-664 <https://issues.apache.org/jira/browse/MAHOUT-664> > > > > > > Remove usage of XStream string serialization > > > too?<https://issues.apache.org/jira/browse/MAHOUT-664> > > > [image: Open] Open10/Apr/11< > > > https://issues.apache.org/jira/browse/MAHOUT-719> > > > MAHOUT-719 <https://issues.apache.org/jira/browse/MAHOUT-719> > > > > > > Rename current runLogistic command line program to validateLogistic and > > let > > > runLogistic do predicting against new production > > > data<https://issues.apache.org/jira/browse/MAHOUT-719> > > > [image: Open] Open02/Jun/11< > > > https://issues.apache.org/jira/browse/MAHOUT-699> > > > MAHOUT-699 <https://issues.apache.org/jira/browse/MAHOUT-699> > > > > > > Rename taste-webapp module to integration; move integration code there > > from > > > examples <https://issues.apache.org/jira/browse/MAHOUT-699> > > > [image: Open] Open18/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-707> > > > MAHOUT-707 <https://issues.apache.org/jira/browse/MAHOUT-707> > > > > > > Setup Jenkins Jobs to validate our Examples/bin > > > Scripts<https://issues.apache.org/jira/browse/MAHOUT-707> > > > [image: Open] Open20/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-626> > > > MAHOUT-626 <https://issues.apache.org/jira/browse/MAHOUT-626> > > > > > > T1 and T2 Values in Canopy (& > > > MeanShift)<https://issues.apache.org/jira/browse/MAHOUT-626> > > > [image: Reopened] > > > Reopened13/Mar/11<https://issues.apache.org/jira/browse/MAHOUT-596> > > > MAHOUT-596 <https://issues.apache.org/jira/browse/MAHOUT-596> > > > > > > Testing if the weight assigned to points when calling the observe > method > > in > > > AbstractCluster incorrectly affect the number of points in a > > > cluster<https://issues.apache.org/jira/browse/MAHOUT-596> > > > [image: Open] Open27/Jan/11< > > > https://issues.apache.org/jira/browse/MAHOUT-294> > > > MAHOUT-294 <https://issues.apache.org/jira/browse/MAHOUT-294> > > > > > > Uniform API behavior for Jobs< > > > https://issues.apache.org/jira/browse/MAHOUT-294> > > > [image: Open] Open16/Feb/10< > > > https://issues.apache.org/jira/browse/MAHOUT-652> > > > MAHOUT-652 <https://issues.apache.org/jira/browse/MAHOUT-652> > > > > > > [GSoC Proposal] Parallel Viterbi algorithm for > > > HMM<https://issues.apache.org/jira/browse/MAHOUT-652> > > > [image: Open] Open06/Apr/11< > > > https://issues.apache.org/jira/browse/MAHOUT-711> > > > MAHOUT-711 <https://issues.apache.org/jira/browse/MAHOUT-711> > > > > > > outputs miss some right frequent > > > itemsets<https://issues.apache.org/jira/browse/MAHOUT-711> > > > [image: Open] Open25/May/11< > > > https://issues.apache.org/jira/browse/MAHOUT-706> > > > MAHOUT-706 <https://issues.apache.org/jira/browse/MAHOUT-706> > > > > > > reuse lucene tokenstreams < > > > https://issues.apache.org/jira/browse/MAHOUT-706> > > > [image: Open] Open20/May/11 > > > > > > -- Yee Yang Li Hector http://hectorgon.blogspot.com/ (tech + travel) http://hectorgon.com (book reviews)
