It would be nice to get MAHOUT-505 resolved, and I've been meaning to update MAHOUT-451 to use hadoop fs primitives instead of java.io.File. I'll see if I can make progress on the latter this week.
On Tue, Sep 21, 2010 at 2:03 PM, Ted Dunning <[email protected]> wrote: > I pushed a few that I knew about to 0.5. I put ticklers on others. If > nobody responds to the tickled bugs, I think we should > move them on Friday. > > On Tue, Sep 21, 2010 at 8:13 AM, Sebastian Schelter <[email protected] >> wrote: > >> I'm ready too, only thing I want to do is ensure that MAHOUT-493 works >> on ElasticMapReduce but that should be done until next week. >> >> --sebastian >> >> Am 21.09.2010 17:08, schrieb Sean Owen: >> > Agree, I'm ready. It's time to simply find a sensible point to draw a >> > line and put out 0.4. Anything else is simply moved along to 0.5. >> > >> > Can I suggest we leave until Friday to update all issues? Anything >> > left as 0.4 means you expect a patch is imminent next week to close >> > the issue. Anything else, simply mark as 0.5. Here's what's open: >> > >> > >> > >> > T Key Summary Pr Status Updated Created >> > MAHOUT-227 Parallel SVM Open 10/Feb/10 >> 20/Dec/09 >> > MAHOUT-279 Make RandomSeedGenerator a M/R Job >> Open 14/Feb/10 07/Feb/10 >> > MAHOUT-293 Add more tunable parameters to PFPGrowth >> implementation >> > Open 15/Feb/10 15/Feb/10 >> > MAHOUT-303 Exhaustive Tests for Vector implementations >> Open >> > 20/Feb/10 20/Feb/10 >> > MAHOUT-306 Profile and improve performance of algorithms based >> on >> > vectors Open 22/Feb/10 22/Feb/10 >> > MAHOUT-309 Implement Stochastic Decomposition >> Open 24/Feb/10 24/Feb/10 >> > MAHOUT-319 SVD solvers should be gracefully >> stoppable/restartable >> > Open 01/Mar/10 01/Mar/10 >> > MAHOUT-369 Issues with DistributedLanczosSolver output >> Open >> > 25/Apr/10 07/Apr/10 >> > MAHOUT-397 SparseVectorsFromSequenceFiles only outputs a >> single >> > vector file Patch Available 19/May/10 19/May/10 >> > MAHOUT-153 Implement kmeans++ for initial cluster selection in >> > kmeans Open 27/May/10 27/Jul/09 >> > MAHOUT-376 Implement Map-reduce version of stochastic SVD >> Open >> > 08/May/10 11/Apr/10 >> > MAHOUT-414 Usability: Mahout applications need a consistent >> API to >> > allow users to specify desired map/reduce concurrency Open >> > 13/Jun/10 13/Jun/10 >> > MAHOUT-419 Convert decomposer code to Hadoop 0.20 API >> Open >> > 20/Jun/10 20/Jun/10 >> > MAHOUT-308 Improve Lanczos to handle extremely large feature >> sets >> > (without hashing) Patch Available 30/Jun/10 >> 24/Feb/10 >> > MAHOUT-401 Use NamedVector in seq2sparse Reopened >> 02/Jul/10 27/May/10 >> > MAHOUT-232 Implementation of sequential SVM solver based on >> > Pegasos Patch Available 06/Jul/10 27/Dec/09 >> > MAHOUT-287 Bayes Classifier should use Vector as input >> Open >> > 10/Feb/10 10/Feb/10 >> > MAHOUT-167 Convert code to Hadoop 0.20 API Open >> 24/Jul/10 28/Aug/09 >> > MAHOUT-458 The LDA output does not include the >> topic-probability >> > distribution per document (p(z|d)). It outputs only the topics and >> > corresponding words. Open 09/Aug/10 06/Aug/10 >> > MAHOUT-344 Minhash based clustering Patch >> Available 10/Aug/10 22/Mar/10 >> > MAHOUT-334 Proposal for GSoC2010 (Linear SVM for Mahout) >> Patch >> > Available 14/Aug/10 12/Mar/10 >> > MAHOUT-467 Change Iterable<Cooccurrence> in >> > >> org.apache.mahout.math.hadoop.similarity.RowSimilarityJob.SimilarityReducer >> > to list or array to improve the performance Open 18/Aug/10 >> > 12/Aug/10 >> > MAHOUT-483 Job RowSimilarityJob-Mapper-EntriesToVectorsReducer >> > improvement Open 18/Aug/10 18/Aug/10 >> > MAHOUT-495 Undeprecate Normal and Exponential distributions >> Open >> > 04/Sep/10 31/Aug/10 >> > MAHOUT-294 Uniform API behavior for Jobs Open >> 14/Sep/10 16/Feb/10 >> > MAHOUT-214 Implement Stacked RBM Open 08/Feb/10 >> 08/Dec/09 >> > MAHOUT-155 ARFF VectorIterable Open 07/Feb/10 >> 01/Aug/09 >> > MAHOUT-274 Use avro for serialization of structured documents. >> > Open 30/Mar/10 05/Feb/10 >> > MAHOUT-379 SequentialAccessSparseVector.equals does not agree >> with >> > AbstractVector.equivalent Reopened 01/Aug/10 >> 14/Apr/10 >> > MAHOUT-459 Reading an Index from Lucene/Solr 4.0-dev >> Open >> > 06/Aug/10 06/Aug/10 >> > MAHOUT-471 RowSimilarityJob-Mapper-EntriesToVectorsReducer >> failure >> > Open 17/Aug/10 12/Aug/10 >> > MAHOUT-480 Replace manual precondition checking with >> Precondition >> > utility class from Guava Open 18/Aug/10 13/Aug/10 >> > MAHOUT-396 Proposal for Implementing Hidden Markov Model >> Patch >> > Available 17/Sep/10 16/May/10 >> > MAHOUT-271 Make WikipediaDatasetCreatorMapper fuzzy category >> match >> > respect word boundaries Open 08/Feb/10 28/Jan/10 >> > >> > >> > >> > On Tue, Sep 21, 2010 at 3:50 PM, Jeff Eastman >> > <[email protected]> wrote: >> > >> >> We've been thinking of a September-October timeframe for the 0.4 >> release >> >> but I still see some major work items in the 34 Jira issues targeted at >> this >> >> release. As an Agile practitioner it seems to me we need to push most of >> >> these into 0.5 if we are going to hold this schedule. How about we >> triage >> >> this list again and shoot for feature freeze at the end of the month? >> >> >> >> >> >> >
