I checked whether MAHOUT-493 runs on Elastic MapReduce and everything worked fine.
So I can give my go for release 0.4 ;) --sebastian Am 21.09.2010 20:31, schrieb Drew Farris: > It would be nice to get MAHOUT-505 resolved, and I've been meaning to > update MAHOUT-451 to use hadoop fs primitives instead of java.io.File. > I'll see if I can make progress on the latter this week. > > On Tue, Sep 21, 2010 at 2:03 PM, Ted Dunning <[email protected]> wrote: > >> I pushed a few that I knew about to 0.5. I put ticklers on others. If >> nobody responds to the tickled bugs, I think we should >> move them on Friday. >> >> On Tue, Sep 21, 2010 at 8:13 AM, Sebastian Schelter <[email protected] >> >>> wrote: >>> >> >>> I'm ready too, only thing I want to do is ensure that MAHOUT-493 works >>> on ElasticMapReduce but that should be done until next week. >>> >>> --sebastian >>> >>> Am 21.09.2010 17:08, schrieb Sean Owen: >>> >>>> Agree, I'm ready. It's time to simply find a sensible point to draw a >>>> line and put out 0.4. Anything else is simply moved along to 0.5. >>>> >>>> Can I suggest we leave until Friday to update all issues? Anything >>>> left as 0.4 means you expect a patch is imminent next week to close >>>> the issue. Anything else, simply mark as 0.5. Here's what's open: >>>> >>>> >>>> >>>> T Key Summary Pr Status Updated Created >>>> MAHOUT-227 Parallel SVM Open 10/Feb/10 >>>> >>> 20/Dec/09 >>> >>>> MAHOUT-279 Make RandomSeedGenerator a M/R Job >>>> >>> Open 14/Feb/10 07/Feb/10 >>> >>>> MAHOUT-293 Add more tunable parameters to PFPGrowth >>>> >>> implementation >>> >>>> Open 15/Feb/10 15/Feb/10 >>>> MAHOUT-303 Exhaustive Tests for Vector implementations >>>> >>> Open >>> >>>> 20/Feb/10 20/Feb/10 >>>> MAHOUT-306 Profile and improve performance of algorithms based >>>> >>> on >>> >>>> vectors Open 22/Feb/10 22/Feb/10 >>>> MAHOUT-309 Implement Stochastic Decomposition >>>> >>> Open 24/Feb/10 24/Feb/10 >>> >>>> MAHOUT-319 SVD solvers should be gracefully >>>> >>> stoppable/restartable >>> >>>> Open 01/Mar/10 01/Mar/10 >>>> MAHOUT-369 Issues with DistributedLanczosSolver output >>>> >>> Open >>> >>>> 25/Apr/10 07/Apr/10 >>>> MAHOUT-397 SparseVectorsFromSequenceFiles only outputs a >>>> >>> single >>> >>>> vector file Patch Available 19/May/10 19/May/10 >>>> MAHOUT-153 Implement kmeans++ for initial cluster selection in >>>> kmeans Open 27/May/10 27/Jul/09 >>>> MAHOUT-376 Implement Map-reduce version of stochastic SVD >>>> >>> Open >>> >>>> 08/May/10 11/Apr/10 >>>> MAHOUT-414 Usability: Mahout applications need a consistent >>>> >>> API to >>> >>>> allow users to specify desired map/reduce concurrency Open >>>> 13/Jun/10 13/Jun/10 >>>> MAHOUT-419 Convert decomposer code to Hadoop 0.20 API >>>> >>> Open >>> >>>> 20/Jun/10 20/Jun/10 >>>> MAHOUT-308 Improve Lanczos to handle extremely large feature >>>> >>> sets >>> >>>> (without hashing) Patch Available 30/Jun/10 >>>> >>> 24/Feb/10 >>> >>>> MAHOUT-401 Use NamedVector in seq2sparse Reopened >>>> >>> 02/Jul/10 27/May/10 >>> >>>> MAHOUT-232 Implementation of sequential SVM solver based on >>>> Pegasos Patch Available 06/Jul/10 27/Dec/09 >>>> MAHOUT-287 Bayes Classifier should use Vector as input >>>> >>> Open >>> >>>> 10/Feb/10 10/Feb/10 >>>> MAHOUT-167 Convert code to Hadoop 0.20 API Open >>>> >>> 24/Jul/10 28/Aug/09 >>> >>>> MAHOUT-458 The LDA output does not include the >>>> >>> topic-probability >>> >>>> distribution per document (p(z|d)). It outputs only the topics and >>>> corresponding words. Open 09/Aug/10 06/Aug/10 >>>> MAHOUT-344 Minhash based clustering Patch >>>> >>> Available 10/Aug/10 22/Mar/10 >>> >>>> MAHOUT-334 Proposal for GSoC2010 (Linear SVM for Mahout) >>>> >>> Patch >>> >>>> Available 14/Aug/10 12/Mar/10 >>>> MAHOUT-467 Change Iterable<Cooccurrence> in >>>> >>>> >>> org.apache.mahout.math.hadoop.similarity.RowSimilarityJob.SimilarityReducer >>> >>>> to list or array to improve the performance Open 18/Aug/10 >>>> 12/Aug/10 >>>> MAHOUT-483 Job RowSimilarityJob-Mapper-EntriesToVectorsReducer >>>> improvement Open 18/Aug/10 18/Aug/10 >>>> MAHOUT-495 Undeprecate Normal and Exponential distributions >>>> >>> Open >>> >>>> 04/Sep/10 31/Aug/10 >>>> MAHOUT-294 Uniform API behavior for Jobs Open >>>> >>> 14/Sep/10 16/Feb/10 >>> >>>> MAHOUT-214 Implement Stacked RBM Open 08/Feb/10 >>>> >>> 08/Dec/09 >>> >>>> MAHOUT-155 ARFF VectorIterable Open 07/Feb/10 >>>> >>> 01/Aug/09 >>> >>>> MAHOUT-274 Use avro for serialization of structured documents. >>>> Open 30/Mar/10 05/Feb/10 >>>> MAHOUT-379 SequentialAccessSparseVector.equals does not agree >>>> >>> with >>> >>>> AbstractVector.equivalent Reopened 01/Aug/10 >>>> >>> 14/Apr/10 >>> >>>> MAHOUT-459 Reading an Index from Lucene/Solr 4.0-dev >>>> >>> Open >>> >>>> 06/Aug/10 06/Aug/10 >>>> MAHOUT-471 RowSimilarityJob-Mapper-EntriesToVectorsReducer >>>> >>> failure >>> >>>> Open 17/Aug/10 12/Aug/10 >>>> MAHOUT-480 Replace manual precondition checking with >>>> >>> Precondition >>> >>>> utility class from Guava Open 18/Aug/10 13/Aug/10 >>>> MAHOUT-396 Proposal for Implementing Hidden Markov Model >>>> >>> Patch >>> >>>> Available 17/Sep/10 16/May/10 >>>> MAHOUT-271 Make WikipediaDatasetCreatorMapper fuzzy category >>>> >>> match >>> >>>> respect word boundaries Open 08/Feb/10 28/Jan/10 >>>> >>>> >>>> >>>> On Tue, Sep 21, 2010 at 3:50 PM, Jeff Eastman >>>> <[email protected]> wrote: >>>> >>>> >>>>> We've been thinking of a September-October timeframe for the 0.4 >>>>> >>> release >>> >>>>> but I still see some major work items in the 34 Jira issues targeted at >>>>> >>> this >>> >>>>> release. As an Agile practitioner it seems to me we need to push most of >>>>> these into 0.5 if we are going to hold this schedule. How about we >>>>> >>> triage >>> >>>>> this list again and shoot for feature freeze at the end of the month? >>>>> >>>>> >>>>> >>> >>> >>
