Re: GSOC 2012
On Sun, Mar 18, 2012 at 9:38 PM, Shannon Quinn squ...@gatech.edu wrote: I'd love to toss my name into the hat for this summer's mentors. That makes for one mentor already - awesome. Anyone else interested in helping students find their way in our project? For those of you not too familiar with GSoC at Apache http://community.apache.org/gsoc.html ... lists detailed information for mentors and students including which lists to subscribe to and a list of important deadlines. Isabel
Re: GSOC 2012
What could be possible projects for this year? Or should the student provide a proposal for an improvement? I would probably be interested in working on a project for this years GSOC. I am also currently in contact with dbpedia spotlight developers for GSOC. They are interested in a topical classifier for a given plaintext. I saw, that something similar was already implemented in mahout. 2012/3/19 Isabel Drost isa...@apache.org On Sun, Mar 18, 2012 at 9:38 PM, Shannon Quinn squ...@gatech.edu wrote: I'd love to toss my name into the hat for this summer's mentors. That makes for one mentor already - awesome. Anyone else interested in helping students find their way in our project? For those of you not too familiar with GSoC at Apache http://community.apache.org/gsoc.html ... lists detailed information for mentors and students including which lists to subscribe to and a list of important deadlines. Isabel
Re: GSOC 2012
On 19.03.2012 Dirk Weissenborn wrote: What could be possible projects for this year? General advise would be to look for open JIRA issues. There are a few known issues with existing implementations. Also for some algorithms integration and API design could be improved. In the end the general advise on how to contribute and become a committer applies: Find a topic that you are yourself interested in and work on that. Or should the student provide a proposal for an improvement? From previous experience students who came up with their own proposals have been most successful - both in applying but also in succeeding. Keep in mind to limit the scope of your project to a manageable size: You can only design, test, implement, document and provide examples for so much code. I would probably be interested in working on a project for this years GSOC. I am also currently in contact with dbpedia spotlight developers for GSOC. They are interested in a topical classifier for a given plaintext. I saw, that something similar was already implemented in mahout. I think working on the intersection of two projects could be valuable for both sides. Sounds like an interesting idea to me. Isabel signature.asc Description: This is a digitally signed message part.
Re: Build failed in Jenkins: Mahout-Examples-Cluster-Reuters #74
On 17.03.2012 Paritosh Ranjan wrote: Is there any way to test this build before commit? The trunk is building successfully and till now, that's all I check before commit. How do I test this build before commit? As far as I know all you have to do is to use the same commands that Jenkins uses to trigger the build*. Don't know at the top of my head which options it uses, you should be able to find out by going to Jenkins, selecting our build and clicking configure (you need to be logged into Jenkins for that) or selecting the last build that failed and looking at its console output. From just briefly scanning the output it looks like after successfully triggering the maven build it fails when executing the cluster_reuters.sh script. Isabel * Obvious other culprits for Jenkins behaving differently than you local box are different network settings, different maven versions, different settings.xml, different java version, screwed up local maven repositories on either side and such - however I don't think neither of those is particularly likely in this case. signature.asc Description: This is a digitally signed message part.
[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13232997#comment-13232997 ] Saikat Kanjilal commented on MAHOUT-984: Never mind, figured it out, will be committing patch in the next few days. Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning --- Key: MAHOUT-984 URL: https://issues.apache.org/jira/browse/MAHOUT-984 Project: Mahout Issue Type: Sub-task Components: Clustering Affects Versions: 0.6 Reporter: Paritosh Ranjan Assignee: Paritosh Ranjan Labels: clustering Fix For: 0.7 Use ClusterClassificationDriver to refactor clustering out of FuzzyKMeansDriver with outlier pruning support. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
Build failed in Jenkins: Mahout-Quality #1403
See https://builds.apache.org/job/Mahout-Quality/1403/ -- [...truncated 150 lines...] A integration/src/main/java/org/apache/mahout/utils/email/MailProcessor.java A integration/src/main/java/org/apache/mahout/utils/email/MailOptions.java A integration/src/main/java/org/apache/mahout/utils/io A integration/src/main/java/org/apache/mahout/utils/io/WrappedWriter.java A integration/src/main/java/org/apache/mahout/utils/io/ChunkedWrapper.java A integration/src/main/java/org/apache/mahout/utils/io/IOWriterWrapper.java A integration/src/main/java/org/apache/mahout/utils/io/ChunkedWriter.java A integration/src/main/java/org/apache/mahout/utils/nlp A integration/src/main/java/org/apache/mahout/utils/nlp/collocations A integration/src/main/java/org/apache/mahout/utils/nlp/collocations/llr A integration/src/main/java/org/apache/mahout/utils/nlp/collocations/llr/BloomTokenFilter.java A integration/src/main/java/org/apache/mahout/utils/regex A integration/src/main/java/org/apache/mahout/utils/regex/RegexMapper.java A integration/src/main/java/org/apache/mahout/utils/regex/FPGFormatter.java A integration/src/main/java/org/apache/mahout/utils/regex/ChainTransformer.java A integration/src/main/java/org/apache/mahout/utils/regex/AnalyzerTransformer.java A integration/src/main/java/org/apache/mahout/utils/regex/URLDecodeTransformer.java A integration/src/main/java/org/apache/mahout/utils/regex/RegexUtils.java A integration/src/main/java/org/apache/mahout/utils/regex/IdentityTransformer.java A integration/src/main/java/org/apache/mahout/utils/regex/IdentityFormatter.java A integration/src/main/java/org/apache/mahout/utils/regex/RegexTransformer.java A integration/src/main/java/org/apache/mahout/utils/regex/RegexConverterDriver.java A integration/src/main/java/org/apache/mahout/utils/regex/RegexFormatter.java A integration/src/main/java/org/apache/mahout/utils/Bump125.java A integration/src/main/java/org/apache/mahout/utils/SplitInput.java A integration/src/main/java/org/apache/mahout/classifier A integration/src/main/java/org/apache/mahout/classifier/ConfusionMatrixDumper.java A integration/src/main/java/org/apache/mahout/text A integration/src/main/java/org/apache/mahout/text/PrefixAdditionFilter.java A integration/src/main/java/org/apache/mahout/text/TextParagraphSplittingJob.java A integration/src/main/java/org/apache/mahout/text/SequenceFilesFromDirectory.java AU integration/src/main/java/org/apache/mahout/text/SequenceFilesFromMailArchives.java A integration/src/main/java/org/apache/mahout/text/SequenceFilesFromDirectoryFilter.java AU integration/src/main/java/org/apache/mahout/text/MailArchivesClusteringAnalyzer.java A integration/src/main/java/org/apache/mahout/cf A integration/src/main/java/org/apache/mahout/cf/taste A integration/src/main/java/org/apache/mahout/cf/taste/impl A integration/src/main/java/org/apache/mahout/cf/taste/impl/model A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/cassandra A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/cassandra/CassandraDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/MySQLBooleanPrefJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/AbstractJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/PostgreSQLJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/MySQLJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/ConnectionPoolDataSource.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/SQL92BooleanPrefJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/ReloadFromJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/SQL92JDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/GenericJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/AbstractBooleanPrefJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/PostgreSQLBooleanPrefJDBCDataModel.java A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/mongodb A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/mongodb/MongoDBDataModel.java A
Re: Build failed in Jenkins: Mahout-Examples-Cluster-Reuters #74
I don't have the privilege to view/edit the Jenkins job configuration, so, I can not see the command from there. Somehow, I am not able to figure out the command from the console, can you help me here by telling the command ( for running the Mahout-Examples-Cluster-Reuters build )? Can you also help by clarifying the protocols on the builds? i.e. which builds to test before committing. How much time is allowed to fix failing builds ( how much time for which build ), or is there something like this? Is it needed to run/build the Mahout-Examples-Cluster-Reuters ( along with Mahout Quality i.e. mvn clean install on trunk ) before committing, and if yes, is it a common practice? Is this info about the builds documented somewhere? If the information about the builds is not documented anywhere, then I would will like to add it to the mahout wiki/site/somewhere i.e. which all builds do we have, and which one is needed for what purpose and other rules for them. If it is already documented, can you please share that link? I don't think Jenkins is behaving differently than my local box. Paritosh On 20-03-2012 02:51, Isabel Drost wrote: On 17.03.2012 Paritosh Ranjan wrote: Is there any way to test this build before commit? The trunk is building successfully and till now, that's all I check before commit. How do I test this build before commit? As far as I know all you have to do is to use the same commands that Jenkins uses to trigger the build*. Don't know at the top of my head which options it uses, you should be able to find out by going to Jenkins, selecting our build and clicking configure (you need to be logged into Jenkins for that) or selecting the last build that failed and looking at its console output. From just briefly scanning the output it looks like after successfully triggering the maven build it fails when executing the cluster_reuters.sh script. Isabel * Obvious other culprits for Jenkins behaving differently than you local box are different network settings, different maven versions, different settings.xml, different java version, screwed up local maven repositories on either side and such - however I don't think neither of those is particularly likely in this case.