Re: GSOC 2012

2012-03-19 Thread Isabel Drost
On Sun, Mar 18, 2012 at 9:38 PM, Shannon Quinn squ...@gatech.edu wrote:
 I'd love to toss my name into the hat for this summer's mentors.

That makes for one mentor already - awesome. Anyone else interested in
helping students find their way in our project?

For those of you not too familiar with GSoC at Apache
http://community.apache.org/gsoc.html ... lists detailed information
for mentors and students including which lists to subscribe to and a
list of important deadlines.

Isabel


Re: GSOC 2012

2012-03-19 Thread Dirk Weissenborn
What could be possible projects for this year? Or should the student
provide a proposal for an improvement? I would probably be interested in
working on a project for this years GSOC. I am also currently in contact
with dbpedia spotlight developers for GSOC. They are interested in a
topical classifier for a given plaintext. I saw, that something similar was
already implemented in mahout.

2012/3/19 Isabel Drost isa...@apache.org

 On Sun, Mar 18, 2012 at 9:38 PM, Shannon Quinn squ...@gatech.edu wrote:
  I'd love to toss my name into the hat for this summer's mentors.

 That makes for one mentor already - awesome. Anyone else interested in
 helping students find their way in our project?

 For those of you not too familiar with GSoC at Apache
 http://community.apache.org/gsoc.html ... lists detailed information
 for mentors and students including which lists to subscribe to and a
 list of important deadlines.

 Isabel



Re: GSOC 2012

2012-03-19 Thread Isabel Drost
On 19.03.2012 Dirk Weissenborn wrote:
 What could be possible projects for this year?

General advise would be to look for open JIRA issues. There are a few known 
issues with existing implementations. Also for some algorithms integration and 
API design could be improved. In the end the general advise on how to 
contribute 
and become a committer applies: Find a topic that you are yourself interested 
in 
and work on that.


 Or should the student
 provide a proposal for an improvement?

From previous experience students who came up with their own proposals have 
been 
most successful - both in applying but also in succeeding. Keep in mind to 
limit 
the scope of your project to a manageable size: You can only design, test, 
implement, document and provide examples for so much code.


 I would probably be interested in
 working on a project for this years GSOC. I am also currently in contact
 with dbpedia spotlight developers for GSOC. They are interested in a
 topical classifier for a given plaintext. I saw, that something similar was
 already implemented in mahout.

I think working on the intersection of two projects could be valuable for both 
sides. Sounds like an interesting idea to me.

Isabel


signature.asc
Description: This is a digitally signed message part.


Re: Build failed in Jenkins: Mahout-Examples-Cluster-Reuters #74

2012-03-19 Thread Isabel Drost
On 17.03.2012 Paritosh Ranjan wrote:
 Is there any way to test this build before commit? The trunk is building
 successfully and till now, that's all I check before commit. How do I
 test this build before commit?

As far as I know all you have to do is to use the same commands that Jenkins 
uses to trigger the build*. Don't know at the top of my head which options it 
uses, you should be able to find out by going to Jenkins, selecting our build 
and clicking configure (you need to be logged into Jenkins for that) or 
selecting the last build that failed and looking at its console output.

From just briefly scanning the output it looks like after successfully 
triggering the maven build it fails when executing the cluster_reuters.sh 
script.


Isabel

* Obvious other culprits for Jenkins behaving differently than you local box 
are 
different network settings, different maven versions, different settings.xml, 
different java version, screwed up local maven repositories on either side and 
such - however I don't think neither of those is particularly likely in this 
case.


signature.asc
Description: This is a digitally signed message part.


[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-19 Thread Saikat Kanjilal (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13232997#comment-13232997
 ] 

Saikat Kanjilal commented on MAHOUT-984:


Never mind, figured it out, will be committing patch in the next few days.

 Refactor Fuzzy K Means Clustering into a separate post process with outlier 
 pruning
 ---

 Key: MAHOUT-984
 URL: https://issues.apache.org/jira/browse/MAHOUT-984
 Project: Mahout
  Issue Type: Sub-task
  Components: Clustering
Affects Versions: 0.6
Reporter: Paritosh Ranjan
Assignee: Paritosh Ranjan
  Labels: clustering
 Fix For: 0.7


 Use ClusterClassificationDriver to refactor clustering out of 
 FuzzyKMeansDriver with outlier pruning support.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Build failed in Jenkins: Mahout-Quality #1403

2012-03-19 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/1403/

--
[...truncated 150 lines...]
A 
integration/src/main/java/org/apache/mahout/utils/email/MailProcessor.java
A 
integration/src/main/java/org/apache/mahout/utils/email/MailOptions.java
A integration/src/main/java/org/apache/mahout/utils/io
A 
integration/src/main/java/org/apache/mahout/utils/io/WrappedWriter.java
A 
integration/src/main/java/org/apache/mahout/utils/io/ChunkedWrapper.java
A 
integration/src/main/java/org/apache/mahout/utils/io/IOWriterWrapper.java
A 
integration/src/main/java/org/apache/mahout/utils/io/ChunkedWriter.java
A integration/src/main/java/org/apache/mahout/utils/nlp
A integration/src/main/java/org/apache/mahout/utils/nlp/collocations
A integration/src/main/java/org/apache/mahout/utils/nlp/collocations/llr
A 
integration/src/main/java/org/apache/mahout/utils/nlp/collocations/llr/BloomTokenFilter.java
A integration/src/main/java/org/apache/mahout/utils/regex
A 
integration/src/main/java/org/apache/mahout/utils/regex/RegexMapper.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/FPGFormatter.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/ChainTransformer.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/AnalyzerTransformer.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/URLDecodeTransformer.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/RegexUtils.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/IdentityTransformer.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/IdentityFormatter.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/RegexTransformer.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/RegexConverterDriver.java
A 
integration/src/main/java/org/apache/mahout/utils/regex/RegexFormatter.java
A integration/src/main/java/org/apache/mahout/utils/Bump125.java
A integration/src/main/java/org/apache/mahout/utils/SplitInput.java
A integration/src/main/java/org/apache/mahout/classifier
A 
integration/src/main/java/org/apache/mahout/classifier/ConfusionMatrixDumper.java
A integration/src/main/java/org/apache/mahout/text
A 
integration/src/main/java/org/apache/mahout/text/PrefixAdditionFilter.java
A 
integration/src/main/java/org/apache/mahout/text/TextParagraphSplittingJob.java
A 
integration/src/main/java/org/apache/mahout/text/SequenceFilesFromDirectory.java
AU
integration/src/main/java/org/apache/mahout/text/SequenceFilesFromMailArchives.java
A 
integration/src/main/java/org/apache/mahout/text/SequenceFilesFromDirectoryFilter.java
AU
integration/src/main/java/org/apache/mahout/text/MailArchivesClusteringAnalyzer.java
A integration/src/main/java/org/apache/mahout/cf
A integration/src/main/java/org/apache/mahout/cf/taste
A integration/src/main/java/org/apache/mahout/cf/taste/impl
A integration/src/main/java/org/apache/mahout/cf/taste/impl/model
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/cassandra
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/cassandra/CassandraDataModel.java
A integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/MySQLBooleanPrefJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/AbstractJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/PostgreSQLJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/MySQLJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/ConnectionPoolDataSource.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/SQL92BooleanPrefJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/ReloadFromJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/SQL92JDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/GenericJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/AbstractBooleanPrefJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/jdbc/PostgreSQLBooleanPrefJDBCDataModel.java
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/mongodb
A 
integration/src/main/java/org/apache/mahout/cf/taste/impl/model/mongodb/MongoDBDataModel.java
A 

Re: Build failed in Jenkins: Mahout-Examples-Cluster-Reuters #74

2012-03-19 Thread Paritosh Ranjan
I don't have the privilege to view/edit the Jenkins job configuration, 
so, I can not see the command from there.
Somehow, I am not able to figure out the command from the console, can 
you help me here by telling the command ( for running the 
Mahout-Examples-Cluster-Reuters build )?


Can you also help by clarifying the protocols on the builds? i.e. which 
builds to test before committing. How much time is allowed to fix 
failing builds ( how much time for which build ), or is there something 
like this?


Is it needed to run/build the Mahout-Examples-Cluster-Reuters ( along 
with Mahout Quality i.e. mvn clean install on trunk  ) before 
committing, and if yes, is it a common practice?


Is this info about the builds documented somewhere? If the information 
about the builds is not documented anywhere, then I would will like to 
add it to the mahout wiki/site/somewhere i.e. which all builds do we 
have, and which one is needed for what purpose and other rules for them. 
If it is already documented, can you please share that link?


I don't think Jenkins is behaving differently than my local box.

Paritosh

On 20-03-2012 02:51, Isabel Drost wrote:

On 17.03.2012 Paritosh Ranjan wrote:

Is there any way to test this build before commit? The trunk is building
successfully and till now, that's all I check before commit. How do I
test this build before commit?

As far as I know all you have to do is to use the same commands that Jenkins
uses to trigger the build*. Don't know at the top of my head which options it
uses, you should be able to find out by going to Jenkins, selecting our build
and clicking configure (you need to be logged into Jenkins for that) or
selecting the last build that failed and looking at its console output.

 From just briefly scanning the output it looks like after successfully
triggering the maven build it fails when executing the cluster_reuters.sh
script.


Isabel

* Obvious other culprits for Jenkins behaving differently than you local box are
different network settings, different maven versions, different settings.xml,
different java version, screwed up local maven repositories on either side and
such - however I don't think neither of those is particularly likely in this
case.