[ https://issues.apache.org/jira/browse/MAHOUT-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15225099#comment-15225099 ]
shashi bushan dongur commented on MAHOUT-1788:
----------------------------------------------

[~smarthi] I currently have Mahout installed and set up on my VM. I am digging through the source code to understand how it works, and I will post an update when I start editing the code.

Is there any resource I can look at to learn how to efficiently edit and run Mahout? I have followed the instructions on GitHub, but I am having a hard time understanding how to run and test the code. Any resource on that would help hugely!

P.S.: I am new to Apache open source, and to open source in general.

> spark-itemsimilarity integration test script cleanup
> ----------------------------------------------------
>
>                 Key: MAHOUT-1788
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1788
>             Project: Mahout
>          Issue Type: Improvement
>          Components: cooccurrence
>    Affects Versions: 0.11.0
>            Reporter: Pat Ferrel
>            Assignee: Pat Ferrel
>            Priority: Trivial
>             Fix For: 1.0.0
>
>
> binary release does not contain data for itemsimilarity tests, neither binary
> nor source versions will run on a cluster unless data is hand copied to hdfs.
> Clean this up so it copies data if needed and the data is in both versions.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)