jay vyas created BIGTOP-1128:
--------------------------------

             Summary: modularize and update mahout sample data sets
                 Key: BIGTOP-1128
                 URL: https://issues.apache.org/jira/browse/BIGTOP-1128
             Project: Bigtop
          Issue Type: Bug
            Reporter: jay vyas


The Mahout smoke tests have a lot of dependencies.

Concretely, we need to fix the URL for the MovieLens sample data, which has moved
from http://www.grouplens.org/system/files/ml-1m.zip
to http://files.grouplens.org/papers/ml-1m.zip

Otherwise the Mahout smoke tests break, because the sample data is no longer 
at the old location.

More generally, consolidating and verifying these download URLs in a separate 
function might make the tests simpler to debug. Otherwise, a stale URL can 
leave an HTML document stored as a .zip file, which causes a very 
hard-to-interpret error in the tests: an exception about the zip file not 
being formatted correctly.
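
As a rough illustration of the idea (not the actual Bigtop test code), here is 
a minimal Python sketch of such a function. The names SAMPLE_DATA_URLS, 
BadSampleData, check_is_zip and fetch_sample_data are all hypothetical; the 
only value taken from this issue is the new MovieLens URL. It keeps the URLs 
in one place and checks that each download really is a zip archive before the 
tests use it.

```python
import io
import urllib.request
import zipfile

# Hypothetical registry: one place to list every sample-data URL the tests need.
SAMPLE_DATA_URLS = {
    "movielens-1m": "http://files.grouplens.org/papers/ml-1m.zip",
}


class BadSampleData(Exception):
    """Raised when a download is not the archive the tests expect."""


def check_is_zip(name, url, payload):
    """Fail early, naming the URL, if the payload is not a zip archive."""
    if not zipfile.is_zipfile(io.BytesIO(payload)):
        # Show the first bytes so an HTML error page is easy to recognize.
        preview = payload[:60].decode("latin-1")
        raise BadSampleData(
            f"{name}: {url} did not return a zip archive; starts with {preview!r}"
        )


def fetch_sample_data(name, dest_path, opener=urllib.request.urlopen):
    """Download one registered data set, verify it, then write it to disk."""
    url = SAMPLE_DATA_URLS[name]
    with opener(url) as response:
        payload = response.read()
    check_is_zip(name, url, payload)
    with open(dest_path, "wb") as out:
        out.write(payload)
    return dest_path
```

With something like this, a moved URL would presumably surface as an error 
that names the data set and the URL, rather than as a zip format exception 
deep inside a test. The opener parameter is only there so the download can be 
faked in a unit test.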

Other thoughts on how to simplify and isolate the moving parts of the Mahout 
tests? We can bundle them into a patch. It would be a shame if the only thing 
this JIRA resulted in was a fix to a single URL :)




--
This message was sent by Atlassian JIRA
(v6.1#6144)
