[ https://issues.apache.org/jira/browse/HIVEMALL-111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Makoto Yui updated HIVEMALL-111: -------------------------------- Fix Version/s: (was: 0.6.0) 0.7.0 > Add more ready-to-use data to the Docker image > ---------------------------------------------- > > Key: HIVEMALL-111 > URL: https://issues.apache.org/jira/browse/HIVEMALL-111 > Project: Hivemall > Issue Type: Improvement > Reporter: Takuya Kitazawa > Priority: Major > Labels: docker > Fix For: 0.7.0 > > > In addition to the current *$HOME/bin/prepare_iris.sh* script, we can create > more data preparation scripts. More concretely, at least datasets used by > tutorials in our user guide need to be supported: > * a9a > * news20 > * kdd2010 a/b > * webspam > * E2006-tfidf > Unfortunately, we cannot automate to use datasets hosted by Kaggle because > they require us to log-in to Kaggle. -- This message was sent by Atlassian Jira (v8.3.4#803005)