Takuya Kitazawa created HIVEMALL-111:
----------------------------------------

             Summary: Add more ready-to-use data to the Docker image
                 Key: HIVEMALL-111
                 URL: https://issues.apache.org/jira/browse/HIVEMALL-111
             Project: Hivemall
          Issue Type: Improvement
            Reporter: Takuya Kitazawa


In addition to the existing *$HOME/bin/prepare_iris.sh* script, we can add 
more data preparation scripts to the Docker image. At a minimum, the datasets 
used by the tutorials in our user guide should be supported:

* a9a
* news20
* kdd2010 a/b
* webspam
* E2006-tfidf

Unfortunately, we cannot automate the download of datasets hosted on Kaggle, 
because downloading them requires logging in to Kaggle.
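
As a rough illustration of what one of these scripts could look like, here is 
a minimal sketch for a9a. Everything in it is an assumption rather than the 
actual layout of the image: the script name (prepare_a9a.sh), the function 
names, the output file names, the download location (the LIBSVM dataset 
collection), the availability of curl and awk, and the tab-separated 
"rowid, label, features" output format. It stops at writing local files; 
loading them into Hive tables is left out.

```shell
#!/bin/sh
# Hypothetical sketch of a prepare_a9a.sh script; all names and paths here
# are illustrative assumptions, not part of the existing Docker image.
set -eu

# Assumed download location: the LIBSVM dataset collection.
BASE_URL="https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary"

# Print the download URL for one a9a split ("train" or "test").
a9a_url() {
  case "$1" in
    train) echo "$BASE_URL/a9a" ;;
    test)  echo "$BASE_URL/a9a.t" ;;
    *)     echo "unknown split: $1" >&2; return 1 ;;
  esac
}

# Convert LIBSVM lines ("label idx:val idx:val ...") read from stdin into
# tab-separated rows: rowid<TAB>label<TAB>comma-separated features.
libsvm_to_tsv() {
  awk '{
    features = ""
    for (i = 2; i <= NF; i++) features = features (i > 2 ? "," : "") $i
    printf "%d\t%s\t%s\n", NR, $1, features
  }'
}

# Download both splits into the given directory and convert each one.
prepare_a9a() {
  out_dir="$1"
  mkdir -p "$out_dir"
  for split in train test; do
    raw="$out_dir/a9a.$split.libsvm"
    curl -fsSL -o "$raw" "$(a9a_url "$split")"
    libsvm_to_tsv < "$raw" > "$out_dir/a9a.$split.tsv"
  done
}

# Run only when an output directory is passed, e.g. ./prepare_a9a.sh /tmp/a9a
if [ "$#" -gt 0 ]; then
  prepare_a9a "$1"
fi
```

Scripts for the other listed datasets could presumably follow the same 
shape, with only the URL lookup and the conversion step changing per dataset.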



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
