If I have some numerical data (e.g., the data at
http://archive.ics.uci.edu/ml/machine-learning-databases/dermatology/dermatology.data)
and want to run a Mahout classification algorithm on that data, what steps
do I need to take in order to put the data into the correct input format? I
have read that most everything requires a sequence file but I'm not sure
that I still understand what that is. Do I need to provide a key for each
row in this dataset (and the rest of the row sans the final column would be
the value)?