I just started a 6 c1.medium cluster on EMR to vectorize wikipedia using bigram mindf 3 max df 99 minllr 1
Will report when its done :) Hope i dont screw up the command line parameters Robin
I just started a 6 c1.medium cluster on EMR to vectorize wikipedia using bigram mindf 3 max df 99 minllr 1
Will report when its done :) Hope i dont screw up the command line parameters Robin