Hi,
I have been able to run the bayesian network 20news group example provided at Mahout website. I am interested in running the Wikipedia bayes example, as it is a much larger dataset. >From several googling attempts, I figured it is a bit different workflow than running the 20news group example -- e.g., I would need to provide a categories.txt file, and invoke WikipediaXmlSplitter, call wikipediaDataSetCreator and etc. I am wondering is there a document somewhere that describes the process of running Wikipedia bayes example ? https://cwiki.apache.org/MAHOUT/wikipedia-bayes-example.html seems no longer work. Greatly appreciated! Wei