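I think you will have a hard time applying clustering to this problem. I
would suggest classification as the better choice for this use case, since
the categories you want (travel, product reviews) are already known and you
can train on labeled examples.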
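
To illustrate what "train on labeled examples" means, here is a minimal sketch of a text classifier. It is plain Python, not Mahout's API. The class name, the labels "travel" and "product_review", and the training sentences are all made up for illustration. It uses multinomial Naive Bayes with add-one smoothing, which is one common choice for this kind of task, not necessarily the best one for your data.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesTextClassifier:
    """Tiny multinomial Naive Bayes with add-one smoothing (illustrative only)."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> number of training docs
        self.vocabulary = set()                  # every word seen during training

    @staticmethod
    def tokenize(text):
        # Naive whitespace tokenizer; a real system would do much more.
        return text.lower().split()

    def train(self, text, label):
        words = self.tokenize(text)
        self.word_counts[label].update(words)
        self.doc_counts[label] += 1
        self.vocabulary.update(words)

    def classify(self, text):
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, float("-inf")
        for label, n_docs in self.doc_counts.items():
            # Log prior plus the sum of smoothed log likelihoods per word.
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            for word in self.tokenize(text):
                count = self.word_counts[label][word]
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical labeled training data, invented for this sketch.
classifier = NaiveBayesTextClassifier()
classifier.train("cheap flights and hotel deals for your beach holiday", "travel")
classifier.train("a weekend trip to the mountains by train", "travel")
classifier.train("this phone has a great camera and battery life", "product_review")
classifier.train("the laptop keyboard feels cheap but the screen is sharp", "product_review")

print(classifier.classify("hotel and flights for a mountains trip"))
print(classifier.classify("camera battery and screen"))
```

Clustering, by contrast, would group the pages without any labels, and nothing guarantees that the resulting groups line up with "travel" and "product reviews".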
Paritosh Ranjan
On 20-09-2011 14:37, JAGANADH G wrote:
Hi All
I am working on a small project to identify the vertical of content on
newspaper sites.
Typically an online newspaper contains news items, travel blogs, product
reviews, etc.
If I write a Mahout-based system to identify travel and product reviews from
a newspaper, which strategy will be better:
classification or clustering?