Stefan Groschupf wrote:
> Hi,
> > How is document clustering different/related to text categorization?
> 
> Clustering: try to find own categories and put documents that match
> in it. You group all documents with minimal distance together.

Would I be correct to say that you have to define a "distance threshold"
parameter in order to define when to build a new category for a certain
group?

> Classification: you have already categories and samples for
> it, that help you to match other documents.
> You calculate document distances to the existing categories
> and put it in the category with smallest distance.

Regards,
Marcel


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to