Re: [GRASS-user] v.class.mlR Error

Moritz Lennert Wed, 20 Jun 2018 04:48:29 -0700

On 20/06/18 13:18, Moritz Lennert wrote:

Actually I noticed that your error message concerned the qbwwv voting
algorithm which wasn't in the command line you provided. I was just able
to reproduce your error using that same voting scheme. I'll have to dig
into this a bit deeper to see where the error comes from.


I think I found the issue: your training data is insufficient.

Here is the number of training segments for each class:

 1  2  3  4  5  6  7  8  9 10 11 12 13 15 16 18 19 20
 2  2  1  2  7 12  1  1  3  3  2  2  4  2  1  1  2  2

As you can see most classes have only 1-2 segments. The random forestalgorithm's crossvalidation works by randomly selecting subsets oftraining data for each run. This means that using your training data setit will very often have subsets with some classes not represented atall, leading to warnings such as this:

50: model fit failed for Fold4.Rep02: mtry=60 Error inrandomForest.default(x, y, mtry = param$mtry, ...) :

  Can't have empty classes in y.

This means that often the random forest algorithm cannot determineaccuracy measures through cross-validation. These accuracy measures areused as weights in the weighted voting schemes.

You could still use v.class.mlR either by not using a tree basedapproach such as random forest, or by not using a weighted vote, butjust a simple majority vote (the default). But I would not recommend itin light of your limited training data. In general rule of thumb youshould have at least 30 training segments per class, but often you mighteven need a bit more. Another solution would be to be less ambitious interms of number of classes. 20 is a lot. Often classifications target5-10 classes.


Moritz


_______________________________________________
grass-user mailing list
[email protected]
https://lists.osgeo.org/mailman/listinfo/grass-user

Re: [GRASS-user] v.class.mlR Error

Reply via email to