In my opinion the models should be documented. In some cases the training corpus is stated, but in others it is not. We should also say which features were used and what results were obtained on which dataset. If default features are used, we should say so as well.
If we cannot provide this information, we should add a disclaimer saying so. Furthermore, I can provide some models trained on the usual corpora for pos, namefinder and parse. Cheers R
