Just a reminder that errors in UD treebanks can be reported as issues in their repositories. As a UD treebank maintainer (Portuguese, in my case), I would love to receive feedback such as that mentioned below.
Alexandre

> On 14 Jun 2022, at 06:39, lesze...@interia.eu wrote:
>
> I observed that the lemmatizer fails for some languages:
> German - Compound nouns are inconsistently lemmatized. Sometimes they are
> lemmatized to the full word, but sometimes they are lemmatized to their last
> word. For example: kundendienstzentrums => zentrum, geheimdienste => dienst
> It causes an enormous number of outcomes, and the lemmatizer fails
> with an out-of-memory error.