If I have two words in dictionary that differ only in captitalization of the first letter then when the tagger finds lowercase in the sentence it correcly tags only from lowercase one (and if it find a capitalized version in the sentence it correcly tags from both). But if I have a capitalized word (proper noun) in a dictionary that does not have a same non-capitalized sibling if the lowercase version is found in the sentence (which I would expect it to be not tagged) it tags it from capitalized word (which is wrong at least for Ukrainian).
For example I have "Бен" (Ben) defined as man's name in the dictionary but not "бен" (ben) so when "бен Ладен" (bin Laden) is found the "бен" is tagged as name. I can probably create a lowercase "бен" in the dictionary with some special/empty tag (it's not a separate word per se) to fix it, but just wanted to doublecheck this was intentional and there's no easy way to configure it the other way. Thanks Andriy ------------------------------------------------------------------------------ Learn Graph Databases - Download FREE O'Reilly Book "Graph Databases" is the definitive new guide to graph databases and their applications. Written by three acclaimed leaders in the field, this first edition is now available. Download your free book today! http://p.sf.net/sfu/13534_NeoTech _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel