On 13 May 2011 22:55, Paulo Schreiner <[email protected]> wrote:
> Anyone here has some experience with the apertium tagger?
>
> I have created (to my best knowledge) all required resources, but got
> stuck with the following error:
>
> apertium-tagger -d -s 0 pt.expand pt.tagged.txt pt.tsx pt.prob pt.tagged
> pt.tagged.morf
> Calculating ambiguity classes...
>
> 30 states and 31 ambiguity classes
> Kupiec's initialization of transition and emission probabilities...
> Initializing transition and emission probabilities from a hand-tagged
> corpus...
> {adv} Word: depois -- {prp,adv} Word: depois
> Error: A new ambiguity class was found. I cannot continue.
> Word 'depois' not found in the dictionary.
> New ambiguity class: {prp,adv}
> Take a look at the dictionary, then retrain.

'depois' needs to be in the dictionary as both a preposition and an
adverb, to match the corpus. In all likelihood the word is already
present (otherwise the tagger couldn't have encountered an ambiguity
for it), so you'll probably need to look at the commands in the
Makefile that filter the output of lt-expand: they are discarding too
much.
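
One way to check what survives the filtering is to list the analyses each surface form still has in the expanded file. The following is only a minimal sketch in Python, assuming the file keeps lt-expand's usual one-entry-per-line layout (surface:lemma<tags>, with :>: or :<: in place of the single colon for direction-restricted entries). The function name and the sample lines are made up for illustration, and the tags in the dictionary may be named differently from the coarse tags defined in pt.tsx.

```python
import re
from collections import defaultdict


def first_tags_by_surface(lines):
    """Map each surface form to the set of first tags of its analyses.

    Hypothetical helper: assumes lt-expand style lines such as
    'depois:depois<adv>' or 'depois:>:depois<pr>'.
    """
    tags = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        # Split off the surface form at the first colon.
        surface, _, rest = line.partition(":")
        # Drop a direction marker (':>:' or ':<:') if one is present.
        # Note: direction-restricted entries are kept, not filtered out.
        if rest.startswith(">:") or rest.startswith("<:"):
            rest = rest[2:]
        # The first <...> tag after the lemma is normally the part of speech.
        match = re.search(r"<([^>]+)>", rest)
        if match:
            tags[surface].add(match.group(1))
    return tags


# Made-up sample lines, not taken from a real dictionary.
sample = [
    "depois:depois<adv>",
    "depois:>:depois<pr>",
    "casa:casa<n><f><sg>",
]
print(sorted(first_tags_by_surface(sample)["depois"]))  # ['adv', 'pr']
```

Run over the real file (for example by passing an open file object for pt.expand), a word that shows only one tag here although the full dictionary gives it two would point at the filter as the place where the analysis is lost.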
