2011/1/23 Hèctor Alòs i Font <[email protected]>

> 2011/1/23 Francis Tyers <[email protected]>
>> > 90 states and 420 ambiguity classes
>> > Kupiec's initialization of transition and emission probabilities...
>> > make: *** [fr-eo.prob] Error 1
>
I could not open http://tinyurl.com/4jkmko3 .
However, I do not get the same error when I use the corpus from SVN
(tekstaro/fr.crp.txt).

$ make -f fr-eo-unsupervised.make
apertium-validate-tagger apertium-eo-fr.fr.tsx
apertium-tagger -t 8 \
                           fr-tagger-data/fr.dic \
                           fr-tagger-data/fr.crp \
                           apertium-eo-fr.fr.tsx \
                           fr-eo.prob;
Calculating ambiguity classes...

90 states and 420 ambiguity classes
Kupiec's initialization of transition and emission probabilities...
.Error: A new ambiguity class was found. I cannot continue.
Word 'M' not found in the dictionary.
New ambiguity class: {ACRONIMOM,NPALTRES,NUM}
Take a look at the dictionary and at the training corpus. Then, retrain.
make: *** [fr-eo.prob] Fejl 1


So your problem may be related to the word (letter) 'M':

<e lm="M">               <i>M</i><par n="I__num"/></e>
<e lm="m">               <i>m</i><par n="BBVA__n"/></e>
<e lm="M">               <i>M</i><par n="Carrefour__np"/></e>
<e lm="M.">              <i>M.</i><par n="BBVA__n"/></e>
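
A quick way to spot entries like these is to group dictionary entries by lemma and list those that point to more than one paradigm. The following is only a minimal sketch in Python: the names DIX_FRAGMENT, paradigms_by_lemma and ambiguous_lemmas are made up for illustration, the fragment is just the four entries quoted above, and the regular expression only handles simple one-line entries with a single <par>. It also keys on the lm attribute, which is a simplification of how the analyser matches surface forms.

```python
import re
from collections import defaultdict

# Hypothetical fragment: the four entries quoted in this message.
DIX_FRAGMENT = '''
<e lm="M">               <i>M</i><par n="I__num"/></e>
<e lm="m">               <i>m</i><par n="BBVA__n"/></e>
<e lm="M">               <i>M</i><par n="Carrefour__np"/></e>
<e lm="M.">              <i>M.</i><par n="BBVA__n"/></e>
'''

# Matches one-line entries only: captures the lemma and the paradigm name.
ENTRY = re.compile(r'<e lm="([^"]*)">.*?<par n="([^"]*)"/>')


def paradigms_by_lemma(text):
    """Map each lemma to the list of paradigms it is attached to."""
    found = defaultdict(list)
    for lemma, paradigm in ENTRY.findall(text):
        found[lemma].append(paradigm)
    return found


def ambiguous_lemmas(text):
    """Keep only the lemmas that appear with more than one paradigm."""
    return {
        lemma: paradigms
        for lemma, paradigms in paradigms_by_lemma(text).items()
        if len(paradigms) > 1
    }


if __name__ == "__main__":
    for lemma, paradigms in sorted(ambiguous_lemmas(DIX_FRAGMENT).items()):
        print(lemma, "->", ", ".join(paradigms))
```

On this fragment only 'M' is reported, with the paradigms I__num and Carrefour__np. The comparison is case-sensitive, so 'm' and 'M.' are treated as separate lemmas.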

If I remove the entry
<e lm="M">               <i>M</i><par n="Carrefour__np"/></e>
the tagger trains without problems:


$ make -f fr-eo-unsupervised.make
Generating fr-tagger-data/fr.dic
apertium-destxt < fr-tagger-data/fr.crp.txt | lt-proc fr-eo.automorf.bin > fr-tagger-data/fr.crp
This may take some time. Please, take a cup of coffee and come back later.
apertium-validate-dictionary apertium-eo-fr.fr.dix
apertium-validate-tagger apertium-eo-fr.fr.tsx
lt-expand apertium-eo-fr.fr.dix | grep -v "__REGEXP__" | grep -v ":<:" |\
    awk 'BEGIN{FS=":>:|:"}{print $1 ".";}' | apertium-destxt >fr.dic.expanded
lt-proc -a fr-eo.automorf.bin <fr.dic.expanded | \
    apertium-filter-ambiguity apertium-eo-fr.fr.tsx > fr-tagger-data/fr.dic
rm fr.dic.expanded;
apertium-validate-tagger apertium-eo-fr.fr.tsx
apertium-tagger -t 8 \
                           fr-tagger-data/fr.dic \
                           fr-tagger-data/fr.crp \
                           apertium-eo-fr.fr.tsx \
                           fr-eo.prob;
Calculating ambiguity classes...

90 states and 420 ambiguity classes
Kupiec's initialization of transition and emission probabilities...
....................................................
Applying forbid and enforce rules...
Training (Baum-Welch)...
....................................................Log=1.65287e+06
....................................................Log=1.54451e+06
....................................................Log=1.52963e+06
....................................................Log=1.52437e+06
....................................................Log=1.52179e+06
....................................................Log=1.52007e+06

(etc.)
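
For reference, the failure in the first run can be pictured as a set-membership check. Judging from the log, the ambiguity classes are collected first, and training then stops when the corpus contains a word whose set of tags is not one of them. The sketch below is a toy model of that check in Python, not apertium-tagger's actual code: the function name new_ambiguity_classes and the list of known classes are assumptions made for illustration, and only the tag names ACRONIMOM, NPALTRES and NUM come from the error message.

```python
def new_ambiguity_classes(known_classes, corpus_words):
    """Return (word, tag set) pairs whose tag set is not a known class.

    known_classes: iterable of tag collections seen while building classes.
    corpus_words:  iterable of (word, tags) pairs taken from the corpus.
    """
    known = {frozenset(tags) for tags in known_classes}
    unseen = []
    for word, tags in corpus_words:
        tag_set = frozenset(tags)
        if tag_set not in known:
            unseen.append((word, tag_set))
    return unseen


if __name__ == "__main__":
    # Hypothetical known classes; the real run reported 420 of them.
    known = [{"NUM"}, {"NPALTRES"}, {"NUM", "NPALTRES"}]
    # 'M' carries the three tags named in the error message.
    corpus = [("3", ["NUM"]), ("M", ["ACRONIMOM", "NPALTRES", "NUM"])]
    for word, tag_set in new_ambiguity_classes(known, corpus):
        print(f"Word '{word}': new class {{{','.join(sorted(tag_set))}}}")
```

In this model, removing one of the readings of 'M' changes its tag set to one that is already known, which would be consistent with the second run training normally.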



-- 
Jacob Nordfalk
http://javabog.dk
Teaches Android at http://ihk.dk
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff