Hi Massinissa and all, I'd like to use the surface, lemma, and POS factors and annotate my coprora. I have checked treetagger and mxpost tools but neither produces the [surface|lemma|POS] format used in http://www.statmt.org/moses/download/factored-corpus.tgz . Even though I followed the instructions in the moses manual - external tools, I get a separate file for the POS factor. Then I don't know how to proceed, as this is not the format of the above link (and thus not accepted by moses, I suppose)
Thanks, Viktor 2014-03-20 22:11 GMT+01:00 Massinissa Ahmim < [email protected]>: > Hi Viktor, > > As far I know, you can use the wrapper scripts (treetagger or mxpost) > located at /mosesdecoder/scripts/training/wrappers for this purpose. > > Regards > > Massinissa > > > 2014-03-20 17:37 GMT+01:00 Viktor Pless <[email protected]>: > >> Hi, what tools can be used to lemmatize/POS-tag/etc. a corpus in moses >> format (with the pipes)? I need them regarding Spanish, English, Hungarian. >> Thanks in advance. >> Viktor >> >> _______________________________________________ >> Moses-support mailing list >> [email protected] >> http://mailman.mit.edu/mailman/listinfo/moses-support >> >> > > > -- > > [image: Description : Description : lingua_custodia_final full logo] > > *The Translation Trustee* > > *1, Place Charles de Gaulle* > > *78180 Montigny-le-Bretonneux* > > *Tel : +33 1 30 44 04 23 Mobile : +33 7 61 44 40 84* > > *Email :* *[email protected] > <[email protected]>* > > *Website :* *www.linguacustodia.com <http://www.linguacustodia.com/> - > www.thetranslationtrustee.com <http://www.thetranslationtrustee.com>* > > ü Pensez à l'environnement, n'imprimez ce courriel que si nécessaire. > > Please do not print this email unless it is absolutely necessary. Spread > environmental awareness. >
<<inline: image001.jpg>>
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
