Hi Massinissa and all,

I'd like to use the surface, lemma, and POS factors and annotate my
coprora. I have checked treetagger and mxpost tools but neither produces
the [surface|lemma|POS] format used in
http://www.statmt.org/moses/download/factored-corpus.tgz .
Even though I followed the instructions in the moses manual - external
tools, I get a separate file for the POS factor. Then I don't know how to
proceed, as this is not the format of the above link (and thus not accepted
by moses, I suppose)

Thanks,
Viktor



2014-03-20 22:11 GMT+01:00 Massinissa Ahmim <
[email protected]>:

> Hi Viktor,
>
> As far I know, you can use the wrapper scripts (treetagger or mxpost)
> located at /mosesdecoder/scripts/training/wrappers for this purpose.
>
> Regards
>
> Massinissa
>
>
> 2014-03-20 17:37 GMT+01:00 Viktor Pless <[email protected]>:
>
>> Hi, what tools can be used to lemmatize/POS-tag/etc. a corpus in moses
>> format (with the pipes)? I need them regarding Spanish, English, Hungarian.
>> Thanks in advance.
>> Viktor
>>
>> _______________________________________________
>> Moses-support mailing list
>> [email protected]
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>
>
> --
>
> [image: Description : Description : lingua_custodia_final full logo]
>
>  *The Translation Trustee*
>
> *1, Place Charles de Gaulle*
>
> *78180 Montigny-le-Bretonneux*
>
> *Tel : +33 1 30 44 04 23   Mobile : +33 7 61 44 40 84*
>
> *Email :*  *[email protected]
> <[email protected]>*
>
> *Website :*  *www.linguacustodia.com <http://www.linguacustodia.com/> -
> www.thetranslationtrustee.com  <http://www.thetranslationtrustee.com>*
>
> ü Pensez à l'environnement, n'imprimez ce courriel que si nécessaire.
>
> Please do not print this email unless it is absolutely necessary. Spread
> environmental awareness.
>

<<inline: image001.jpg>>

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to