Steven Huang writes:
>
> It seems that the XML is not correctly parsed and is taken as plain text.
> Is there anything wrong with my training configuration or training corpus?
> Thanks a lot.
Hi Steven,
The Moses XML format isn't pure XML and is still sensitive to whitespace. Each
sentence should be
Hi,
I am trying to build a tree-to-tree model. Before that, I successfully
built a string-to-string syntax model with the following configuration (the
training corpus is in surface form).
/mosesdecoder/scripts/training/train-model.perl \
--root-dir train \
--mgiza \
--mgiza-cpus 20 \
--corpus