Hi,
I have built a string-to-tree syntax-based model and converted the
output of the Enju parser to the format accepted by Moses for syntax-based training.
However, the rule table that was created contains tokens such as <tree label ............ and,
as a result, the translated text also contains them.

Output of Enju:
<sentence id="s0" parse_status="success" fom="0.991145"><cons id="c0" cat="NP" xcat="" head="c1" sem_head="c1" schema="empty_spec_head"><cons id="c1" cat="NX" xcat="" head="t0" sem_head="t0"><tok id="t0" cat="N" pos="NN" base="broadcasting" lexentry="[D<N.3sg>]" pred="noun_arg0">broadcasting</tok></cons></cons></sentence>

Enju output converted to the format accepted by Moses:
<tree label="TOP"> <tree label="NP"> <tree label="NX"> <tree label="N"> broadcasting </tree> </tree> </tree> </tree>
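
The conversion described above can be sketched in Python as follows. This is only a minimal sketch, not the script actually used for this post: the function name enju_to_moses is hypothetical, and it assumes each Enju sentence is well-formed XML (in particular, that a "<" inside an attribute value such as lexentry is escaped as "&lt;"). It maps <sentence> to label "TOP" and every <cons> or <tok> to its cat attribute.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def enju_to_moses(sentence_xml: str) -> str:
    """Convert one Enju <sentence> element to a one-line <tree label="..."> string.

    Hypothetical helper for illustration; assumes well-formed XML input.
    """
    root = ET.fromstring(sentence_xml)

    def convert(node):
        if node.tag == "tok":
            # Leaf: the word itself, XML-escaped so it cannot be read as markup.
            word = escape((node.text or "").strip())
            return f'<tree label="{node.get("cat")}"> {word} </tree>'
        # Internal node: <sentence> becomes TOP, <cons> keeps its category.
        label = "TOP" if node.tag == "sentence" else node.get("cat")
        children = " ".join(convert(child) for child in node)
        return f'<tree label="{label}"> {children} </tree>'

    return convert(root)
```

Applied to the Enju sentence shown above (with the "<" in lexentry escaped), this should yield the same nested NP / NX / N tree under TOP on a single line.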

Rule table created:
42800 [X] ||| <tree [SC] ||| 0.00103349 1 0.0103349 1 ||| 0-0 ||| 10 1 1 |||
[X][ADJP] चरित्र [X] ||| <tree [X][ADJP] <tree [NP] ||| 0.00729519 1 0.00226863 1 ||| 0-1 1-2 ||| 0.485714 1.5619 0.342857 |||
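
To see how many rules are affected, the rule table can be scanned for entries whose source or target side contains tree markup as an ordinary token. The sketch below is only an illustration: find_leaked_markup is a hypothetical helper, not part of Moses, and it assumes the " ||| "-separated layout shown in the rules above, with the source side in the first field and the target side in the second.

```python
# Token prefixes that indicate tree markup was treated as ordinary words.
MARKUP_PREFIXES = ("<tree", "</tree", "label=")


def find_leaked_markup(rule_lines):
    """Return the rule-table lines whose source or target side contains tree markup."""
    leaked = []
    for line in rule_lines:
        fields = [field.strip() for field in line.split("|||")]
        if len(fields) < 2:
            continue  # not a rule line
        # Only the first two fields (source and target side) hold tokens.
        tokens = fields[0].split() + fields[1].split()
        if any(token.startswith(MARKUP_PREFIXES) for token in tokens):
            leaked.append(line)
    return leaked
```

Both rules quoted above would be flagged by this check, since "<tree" appears as a token on their target side.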

Where am I going wrong? Is there a problem with the XML format?

Thanks,
Khyati Gupta
Trainee
CDAC Noida