Hi Barry,

I am not training GIZA++ through Moses; I am training it independently. Will
that make any difference? In any case, I do not have clean-corpus-n.perl in
GIZA++. Please tell me what I should do about it.
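
For context, the cleaning step Barry asks about is essentially a length and length-ratio filter over the sentence pairs. Below is a minimal Python sketch of that idea, assuming whitespace-tokenised text with one sentence per line. It is not the Moses script itself: the function name clean_parallel and the thresholds are hypothetical, and the ratio of 9 is chosen only to match the fertility limit reported in the log quoted below.

```python
def clean_parallel(src_lines, tgt_lines, min_len=1, max_len=80, max_ratio=9.0):
    """Keep only sentence pairs whose token counts look alignable.

    Assumptions (not taken from the thread): whitespace tokenisation,
    and the default thresholds min_len, max_len and max_ratio.
    """
    kept = []
    for src, tgt in zip(src_lines, tgt_lines):
        s_tok, t_tok = src.split(), tgt.split()
        ns, nt = len(s_tok), len(t_tok)
        # Drop pairs where either side is empty, too short or too long.
        if not (min_len <= ns <= max_len and min_len <= nt <= max_len):
            continue
        # Drop pairs whose length ratio exceeds the limit in either direction.
        if ns > max_ratio * nt or nt > max_ratio * ns:
            continue
        # Normalise whitespace so the tokens are separated by single spaces.
        kept.append((" ".join(s_tok), " ".join(t_tok)))
    return kept


if __name__ == "__main__":
    source = ["a", "hello   world", ""]
    target = ["b c d e f g h i j k l", "namaste duniya", "x y"]
    for s, t in clean_parallel(source, target):
        print(s, "|||", t)
```

In this sketch a pair with 1 source token and 11 target tokens (ratio 11) is dropped, which is the same situation GIZA++ warns about in the log. The vocabulary (.vcb) and sentence (.snt) files would then need to be regenerated from the filtered text so that every word id in the sentence file also appears in the vocabulary.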

On Mon, Jan 31, 2011 at 3:07 PM, Barry Haddow <[email protected]> wrote:

> Hi Nakul
>
> Did you clean your corpus first (i.e. run clean-corpus-n.perl over it)?
>
> best regards - Barry
>
> On Monday 31 January 2011 04:20, nakul sharma wrote:
> > Hi all,
> >
> > I have g++ version 4.4.3 and Ubuntu 10.04 LTS. While training
> > GIZA++, I get the following error when running the GIZA++ executable:
> >
> > Reading vocabulary file from:200ESens.vcb
> > Reading vocabulary file from:200HSens.vcb
> > {WARNING:(a)truncated sentence 0}{WARNING:(a)truncated sentence 1}WARNING:
> > The following sentence pair has source/target sentence length ration more
> > than the maximum allowed limit for a source word fertility
> >  source length = 1 target length = 11 ratio 11 ferility limit : 9
> > Shortening sentence
> > Sent No: 3 , No. Occurrences: 1
> > 0 254
> > 57 5 3 58 59 60 5 61 62 63 64
> >
> > I get this warning for almost every sentence number, and then for
> > sentence number 98 I get this error message:
> >
> > Sent No: 98 , No. Occurrences: 1
> > 0 457 458
> > 909 910 15 911 17 86 912 913 65 3 914 915 22 916 11 917 170 162 918 919 3
> > 684 22 8 920 921 22 8 333 922 923 924 22 925
> > ERROR: target word 937 is not in the vocabulary list.
> >
> > GIZA++ has generated only one file, **.root.gfcs.
> >
> > Please tell me how to deal with this problem.
>
> --
> The University of Edinburgh is a charitable body, registered in
> Scotland, with registration number SC005336.
>
>


-- 
Thanks & Regards,
nakul
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
