OK, thanks again, and sorry, I'm really new at this. Now I would like to know
what to do with the files I obtained from running GIZA++ so that Moses can
access them when I run it. I have them all in one folder, but when I try to
run Moses I get this message:
ERROR: Can't read work/corpus_giza//giza.fr-en/fr-en.A3.final.{bz2,gz}
and I don't have a file named fr-en.A3.final.
The files generated by GIZA++ are:
dic.a3.final
dic.d4.final
dic.n3.final
dic.ti.final
dic.tst.trg.vcb
prueba.en.vcb.classes
prueba.fr.vcb
dic.A3.final
dic.D4.final
dic.p0_3.final
dic.trn.src.vcb
prueba.en.vcb.classes.cats
prueba.fr.vcb.classes
dic.actual.ti.final
dic.Decoder.config
dic.perp
dic.trn.trg.vcb
prueba.en_prueba.fr.snt
prueba.fr.vcb.classes.cats
dic.d3.final
dic.gizacfg
dic.t3.final
dic.tst.src.vcb
prueba.en.vcb
prueba.fr_prueba.en.snt
The names start with prueba because the original corpus files are prueba.en
and prueba.fr.
I hope you can help me, or at least tell me where to look for help. Thank you!
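
One possible way to line the file names up, sketched in Python. The error
message above shows that the training script looks for a compressed file
called fr-en.A3.final.gz (or .bz2) inside work/corpus_giza/giza.fr-en, while
the standalone GIZA++ run wrote dic.A3.final. The function name
stage_alignment is hypothetical, and it is an assumption that dic.A3.final
holds the alignments for the fr-en direction; that depends on how GIZA++ was
invoked. The training script may also expect a matching file for the inverse
direction, which this sketch does not produce.

```python
import gzip
import shutil
from pathlib import Path


def stage_alignment(src: Path, giza_dir: Path, prefix: str) -> Path:
    """Gzip-copy one GIZA++ alignment file to <giza_dir>/<prefix>.A3.final.gz."""
    giza_dir.mkdir(parents=True, exist_ok=True)
    dest = giza_dir / f"{prefix}.A3.final.gz"
    # Stream the plain-text alignment file into a gzip archive.
    with open(src, "rb") as fin, gzip.open(dest, "wb") as fout:
        shutil.copyfileobj(fin, fout)
    return dest


if __name__ == "__main__":
    # Assumption: dic.A3.final in the current folder is the fr-en alignment.
    src = Path("dic.A3.final")
    if src.exists():
        print(stage_alignment(src, Path("work/corpus_giza/giza.fr-en"), "fr-en"))
```

The original dic.A3.final is left untouched, so the copy can be redone if
the direction turns out to be the other one.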
On 1 July 2009 14:36, Felipe Sánchez Martínez
<[email protected]> wrote:
>
> Hi,
>
> just add "--first-step 3 --last-step 9" when calling
> train-factored-phrase-model.perl
>
> --
> Felipe.
>
> On Wed, 01-07-2009 at 13:52 -0430, maria sol ferrer wrote:
> > Hi, thanks for all your answers. I have already looked on the internet
> > and tried those solutions, but I can't seem to fix the problem. I have now
> > run the sample English-French corpus through GIZA++ directly (not through
> > the Moses script that calls GIZA++), and it seems to generate the
> > dictionary and everything else fine. Is there a way to run Moses from
> > step 3 (after generating the GIZA++ files)? My problem seemed to be in
> > step 2. Thanks again
> >
> > 2009/6/25 Barry Haddow <[email protected]>
> > Hi Maria
> >
> > There's some discussion of the GIZA buffer overflow problem on the GIZA
> > website.
> > http://code.google.com/p/giza-pp/updates/list
> >
> > The problem seems to arise with recent versions of g++. If you could
> > recompile with an earlier version (g++ 4.1) then it should be ok,
> >
> > regards
> > Barry
> >
> >
> > On Thursday 25 June 2009 04:13, maria sol ferrer wrote:
> > > Hi, thanks for your reply, but I have tried with a cutoff of 40, 20, 10,
> > > even 5 words per line, and I am still getting the same error. I have
> > > followed the training steps exactly as described and I can't seem to
> > > find the problem. Any help would be really appreciated! Thank you
> > >
> > > 2009/6/23 John Burger <[email protected]>
> > >
> > > > maria sol ferrer wrote:
> > > > > Hi, I am going to be working on my thesis with Moses and I am having
> > > > > some problems getting started. So far I was able to download and
> > > > > install Moses and all its necessary tools successfully; I tried the
> > > > > one-sentence example and it worked fine. Now I am trying to do the
> > > > > training described in the "Moses installation and training
> > > > > run-through" (http://www.statmt.org/moses_steps.html), but when I run
> > > > > Moses, during step 2.1a I was getting this error:
> > > > >
> > > > > *** buffer overflow detected ***: /home/puchi/tools/bin/GIZA++ terminated
> > > > >
> > > > > Now, a few days later, without changing anything, it seems to go on to
> > > > > step 2.1b but I get:
> > > >
> > > > In my experience, the buffer overflow error is due to not sufficiently
> > > > filtering your corpus - GIZA is unhappy with very long segments. That
> > > > same page describes the "Filter out long sentences" step, with an
> > > > example invocation like this:
> > > >
> > > > clean-corpus-n.perl ... 1 40
> > > >
> > > > What value are you using for the max length? I would decrease this
> > > > until GIZA stops throwing the error.
> > > >
> > > > Also, I suspect it only appears to go on to step 2.1b because some
> > > > partial file is lying around - don't be fooled.
> > > >
> > > > - John D. Burger
> > > > MITRE
> > > >
> > > > _______________________________________________
> > > > Moses-support mailing list
> > > > [email protected]
> > > > http://mailman.mit.edu/mailman/listinfo/moses-support
> >
> >
> > --
> > The University of Edinburgh is a charitable body, registered in
> > Scotland, with registration number SC005336.
> >
> >
>
>