Hi,

GIZA++ is designed for parallel sentence pairs, but it tolerates some degree of noise. If you feed it "comparable" sentence pairs, by which I assume you mean sentence pairs that are only partially (or not at all) translationally equivalent, then you will hopefully get word alignments for the relevant parts, but also spurious alignments for the mismatched parts. Given a large amount of such data, you may be able to learn useful word translation tables, but that depends on how noisy the data is.
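To make the effect of noise concrete, here is a minimal sketch of EM training for IBM Model 1, the simplest of the models GIZA++ trains. It is not GIZA++ itself and not how you would run it in practice. The function names `train_ibm_model1` and `best_translation` and the toy English-German sentences are made up for illustration, and the sketch leaves out the NULL word and everything beyond Model 1.

```python
from collections import defaultdict


def train_ibm_model1(pairs, iterations=10):
    """Estimate t(f | e) with EM. `pairs` is a list of (e_tokens, f_tokens)."""
    f_vocab = {f for _, f_tokens in pairs for f in f_tokens}
    uniform = 1.0 / len(f_vocab)
    t = defaultdict(lambda: uniform)  # start from a uniform table

    for _ in range(iterations):
        count = defaultdict(float)  # expected counts for (f, e)
        total = defaultdict(float)  # expected counts for e
        # E-step: spread each f over the e words of its own sentence pair.
        for e_tokens, f_tokens in pairs:
            for f in f_tokens:
                norm = sum(t[(f, e)] for e in e_tokens)
                for e in e_tokens:
                    frac = t[(f, e)] / norm
                    count[(f, e)] += frac
                    total[e] += frac
        # M-step: renormalise the expected counts into probabilities.
        t = defaultdict(float)
        for (f, e), c in count.items():
            t[(f, e)] = c / total[e]
    return t


def best_translation(t, e):
    """Return the f with the highest t(f | e)."""
    candidates = {f: p for (f, e2), p in t.items() if e2 == e}
    return max(candidates, key=candidates.get)


# Hypothetical toy data: three parallel pairs ...
clean = [
    (["the", "house"], ["das", "haus"]),
    (["the", "book"], ["das", "buch"]),
    (["a", "book"], ["ein", "buch"]),
]
# ... plus one "comparable" pair that is not a translation.
noisy = clean + [(["the", "house"], ["ein", "auto"])]

clean_t = train_ibm_model1(clean)
noisy_t = train_ibm_model1(noisy)

print(best_translation(clean_t, "house"))
print(noisy_t[("auto", "the")], noisy_t[("auto", "house")])
```

On the clean pairs the table settles on the expected word translations. With the mismatched pair added, "auto" has to be explained by "the" or "house", so it takes probability mass away from the correct entries. That is the kind of spurious alignment you should expect from non-equivalent sentence pairs.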
If you add parallel sentence pairs, you will likely get better alignments.

-phi

On Thu, Jun 19, 2014 at 7:20 AM, alireza tabebordbar <[email protected]> wrote:
> Hi all,
> I am a Master's degree student and I'm new to SMT.
> I extracted some comparable sentences and I want to use GIZA++ for
> word alignment. I think GIZA++ uses the EM algorithm for aligning words.
> However, I don't know whether I can feed comparable sentences directly to
> GIZA++ or whether I have to combine them with some parallel sentences.
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
