Dear Prof. Koehn,

The puzzling thing is that there is no mismatch between those files! The only mismatch is the one reported by the error output:
>> !alignment point (42,23) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (33,24) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (28,26) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (30,28) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (29,29) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (32,29) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (31,30) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (34,31) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (37,34) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (42,35) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (41,36) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (35,37) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (42,37) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (36,38) out of range (0-31,0-25) in line 1, ignoring
>> alignment point (43,39) out of range (0-31,0-25) in line 1, ignoring

On Tue, Nov 13, 2012 at 12:01 AM, Philipp Koehn <[email protected]> wrote:
> Hi,
>
> can you take a look at the files corpus.lowercased.* ?
> There seems to be a mismatch between those files and
> the GIZA++ output.
>
> -phi
>
> On Mon, Nov 12, 2012 at 6:58 AM, Cuong Hoang <[email protected]> wrote:
> > I attach the output I get when I run MOSES.
> > I want to note that the quality of my toolkit is good, since I used
> > GIZA++ as a reference in every aspect before coding.
> > Besides, the format of the word alignment output is correct.
> > However, I am stuck on this problem!
> >
> > On Mon, Nov 12, 2012 at 10:42 PM, Cuong Hoang <[email protected]> wrote:
> >>
> >> Hi all,
> >> These days I've been stuck with a very puzzling error.
> >> I've been coding an IBM model training toolkit (Models 1-3) and am now
> >> finishing IBM Model 4.
> >> The output of my toolkit follows exactly the format MOSES expects, as
> >> described at: http://www.statmt.org/moses/?n=FactoredTraining.RunGIZA
> >> For example:
> >>
> >> # Sentence pair (1) source length 4 target length 3 alignment score : 0.00643931
> >> wiederaufnahme der sitzungsperiode
> >> NULL ({ }) resumption ({ 1 }) of ({ }) the ({ 2 }) session ({ 3 })
> >> # Sentence pair (2) source length 17 target length 18 alignment score : 1.74092e-26
> >> ich erklaere die am donnerstag , den 28. maerz 1996 unterbrochene sitzungsperiode des europaeischen parlaments fuer wiederaufgenommen .
> >>
> >> Now I am stuck on a very strange error: alignment point out of range.
> >> For example:
> >>
> >> !alignment point (42,23) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (33,24) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (28,26) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (30,28) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (29,29) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (32,29) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (31,30) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (34,31) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (37,34) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (42,35) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (41,36) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (35,37) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (42,37) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (36,38) out of range (0-31,0-25) in line 1, ignoring
> >> alignment point (43,39) out of range (0-31,0-25) in line 1, ignoring
> >>
> >> However, the outputs of my toolkit for pair 1 are:
> >>
> >> From English-French:
> >> # Sentence pair (1) source length 40 target length 44 alignment score : 1.0
> >> à partir de la fin du xixe siècle , la découverte du spectre électromagnétique et du monde de l 'atome va aussi mener à l 'apparition d 'une nouvelle branche de l 'astronomie , la plus importante de nos jours : l 'astrophysique .
> >> NULL ({ }) from ({ 1 }) the ({ 4 }) 19th ({ 41 }) century ({ 8 }) onwards ({ 2 5 7 }) , ({ 9 }) the ({ 6 }) discovery ({ 11 }) of ({ 12 }) the ({ 10 }) electromagnetic ({ 14 }) spectrum ({ 13 }) and ({ 15 }) the ({ 16 }) world ({ 17 }) of ({ 18 }) the ({ 19 }) atom ({ 20 39 }) spurred ({ 21 22 40 }) on ({ 24 }) the ({ 25 }) development ({ }) of ({ 3 }) astrophysics ({ 23 26 43 }) , ({ 34 }) a ({ 27 28 }) new ({ 29 }) discipline ({ }) in ({ 31 }) astronomy ({ 30 33 }) that ({ 32 }) is ({ 35 }) now ({ }) considered ({ }) to ({ 38 }) be ({ }) the ({ 42 }) most ({ 36 }) important ({ 37 }) . ({ 44 })
> >>
> >> From French-English:
> >> # Sentence pair (1) source length 44 target length 40 alignment score : 1.0
> >> from the 19th century onwards , the discovery of the electromagnetic spectrum and the world of the atom spurred on the development of astrophysics , a new discipline in astronomy that is now considered to be the most important .
> >> NULL ({ }) à ({ }) partir ({ 1 }) de ({ }) la ({ 2 }) fin ({ }) du ({ 7 }) xixe ({ 3 5 }) siècle ({ 4 }) , ({ 6 }) la ({ 10 }) découverte ({ 8 }) du ({ 9 }) spectre ({ 12 }) électromagnétique ({ 11 34 }) et ({ 13 }) du ({ 14 }) monde ({ 15 }) de ({ 16 }) l ({ 17 }) 'atome ({ 18 19 33 }) va ({ 20 32 }) aussi ({ }) mener ({ }) à ({ }) l ({ 21 }) 'apparition ({ 22 }) d ({ 23 }) 'une ({ 26 }) nouvelle ({ 27 }) branche ({ }) de ({ 29 }) l ({ }) 'astronomie ({ 28 30 }) , ({ 25 }) la ({ 31 }) plus ({ }) importante ({ 39 }) de ({ 35 }) nos ({ }) jours ({ }) : ({ }) l ({ 37 }) 'astrophysique ({ 24 36 38 }) . ({ 40 })
> >>
> >> So I wonder what exactly the problem is here, since the word alignment
> >> output looks normal.
> >> Thanks,
> >> best regards,
> >> C. Hoang
> >>
> >> --
> >> Hoàng Cường
> >> SMTNerd
> >>
> >
> > --
> > Hoàng Cường
> > SMTNerd
> >
> > _______________________________________________
> > Moses-support mailing list
> > [email protected]
> > http://mailman.mit.edu/mailman/listinfo/moses-support
> >
>

--
Hoàng Cường
SMTNerd
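
For reference, the condition named in the error message can be checked independently of any toolkit. The sketch below is only an illustration, not the check MOSES itself performs. It assumes a hypothetical alignment file with one line per sentence pair, written as zero-based `i-j` points separated by spaces, plus two tokenized corpus files with one sentence per line. The function names, the file layout, and the order of the two coordinates are all assumptions made for this example.

```python
def out_of_range_points(alignment_line, len_first, len_second):
    """Return the (i, j) points outside 0..len_first-1 and 0..len_second-1.

    Assumption: points are zero-based "i-j" pairs separated by spaces,
    e.g. "0-0 1-2". This layout is hypothetical, chosen for illustration.
    """
    bad = []
    for token in alignment_line.split():
        i_text, j_text = token.split("-")
        i, j = int(i_text), int(j_text)
        if not (0 <= i < len_first and 0 <= j < len_second):
            bad.append((i, j))
    return bad


def check_files(path_first, path_second, path_alignment):
    """Yield (line_number, bad_points) for each line with out-of-range points.

    Token counts are taken by splitting each corpus line on whitespace,
    so both corpus files are assumed to be tokenized already.
    """
    with open(path_first, encoding="utf-8") as first, \
         open(path_second, encoding="utf-8") as second, \
         open(path_alignment, encoding="utf-8") as alignment:
        rows = zip(first, second, alignment)
        for line_number, (a, b, points) in enumerate(rows, start=1):
            bad = out_of_range_points(points, len(a.split()), len(b.split()))
            if bad:
                yield line_number, bad
```

With the ranges printed in the message above (0-31 and 0-25, that is, 32 and 26 tokens), `out_of_range_points("42-23 3-4", 32, 26)` returns `[(42, 23)]`. Running `check_files` over the two corpus files and the alignment file would list every line where the token counts and the alignment points disagree.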
