Hi, friends,
I encountered a bug when using the ‘-xml-input inclusive’ option with the
moses command-line tool. My copy of Moses is the latest, checked out from
the Subversion server on May 5th, 2008. IRSTLM is version 5.05.01, which is
also the latest. I compiled the programs with g++ 4.1.2 on Ubuntu.
Everything works if I run moses without the ‘-xml-input’ option, so I think
the crash is related to the XML option. Can someone provide some clues about
what the problem might be? Thanks.
Here is the gdb output (the input contains some Chinese characters):
===================
(gdb) r
Starting program: /home/victor/ir40_model/moses -f moses.ini -xml-input inclusive
Defined parameters (per moses.ini or switch):
config: moses.ini
distortion-file: 0-0 msd-bidirectional-fe 6 /home/victor/ir40_model/model/reordering-table.msd-bidirectional-fe.0.5.0-0
distortion-limit: 6
input-factors: 0
lmodel-file: 1 0 3 /home/victor/ir40_model/lang-model/europarl.blm.mm
mapping: 0 T 0
ttable-file: 0 0 5 /home/victor/ir40_model/model/phrase-table.0-0
ttable-limit: 20 0
weight-d: 0.2 0.2 0.2 0.2 0.2 0.2 0.2 0.2
weight-l: 0.5000
weight-t: 0.2 0.2 0.2 0.2 0.2
weight-w: -1
xml-input: inclusive
Loading lexical distortion models...
have 1 models
Creating lexical reordering...
weights: 0.200 0.200 0.200 0.200 0.200 0.200
binary file loaded, default OFF_T: -1
Created lexical orientation reordering
Start loading LanguageModel /home/victor/ir40_model/lang-model/europarl.blm.mm : [1.000] seconds
In LanguageModelIRST::Load: nGramOrder = 3
Loading LM file (no MAP)
blmt
loadbin()
mapping 86700 1-grams
mapping 1948935 2-grams
mapping 2070512 3-grams
done
OOV code is 86700
IRST: m_unknownId=86700
Set dictionary_upperbound to: 86702
Set logOOVpenalty to: 0.000
Finished loading LanguageModels : [1.000] seconds
Start loading PhraseTable /home/victor/ir40_model/model/phrase-table.0-0 : [1.000] seconds
using binary phrase tables for idx 0
reading bin ttable
size of OFF_T 8
binary phrasefile loaded, default OFF_T: -1
Finished loading phrase tables : [2.000] seconds
IO from STDOUT/STDIN
Created input-output object : [2.000] seconds
一 位 英国 广播 公司 的 喜剧 演员 ask 他 的 助手 <n english="to">to</n> 念
当天 的 新闻 .
Translating: 一 位 英国 广播 公司 的 喜剧 演员 ask 他 的 助手 to 念 当天 的
新闻 .
Program received signal SIGSEGV, Segmentation fault.
Word::GetString (this=0x91c9080, [EMAIL PROTECTED], endWithBlank=false)
at Word.cpp:75
75 strme << factor->GetString();
(gdb) backtrace
#0 Word::GetString (this=0x91c9080, [EMAIL PROTECTED],
endWithBlank=false) at Word.cpp:75
#1 0x08066343 in LexicalReorderingTableTree::MakeTableKey (this=0x810cc78,
[EMAIL PROTECTED], [EMAIL PROTECTED]) at LexicalReorderingTable.cpp:514
#2 0x08069151 in LexicalReorderingTableTree::GetScore (this=0x810cc78,
[EMAIL PROTECTED], [EMAIL PROTECTED], [EMAIL PROTECTED])
at LexicalReorderingTable.cpp:241
#3 0x0806291e in LexicalReordering::GetProb (this=0x810cc00, [EMAIL PROTECTED],
[EMAIL PROTECTED]) at LexicalReordering.cpp:248
#4 0x080cbf40 in TranslationOptionCollection::CacheLexReordering (
this=0x91c9140) at TranslationOptionCollection.cpp:583
#5 0x080cd662 in TranslationOptionCollection::CreateTranslationOptions (
this=0x91c9140, [EMAIL PROTECTED])
at TranslationOptionCollection.cpp:367
#6 0x080767a8 in Manager::ProcessSentence (this=0xbfac85dc) at
Manager.cpp:97
#7 0x0804d0e6 in main (argc=0, argv=0x10) at Main.cpp:138
===================
Liqi Gao
_______________________________________________
Moses-support mailing list
http://mailman.mit.edu/mailman/listinfo/moses-support