thanks. I couldn't replicate it the slowdown when i ran my experiments. Maybe you should try Mike Ladwig's suggestion and use the compact phrase table.
If you can create a model that you can upload, maybe i can debug it On 26 September 2014 11:45, Mohammad Mahdi Mahsuli <[email protected]> wrote: > I > patched my server application with the new mosserver which has better > thread management but the problem has not been solved. Still, the cache is > not used when running moses with binary models. Unfortunately, the models > are too big for me to upload them. > > Here is some extra information: > > Server: SuperMicro CPU 2x Xeon 2260 RAM 96GB SSD 250GB > All models were OS-cached using cat models/* > /dev/null > Binary Phrases + Binary reordering ~= 3.5GB > Plain Phrases + Plain reordering ~= 30GB > > Results: > +--------------+----------+---------------+ > |Moses Version | Model | Loading Time | > +--------------+----------+---------------+ > | 0.9 | binary | ~45 sec | > | 2.1.1 | binary | ~20 sec | > | 2.1.1 | plain.gz | ~72 min | > | 2.1.1 | plain | ~60 min | > +--------------+----------+---------------+ > > +-------------+----------------+-----------------------+ > | | | TranslationTime (sec) | > | Token Count | Moses/Models +-----------------------+ > | | | 1st round | 2nd round | > +-------------+----------------+-----------------------+ > | 5 | 0.9 binary | 2.7 | 0.19 | > | 5 | 2.1.1 binary | 6.9 | 6.9 | > | 5 | 2.1.1 plain | 4.3 | 0.02 | > +-------------+----------------+-----------------------+ > | 80 | 0.9 binary | 8.0 | 2.2 | > | 80 | 2.1.1 binary | 19.051 | 18.9 | > | 80 | 2.1.1 plain | 7.2 | 0.2 | > +-------------+----------------+-----------------------+ > | 191 | 0.9 binary | 18.26 | 5.6 | > | 191 | 2.1.1 binary | 33.51 | 32.3 | > | 191 | 2.1.1 plain | 16.6 | 0.5 | > +-------------+----------------+-----------------------+ > > Even tried moses trunk ver 84ad576750 with no progress. For a sample of > 191 tokens using mosesserver we have : > > First round: > size of OFF_T 8 > binary phrasefile loaded, default OFF_T: -1 > binary file loaded, default OFF_T: -1 > Line 0: Collecting options took 12.1748 seconds at moses/Manager.cpp:110 > Line 0: Search took 2.48092 seconds > > Second round: > size of OFF_T 8 > binary phrasefile loaded, default OFF_T: -1 > binary file loaded, default OFF_T: -1 > Line 0: Collecting options took 11.9163 seconds at moses/Manager.cpp:110 > Line 0: Search took 2.48623 seconds > > Third round: > size of OFF_T 8 > binary phrasefile loaded, default OFF_T: -1 > binary file loaded, default OFF_T: -1 > Line 0: Collecting options took 11.7477 seconds at moses/Manager.cpp:110 > Line 0: Search took 2.46596 seconds > > > ===================================== > > The problem persists even using moses-cmd. Log Follows > > (please note the translation times): > > SMT-SM-1:/SMT/Service/Bin # ./moses -config > /SMT/Service/Models/en2fa/moses-tuned-bin.ini text > Defined parameters (via moses.ini or switch): > config: /SMT/Service/Models/en2fa/moses-tuned-bin.ini text > distortion-limit: 6 > feature: UnknownWordPenalty WordPenalty PhrasePenalty > PhraseDictionaryBinary name=TranslationModel0 num-features=4 > path=/SMT/Service/Models/en2fa/phrase-table input-factor=0 output-factor=0 > table-limit=14 LexicalReordering name=LexicalReordering0 num-features=6 > type=wbe-msd-bidirectional-fe-allff input-factor=0 output-factor=0 > path=/SMT/Service/Models/en2fa/reordering-table Distortion KENLM name=LM0 > factor=0 path=/SMT/Service/Models/en2fa/Parallel.fa.arpa order=4 > input-factors: 0 > mapping: 0 T 0 > weight: LexicalReordering0= 0.073298 0.0596145 0.115321 0.066936 > 0.0283095 0.0565263 Distortion0= 0.0325491 LM0= 0.0620824 WordPenalty0= > -0.248997 PhrasePenalty0= 0.0700501 TranslationModel0= 0.00878866 0.0570903 > 0.0778723 0.042565 UnknownWordPenalty0= 1 > . > line=UnknownWordPenalty > FeatureFunction: UnknownWordPenalty0 start: 0 end: 0 > line=WordPenalty > FeatureFunction: WordPenalty0 start: 1 end: 1 > line=PhrasePenalty > FeatureFunction: PhrasePenalty0 start: 2 end: 2 > line=PhraseDictionaryBinary name=TranslationModel0 num-features=4 > path=/SMT/Service/Models/en2fa/phrase-table input-factor=0 output-factor=0 > table-limit=14 > FeatureFunction: TranslationModel0 start: 3 end: 6 > line=LexicalReordering name=LexicalReordering0 num-features=6 > type=wbe-msd-bidirectional-fe-allff input-factor=0 output-factor=0 > path=/SMT/Service/Models/en2fa/reordering-table > FeatureFunction: LexicalReordering0 start: 7 end: 12 > Initializing LexicalReordering.. > line=Distortion > FeatureFunction: Distortion0 start: 13 end: 13 > line=KENLM name=LM0 factor=0 > path=/SMT/Service/Models/en2fa/Parallel.fa.arpa order=4 > FeatureFunction: LM0 start: 14 end: 14 > Loading the LM will be faster if you build a binary file. > Reading /SMT/Service/Models/en2fa/Parallel.fa.arpa > > ----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100 > > **************************************************************************************************** > Loading UnknownWordPenalty0 > Loading WordPenalty0 > Loading PhrasePenalty0 > Loading LexicalReordering0 > binary file loaded, default OFF_T: -1 > Loading Distortion0 > Loading LM0 > Loading TranslationModel0 > IO from STDOUT/STDIN > Created input-output object : [12.000] seconds > His address came amid numerous international crises in which the United > States finds itself at the forefront, including the fight against ISIS in > Syria and Iraq, lingering tensions in Ukraine and the spread of Ebola in > Africa, which the U.S. Centers for Disease Control and Prevention this week > said could infect as many as 1.4 million people by early next year. > Translating line 0 in thread id 139648371779328 > Translating: His address came amid numerous international crises in which > the United States finds itself at the forefront, including the fight > against ISIS in Syria and Iraq, lingering tensions in Ukraine and the > spread of Ebola in Africa, which the U.S. Centers for Disease Control and > Prevention this week said could infect as many as 1.4 million people by > early next year. > reading bin ttable > size of OFF_T 8 > binary phrasefile loaded, default OFF_T: -1 > binary file loaded, default OFF_T: -1 > Line 0: Collecting options took 5.000 seconds > Line 0: Search took 2.000 seconds > آدرس او در میان بسیاری از بحران های بینالمللی که در آن ایالات متحده آمریکا > خود را در این forefront, از جمله نبرد در سوریه و در برابر تمجید Iraq, > تنشهای ادامه دار در اوکراین و گسترش ابولا Africa, که ایالات متحده در مراکز > کنترل و پیشگیری از بیماریها این هفته گفت که میتواند باعث آلوده شدن به 1.4 > میلیون نفر توسط year. آینده > BEST TRANSLATION: آدرس او در میان بسیاری از بحران های بینالمللی که در آن > ایالات متحده آمریکا خود را در این forefront,|UNK|UNK|UNK از جمله نبرد در > سوریه و در برابر تمجید Iraq,|UNK|UNK|UNK تنشهای ادامه دار در اوکراین و > گسترش ابولا Africa,|UNK|UNK|UNK که ایالات متحده در مراکز کنترل و پیشگیری از > بیماریها این هفته گفت که میتواند باعث آلوده شدن به 1.4 میلیون نفر توسط > year.|UNK|UNK|UNK آینده > [1111111111111111111111111111111111111111111111111111111111111] > [total=-421.245] > core=(-400.000,-64.000,33.000,-82.639,-155.509,-38.502,-90.596,-19.212,-0.111,-1.429,-16.220,0.000,-4.306,-25.000,-311.065) > > ==================> Line 0: Translation took 8.000 seconds total > His address came amid numerous international crises in which the United > States finds itself at the forefront, including the fight against ISIS in > Syria and Iraq, lingering tensions in Ukraine and the spread of Ebola in > Africa, which the U.S. Centers for Disease Control and Prevention this week > said could infect as many as 1.4 million people by early next year. > Translating line 1 in thread id 139648371779328 > Translating: His address came amid numerous international crises in which > the United States finds itself at the forefront, including the fight > against ISIS in Syria and Iraq, lingering tensions in Ukraine and the > spread of Ebola in Africa, which the U.S. Centers for Disease Control and > Prevention this week said could infect as many as 1.4 million people by > early next year. > reading bin ttable > size of OFF_T 8 > binary phrasefile loaded, default OFF_T: -1 > Line 1: Collecting options took 0.000 seconds > Line 1: Search took 1.000 seconds > آدرس او در میان بسیاری از بحران های بینالمللی که در آن ایالات متحده آمریکا > خود را در این forefront, از جمله نبرد در سوریه و در برابر تمجید Iraq, > تنشهای ادامه دار در اوکراین و گسترش ابولا Africa, که ایالات متحده در مراکز > کنترل و پیشگیری از بیماریها این هفته گفت که میتواند باعث آلوده شدن به 1.4 > میلیون نفر توسط year. آینده > BEST TRANSLATION: آدرس او در میان بسیاری از بحران های بینالمللی که در آن > ایالات متحده آمریکا خود را در این forefront,|UNK|UNK|UNK از جمله نبرد در > سوریه و در برابر تمجید Iraq,|UNK|UNK|UNK تنشهای ادامه دار در اوکراین و > گسترش ابولا Africa,|UNK|UNK|UNK که ایالات متحده در مراکز کنترل و پیشگیری از > بیماریها این هفته گفت که میتواند باعث آلوده شدن به 1.4 میلیون نفر توسط > year.|UNK|UNK|UNK آینده > [1111111111111111111111111111111111111111111111111111111111111] > [total=-421.245] > core=(-400.000,-64.000,33.000,-82.639,-155.509,-38.502,-90.596,-19.212,-0.111,-1.429,-16.220,0.000,-4.306,-25.000,-311.065) > > ==================> Line 1: Translation took 3.000 seconds total > His address came amid numerous international crises in which the United > States finds itself at the forefront, including the fight against ISIS in > Syria and Iraq, lingering tensions in Ukraine and the spread of Ebola in > Africa, which the U.S. Centers for Disease Control and Prevention this week > said could infect as many as 1.4 million people by early next year. > Translating line 2 in thread id 139648371779328 > Translating: His address came amid numerous international crises in which > the United States finds itself at the forefront, including the fight > against ISIS in Syria and Iraq, lingering tensions in Ukraine and the > spread of Ebola in Africa, which the U.S. Centers for Disease Control and > Prevention this week said could infect as many as 1.4 million people by > early next year. > reading bin ttable > size of OFF_T 8 > binary phrasefile loaded, default OFF_T: -1 > Line 2: Collecting options took 0.000 seconds > Line 2: Search took 2.000 seconds > آدرس او در میان بسیاری از بحران های بینالمللی که در آن ایالات متحده آمریکا > خود را در این forefront, از جمله نبرد در سوریه و در برابر تمجید Iraq, > تنشهای ادامه دار در اوکراین و گسترش ابولا Africa, که ایالات متحده در مراکز > کنترل و پیشگیری از بیماریها این هفته گفت که میتواند باعث آلوده شدن به 1.4 > میلیون نفر توسط year. آینده > BEST TRANSLATION: آدرس او در میان بسیاری از بحران های بینالمللی که در آن > ایالات متحده آمریکا خود را در این forefront,|UNK|UNK|UNK از جمله نبرد در > سوریه و در برابر تمجید Iraq,|UNK|UNK|UNK تنشهای ادامه دار در اوکراین و > گسترش ابولا Africa,|UNK|UNK|UNK که ایالات متحده در مراکز کنترل و پیشگیری از > بیماریها این هفته گفت که میتواند باعث آلوده شدن به 1.4 میلیون نفر توسط > year.|UNK|UNK|UNK آینده > [1111111111111111111111111111111111111111111111111111111111111] > [total=-421.245] > core=(-400.000,-64.000,33.000,-82.639,-155.509,-38.502,-90.596,-19.212,-0.111,-1.429,-16.220,0.000,-4.306,-25.000,-311.065) > > ==================> Line 2: Translation took 4.000 seconds total > His address came amid numerous international crises in which the United > States finds itself at the forefront, including the fight against ISIS in > Syria and Iraq, lingering tensions in Ukraine and the spread of Ebola in > Africa, which the U.S. Centers for Disease Control and Prevention this week > said could infect as many as 1.4 million people by early next year. > Translating line 3 in thread id 139648371779328 > Translating: His address came amid numerous international crises in which > the United States finds itself at the forefront, including the fight > against ISIS in Syria and Iraq, lingering tensions in Ukraine and the > spread of Ebola in Africa, which the U.S. Centers for Disease Control and > Prevention this week said could infect as many as 1.4 million people by > early next year. > reading bin ttable > size of OFF_T 8 > binary phrasefile loaded, default OFF_T: -1 > Line 3: Collecting options took 0.000 seconds > Line 3: Search took 2.000 seconds > آدرس او در میان بسیاری از بحران های بینالمللی که در آن ایالات متحده آمریکا > خود را در این forefront, از جمله نبرد در سوریه و در برابر تمجید Iraq, > تنشهای ادامه دار در اوکراین و گسترش ابولا Africa, که ایالات متحده در مراکز > کنترل و پیشگیری از بیماریها این هفته گفت که میتواند باعث آلوده شدن به 1.4 > میلیون نفر توسط year. آینده > BEST TRANSLATION: آدرس او در میان بسیاری از بحران های بینالمللی که در آن > ایالات متحده آمریکا خود را در این forefront,|UNK|UNK|UNK از جمله نبرد در > سوریه و در برابر تمجید Iraq,|UNK|UNK|UNK تنشهای ادامه دار در اوکراین و > گسترش ابولا Africa,|UNK|UNK|UNK که ایالات متحده در مراکز کنترل و پیشگیری از > بیماریها این هفته گفت که میتواند باعث آلوده شدن به 1.4 میلیون نفر توسط > year.|UNK|UNK|UNK آینده > [1111111111111111111111111111111111111111111111111111111111111] > [total=-421.245] > core=(-400.000,-64.000,33.000,-82.639,-155.509,-38.502,-90.596,-19.212,-0.111,-1.429,-16.220,0.000,-4.306,-25.000,-311.065) > > ==================> Line 3: Translation took 3.000 seconds total > > Best regards, > M.M. Mahsuli > there was a problem with moses server, it was fixed a few weeks ago, git > pull from the github to get the fix. > > Does the slowdown happen with the command line moses? In my tests, there > wasn't too much of a slowdown. > http://www.statmt.org/moses/RELEASE-2.1/Mosesv2.1.1releasenotes.pdf > If you do see a decrease in performance and can share your models, I might > be able to debug it > > On 25 September 2014 16:00, Mohammad Mahdi Mahsuli <[email protected]> > wrote: > >> Thanks for your reply. Below you can find the required information: >> >> >> Moses was compiled with following command: >> >> ./bjam --with-xmlrpc-c=$HOME/local/ --with-cmph=$HOME/local/ >> --with-tcmalloc --install-scripts=$HOME/local/moses --enable-boost-pool >> -j24 >> >> moses.ini: >> >> ######################### >> ### MOSES CONFIG FILE ### >> ######################### >> >> # input factors >> [input-factors] >> 0 >> >> # mapping steps >> [mapping] >> 0 T 0 >> >> [distortion-limit] >> 6 >> >> >> # feature functions >> [feature] >> UnknownWordPenalty >> WordPenalty >> PhrasePenalty >> PhraseDictionaryBinary name=TranslationModel0 num-features=4 >> path=/SMT/Service/Models/en2fa/phrase-table input-factor=0 output-factor=0 >> table-limit=14 >> LexicalReordering name=LexicalReordering0 num-features=6 >> type=wbe-msd-bidirectional-fe-allff input-factor=0 output-factor=0 >> path=/SMT/Service/Models/en2fa/reordering-table >> Distortion >> KENLM name=LM0 factor=0 path=/SMT/Service/Models/en2fa/Parallel.fa.arpa >> order=4 >> >> # core weights >> [weight] >> LexicalReordering0= 0.073298 0.0596145 0.115321 0.066936 0.0283095 >> 0.0565263 >> Distortion0= 0.0325491 >> LM0= 0.0620824 >> WordPenalty0= -0.248997 >> PhrasePenalty0= 0.0700501 >> TranslationModel0= 0.00878866 0.0570903 0.0778723 0.042565 >> UnknownWordPenalty0= 1 >> >> ==================================================================== >> >> I ran mosesserver with the following command: >> >> mosesserver -config /SMT/Service/Models/en2fa/moses-tuned-bin.ini \ >> --server-port 9393 \ >> --server-log ./server.log \ >> -s 100 -b .5 >> >> Thanks in advance >> >> On Thu, Sep 25, 2014 at 5:15 PM, Hieu Hoang <[email protected]> wrote: >> >>> what is the exact command you execute? How many threads are you running >>> with? what are the exact times for x,y, z? How many sentences were in the >>> input? >>> >>> Can I please have a look at your moses.ini files. >>> >>> >>> On 25/09/14 11:48, Mohammad Mahdi Mahsuli wrote: >>> >>> Hi, >>> >>> I was using Moses v0.9 for a long time, and I recently decided to opt >>> to use the latest version (v2.1.1). In the test time, when loading the >>> plain models into Moses, there is no difficulty. But when I want to use the >>> binarized models, the translation time is much more than when I used the >>> binary models on the old version of Moses. I have performed the following >>> experiments. >>> >>> model | decoder | time spent >>> ------------+----------------+---------------- >>> old | old | x >>> new | new | y>x >>> old | new | z>x >>> new | old | N/A >>> >>> >>> "Model" means if I have trained the binary model using the old version >>> of Moses or the new one. "Decoder" is the Moses version which I have used >>> for translating the test sentences. >>> >>> I should also note that I have cached the binary models in OS memory >>> using the cat command. No other memory-intensive process was running on the >>> machine. However, I noticed that the translation time when translating a >>> sentence for the second time does not reduce. Therefore, I doubt that the >>> translations are being cached. Is there any argument I should consider for >>> this purpose? >>> >>> Best Regards, >>> M.M. Mahsuli >>> >>> >>> _______________________________________________ >>> Moses-support mailing >>> [email protected]http://mailman.mit.edu/mailman/listinfo/moses-support >>> >>> >>> >> > > > -- > Hieu Hoang > Research Associate > University of Edinburgh > http://www.hoang.co.uk/hieu > > > -- Hieu Hoang Research Associate University of Edinburgh http://www.hoang.co.uk/hieu
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
