[Moses-support] Moses Server index.js 500 internal error

2016-03-29 Thread faiza shahid
Hi,
I'm very new to moses and am currently working on the moses web translation for 
fr-en. I have tested the moses server on the terminal and followed the steps 
from the moses website as much as possible but I'm getting errors on the 
error.log shown below when I try to translate a website:[Tue Mar 29 
13:37:31.560334 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("01") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.560484 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("02") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.560548 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("03") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.560606 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("04") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.560664 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("05") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.560831 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("06") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.560894 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("07") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.560949 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("08") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.561001 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("09") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.561049 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("10") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.561097 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("11") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.561146 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("12") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.561195 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("13") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.575431 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("14") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.575495 2016] [cgi:error] [pid 12589] [client 192.168.1.97:44205] 
AH01215: [Tue Mar 29 13:37:31 2016] translate.cgi: Useless use of a constant 
("15") in void context at /usr/lib/cgi-bin/moses-web/translate.cgi line 68., 
referer: http://192.168.1.121/cgi-bin/moses-web/?url=www.lemonde.fr[Tue Mar 29 
13:37:31.575545 2016] 

[Moses-support] mkcls fault

2016-03-29 Thread Tom Hoar
I think (M)GIZA++ expertise has faded over the years, but I'm hoping 
someone has some ideas about this.


A user start training with an extremely small TMX file with a few dozen 
of parallel segments. Our preparation tools reduced the parallel corpus 
to only 2 pairs (attached). The user was running the Windows version 
(mkcls.exe) but we verified the same error on Linux. The 
train-model.perl script failed in step 1 (log also attached). 
Specifically, the mkcls binary failed with this error message:


   Assertion failed!
   File: src/mkcls/StatVar.cpp, Line 110
   Expression: index>=0&I've never seen this error with a respectable corpus size. So, I did 
some tests.


 * test 1, copy same sentences 14,500 times, got assertion (failed).
 * test 2, copy each of the pairs only 7,250 times, got assertion (failed).
 * test 3, added 4,000 unique pairs, no assertion (success).
 * test 4, reduced to 2 original + 3 new pairs, no assertion (success).
 * test 5, reduced to 2 original + 2 new pairs, assertion returned
   (failed).

It seems this assertion is linked to lack of variety in the training 
corpus. Can anyone confirm this observation? Has anyone ever experienced 
this error?


If no one's seen this with a larger corpus, a terminal failure due to 
lack of variety is probably good. Would it be accurate if we add an 
error message to the effect, "terminal error due to insufficient variety 
in the training corpus"?


Thanks for any ideas/suggestions.
Tom


* STEP 1 **
COMMAND LINE:
C:\Strawberry\perl\bin\perl.exe -w 
C:\Users\tahoar\workbench\slate-toolkit\scripts\training\train-model.perl ^
 --root-dir 
C:\Users\tahoar\workbench\slate-desktop\var\TRAININGS\smt-tm-de_de-en_us-DANIEL 
^
 --e de_de ^
 --do-steps 1 ^
 --f en_us ^
 --mgiza ^
 --config 
C:\Users\tahoar\workbench\slate-desktop\var\TRAININGS\smt-tm-de_de-en_us-DANIEL\model.giza.grow-diag-final-and\moses.7.en_us-de_de.ini
 ^
 --lm 0:1:C:\placeholder.lm:9 ^
 --external-bin-dir C:\Users\tahoar\workbench\slate-toolkit\bin ^
 --max-phrase-length 7 ^
 --temp-dir 
c:\users\tahoar\appdata\local\temp\Slate_Desktop-7288\train-tm-0\trainer,train,210,1,train-tm\1
 ^
 --continue ^
 --reordering msd-bidirectional-fe ^
 --cores 7 ^
 --corpus 
C:\Users\tahoar\workbench\slate-desktop\var\TRAININGS\smt-tm-de_de-en_us-DANIEL\bitext
 ^
 --mgiza-cpus 7 ^
 --alignment grow-diag-final-and
Using SCRIPTS_ROOTDIR: C:\Users\tahoar\workbench\slate-toolkit\scripts
Using: $SPLIT_EXEC ="split.exe"
Using: $SORT_EXEC="sort.exe"
Using: $GZIP_EXEC="gzip.exe -q"
Using: $GUNZIP_EXEC="gzip.exe -q -d"
Using: $BZCAT_EXEC="bzcat.exe -q -d -c"
Using: $ZCAT_EXEC="gzip.exe -q -d -c"
Using: $CAT_EXEC="type"
Using DEBUG: 0
(1) preparing corpus @ Fri Mar 25 21:20:09 +0700 2016
(1.0) selecting factors @ Fri Mar 25 21:20:09 +0700 2016
(1.1) running mkcls @ Fri Mar 25 21:20:09 +0700 2016
Executing: C:\Users\tahoar\workbench\slate-toolkit\bin\mkcls.exe -c50 -n2 
-p"C:\Users\tahoar\workbench\slate-desktop\var\TRAININGS\smt-tm-de_de-en_us-DANIEL\bitext.en_us"
 
-V"C:\Users\tahoar\workbench\slate-desktop\var\TRAININGS\smt-tm-de_de-en_us-DANIEL\giza.de_de-en_us\en_us.vcb.classes"
 opt 1>&2

This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.
Assertion failed!

Program: C:\Users\tahoar\workbench\slate-toolkit\bin\mkcls.exe
File: src/mkcls/StatVar.cpp, Line 110

Expression: index>=0&beispiel : sonntag .
um sich anzumelden , wenden sie sich bitte an die örtliche vertretung der 
gesellschaft für verkehrssicherheit .
for example : sunday .
to register , contact your local branch of the road safety association .
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] english-japanese translation

2016-03-29 Thread Vito Mandorino
Thank you all for your comments and ressources. I will go through them and
let you know if I stumble upon something interesting during the process.

Vito Mandorino

2016-03-28 9:47 GMT+02:00 Graham Neubig :

> Hello Vishal,
>
> Yes, that's what pre-ordering means. Specifically it means re-ordering the
> source side.
>
> Graham
>
> On Mon, Mar 28, 2016 at 2:16 PM, Vishal Goyal(विशाल गोयल) <
> vishal@gmail.com> wrote:
>
>> Dear Graham,
>> Greetings.
>> Please clarify that Pre-Ordering in your reply means, that making the
>> word order of both Source Language Sentences and Target Language Sentences
>> similar in Source-Target Language Pair before going for training so that it
>> becomes similar to the scenario of closely related language Pair.
>>
>> On Sat, Mar 26, 2016 at 10:57 AM, Graham Neubig 
>> wrote:
>>
>>> Hi Vito,
>>>
>>> English-Japanese and Japanese-English translation are very difficult due
>>> to the grammatical differences between the languages.
>>>
>>> You have a couple options to overcome this problem:
>>> 1) If you want to use phrase-based Moses, you will have to perform some
>>> variety of pre-ordering, in which you rearrange the words in the source
>>> sentence before training/testing.
>>> 2) You can use a syntax-based system, either using the functionality in
>>> Moses (http://www.statmt.org/moses/?n=Moses.SyntaxTutorial), or using
>>> another decoder specifically designed for syntax-based MT such as my
>>> Travatar decoder (http://www.phontron.com/travatar/). I have released
>>> the setup for training our strongest Japanese-English and English-Japanese
>>> systems here: https://github.com/neubig/wat2014
>>>
>>> Regarding the different types of characters, I would leave them as-is.
>>> It is possible to perform normalization, which will help in a limited
>>> number of cases, but if you're just starting out this is really the least
>>> of your problems.
>>>
>>> Graham
>>>
>>>
>>> On Fri, Mar 25, 2016 at 7:51 PM, Vito Mandorino <
>>> vito.mandor...@linguacustodia.com> wrote:
>>>
 Dear all,

 does anyone have ever done experiments for English-Japanese and
 Japanese-English translation? Do you know about useful ressources for this
 language pair, or some specific gotchas one should be aware of?

 More specifically, what is the best policy for dealing with alphabets?
 Do you think it is a good idea to keep different alphabets (Kanji,
 Hiragana, Katakana, ...) in the corpus, or should one try to convert Kanji
 into one of the other alphabets?

 Best regards,

 Vito Mandorino

 --
 *M**. Vito MANDORINO -- Chief Scientist*


 [image: Description : Description : lingua_custodia_final full logo]

  *The Translation Trustee*

 *1, Place Charles de Gaulle, **78180 Montigny-le-Bretonneux*

 *Tel : +33 1 30 44 04 23   Mobile : +33 6 84 65 68 89
 <%2B33%206%2084%2065%2068%2089>*

 *Email :*  *vito.mandor...@linguacustodia.com
 *

 *Website :*
 *www.linguacustodia.finance *

 ___
 Moses-support mailing list
 Moses-support@mit.edu
 http://mailman.mit.edu/mailman/listinfo/moses-support


>>>
>>> ___
>>> Moses-support mailing list
>>> Moses-support@mit.edu
>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>
>>>
>>
>>
>> --
>> *Regards,*
>> Vishal Goyal,
>> Ph.D., M.Tech., MCA, M.C.S.D.
>> Associate Professor(Stage IV),
>> Department of Computer Science,
>> Punjabi University Patiala-147002.
>>
>>
>> *Machine Translation Systems:*
>> [*Online Hindi to Punjabi Machine Translation Tool -*
>> http://h2p.learnpunjabi.org ]
>> [*Statistical Approach Based Hindi to Punjabi Machine Translation
>> System *
>> - http://statmt.org/~vishal/hp/index.cgi
>> - http://tdil-dc.in/hi2pu/index.cgi
>> ]
>> *Online Journal: [Research Cell: An International Journal of Engineering
>> Sciences, http://ijoes.vidyapublications.com
>> ]*
>> *Book: A Simplified Approach to Data Structures, Shroff Publications and
>> Distributors*
>> http://www.shroffpublishers.com/detail.aspx?title=6163
>>
>
>


-- 
*M**. Vito MANDORINO -- Chief Scientist*


[image: Description : Description : lingua_custodia_final full logo]

 *The Translation Trustee*

*1, Place Charles de Gaulle, **78180 Montigny-le-Bretonneux*

*Tel : +33 1 30 44 04 23   Mobile : +33 6 84 65 68 89*

*Email :*  *vito.mandor...@linguacustodia.com
*

*Website :*
*www.linguacustodia.finance *
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support