the echo in dos & unix works slightly differently.

dos: echo "c"
"c"

unix:echo "c"
c

the " char is being appended to the 1st word. oh the joys of command 
line....

Ivan Uemlianin wrote:
> I am not sending text to moses from a text file, I am using the 
> command-line:
>
> m.bat contains:
>
> echo %1  |  c:\\cygwin\\path\\to\\moses.exe  -f 
> c:\\cygwin\\path\\to\\moses.ini   2>  msc_tywyddTeletestun.err
>
> usage
>
> > m.bat "bydd y bore 'n oer ."
> "bydd the morning will be cold ."
>
>
>
> Philipp Koehn wrote:
>> Hi,
>>
>> I have seen text files under windows that add a starting byte to
>> indicate the encoding of the file. Sine the first word is a problem,
>> this may be the cause.
>>
>> -phi
>>
>> On Thu, Nov 26, 2009 at 2:34 PM, Ivan Uemlianin
>> <[email protected]> wrote:
>>> Hieu
>>>
>>> Thanks for your comment.
>>>
>>> How can this be a line-ending issue?  Where are line-endings involved?
>>>
>>> What is appending an extra character to the first word and why?
>>>
>>> The 1st input word *is* being recognised and translated (as I said, the
>>> translations under dos are correct) --- "bydd" translates to "will be".
>>>
>>> I'm using identical material under cygwin and dos, the only difference
>>> is under cygwin I'm using a shell script and under dos I'm using a
>>> ".bat" file.  If it is a line-ending issue why is it affecting dos and
>>> not cygwin?
>>>
>>>
>>> Hieu Hoang wrote:
>>>> hi ivan
>>>>
>>>> i think this might be a problem with line ending again. The 
>>>> non-printing
>>>> 0x13 character is being appended to the 1st input word which causes it
>>>> to be unrecognised so it is outputted ad-verbatim. Cygwin properly has
>>>> internal code which strips out this character
>>>>
>>>> make sure you convert all text files to unix line endings using
>>>>   dos2unix
>>>>
>>>> Ivan Uemlianin wrote:
>>>>> Dear All
>>>>>
>>>>> Running the moses decoder on cygwin and dos gives slightly different
>>>>> results, even though I'm using the same executable and the same 
>>>>> models.
>>>>>
>>>>> For example, translating from Welsh to English:
>>>>>
>>>>>      Welsh:   bydd y bore 'n oer .
>>>>>      English: the morning will be cold .
>>>>>
>>>>>      mo...@cygwin: morning will be cold .
>>>>>      mo...@dos:    bydd the morning will be cold .
>>>>>
>>>>> The main problem is that on dos, moses is always returning the first
>>>>> word of the source language, prepended to the translation itself.
>>>>> Easy to strip off but annoying.  The translation itself is often
>>>>> slightly better on dos than on cygwin, as above (which is if anything
>>>>> even stranger).
>>>>>
>>>>> Can anyone account for this strange behaviour?  More important, how
>>>>> can I stop the first word of source language returning?
>>>>>
>>>>> Thanks and best wishes
>>>>>
>>>>> Ivan
>>>>>
>>>>>
>>>>>
>>>
>>> -- 
>>> ********************************
>>> Ivan Uemlianin
>>>
>>> Canolfan Bedwyr
>>> Safle'r Normal Site
>>> Prifysgol Bangor University
>>> BANGOR
>>> Gwynedd
>>> LL57 2PZ
>>>
>>> [email protected]
>>> ********************************
>>> _______________________________________________
>>> Moses-support mailing list
>>> [email protected]
>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>
>
>
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to