my advise though - use Linux instead of MacOSX.
The scripts are written for Linux and rarely tested on Mac, you're on you
own if there's a problem
Hieu Hoang
http://statmt.org/hieu
On Thu, 16 Apr 2020 at 09:55, Hieu Hoang wrote:
> inmert/Jamfile, delete all the unit tests. if necessary,
inmert/Jamfile, delete all the unit tests. if necessary, delete all
unit tests in all Jamfiles you find
Hieu Hoang
http://statmt.org/hieu
On Thu, 16 Apr 2020 at 09:51, Bounyalit Taymany wrote:
> Excuse me when I use the command " find . -name "*Test.cpp" | xargs rm "
> and I try running
Try deleting all the unit test file before running the bjam command again.
You can delete the unit test files with the following command
find . -name "*Test.cpp" | xargs rm
Moses used to compile fine with macOS but macOS's compiler and gcc are
slowly diverging.
Hieu Hoang
Hi Zemichael,
Can you provide the build command that caused this error?
If you are using any arguments with the build command use full paths not
relative paths.
On Sat, Mar 2, 2019, 10:34 AM Zemichael Berihu wrote:
>
> ___
> Moses-support mailing
Hi Maria
* Looking for MT/NLP opportunities *
Hieu Hoang
http://moses-smt.org/
On 21 March 2017 at 14:23, Maria Braga wrote:
> Hi Hieu,
>
> I am Maria and work at Unbabel. We've met in Prague at the MT Marathon
> last year.
>
> I am currently working with the feature
https://github.com/moses-smt/mosesdecoder/archive/master.zip
On 09/15/16 11:20, Selva Nalladurai wrote:
> Hello guys,
>
> Please provide me with the link, where i can download the
> complete moses toolkit
>
>
>
>Regards,
>
add this to your moses.ini file
[n-best-list]
nbest.txt
4
distinct
or add these arguments when you run moses:
-n-best-list nbest.txt 4 distinct
On 18/07/2016 07:23, Selva Nalladurai wrote:
Hello guys,
I would like to know whether we can generate
e: Thu, 14 Jul 2016 13:02:57 +0530
From: Anoop (?) <anoop.kunchukut...@gmail.com>
<mailto:anoop.kunchukut...@gmail.com>
Subject: Re: [Moses-support] help regarding languages used for translation
To: Selva Nalladurai<selva...@gmail.com> <mailto:selva...@gmail
Well, the error means that the file doesn't exist in that directory. You'll
need to see if you can debug it from there.
On Fri, Mar 25, 2016 at 12:22 AM, Parul gupta wrote:
> Yes i'm in the same directory and file is existing in that directory.
>
> On Fri, Mar 25, 2016 at 2:41
Make sure you're in the correct directory. Make sure the file actually
exists in that directory.
On Thu, Mar 24, 2016 at 2:52 PM, Parul gupta wrote:
> I gave this command as follows:
>
> parul@ubuntu:~$ ~/mosesdecoder/bin/build_binary \
> >WMT-News.arpa.en \
> >
Hi,
this looks like a successful build.
It will not have the server functionality where moses runs as a daemon and
responds to tcp/ip requests.
-phi
On Tue, Mar 22, 2016 at 2:43 AM, Parul gupta wrote:
> warning: No toolsets are configured.
> warning: Configuring default
Parul,
You'll get better responses if you email the list instead of me. Please
start a new email thread to the mailing list about this topic.
Lane
On Mon, Mar 21, 2016 at 12:53 AM, Parul gupta wrote:
> This is the command i'm running
>
> echo ’is this an english sentence’ |
Can you show the exact command you're running? That script accepts data on
standard input, so if you're not giving anything on standard input, it
won't produce any output.
On Mon, Mar 14, 2016 at 11:40 PM, Parul gupta wrote:
> Sir i created scripts/share/nonbreaking_prefix.hi
Is that the exact command you're using? You shouldn't have backslashes
before the tildes.
Also, what is the error that you're getting?
On Thu, Mar 17, 2016 at 5:16 AM, Parul gupta wrote:
> ~/mosesdecoder/scripts/training/clean-corpus-n.perl
> \~/corpus/WMT-News.true en hi
Well in fact, it comes to how to handle xml mark up in text to translate
and obviously this is not trivial and no good solution.
If I am not mistaken this issue has been tackled in various circumstances :
- casmacat project / matecat_util project
- web translation in the contrib / web
Hi,
these are very small corpus sizes to train statistical machine
translation system. Typically, folks use at least 100 times
as many words.
You should be able to track down, why some translations
deteriorate - I would expect that coverage increases and
you would get better translations.
-phi
Hi Ulrich,
Sorry for sending the doubts to you directly. I will keep in mind to post
my future queries to moses-support.
Thanks a lot for the clarification. Let me play with the MERT.
Thanks and regards,
sandipan
On 24 October 2014 01:51, Ulrich Germann ulrich.germ...@gmail.com wrote:
Hi
Hi Sandipan,
first, please post Moses-related questions to moses-support@mit.edu, not
individual contributors.
second, the current seven features used by Mmsapt /
PhraseDictionaryBitextSampling are (for details, see my recent paper on
this phrase table implementation:
the decoder is waiting for the input. You can type your input in, or
provide the decoder with the input file to translate:
/moses -f moses.ini -i input-file
or
/moses -f moses.ini input-file
Don't forget - the input must have been tokenized and cased in the same way
as the training
hi maria
On 23 September 2014 10:49, Maria Marpaung maria_marpa...@yahoo.co.id
wrote:
Hi,
I want to ask.
I've Testing step.
Please tell me how make Set the path of the PhraseDictionary feature to
point to $HOME/working/binarised- model/phrase-table
In the moses.ini file, there is a line
Hi Mulu
Is it the same as this problem?
https://www.mail-archive.com/moses-support@mit.edu/msg10584.html
Note that you probably don't need to build Moses with srilm. By default,
Moses includes KenLM, which can handle ARPA files, as well as its own
compiled format for faster loading.
cheers
Hi Maria
first I would like to ask (for next time)
to write to the IRSTLM mailing list (user-irs...@list.fbk.eu)
to any problem related to IRSTLM only
Now I am going to reply you privately.
Nicola
On Jun 20, 2014, at 6:47 AM, Maria Marpaung wrote:
Hello, I need help.
I'm working on my
Hi,
GIZA++ is designed for parallel sentence pairs, but it is tolerant to
some degree of noise. If you feed comparable sentence pairs,
which I assume you mean to be sentence pairs which are only
partially (or not at all) translationally equivalent, then you will
hopefully get word alignments for
HI Marcos
For command-line Moses, it has to output the sentences in the same order
as they are input. So if, say, you have 4 threads and provide Moses with
a large file of input sentences, it will give 1 sentence to each of the
4 threads. But no output will be produced until sentence 1 is
Hi Marcos
I think the problem is that the rules (or phrase pairs) are now cached
on a per thread basis. This is good for command-line Moses as it uses a
pool of threads, and having per-thread caches means that there is no
locking on the caches, as there used to be.
Barry, I am not sure
Hi Marcos,
the command line tool behaviour you are describing is actually
consistent with what Barry said. You would see this lag before the first
couple of translations and later it would disappear. Usually you do not
use the command line tool for only 4 sentences, but for a couple of
Hi Marcos,
What you can do, if you want to rule out the compact data structures are
the issue, is load the tables at start-up into memory (if you have
enough memory):
That would be the following additional options to your ini-file:
[minphr-memory]
1
[minlexr-memory]
1
However your problem
Hi,
In my view, the threading design of the server is a bug. How about a
producer-consumer queue with multiple producers (a client connection)
and consumers (decoding threads). Each client connection owns a
producer-consumer queue as a return channel so that decoder threads can
return
Hi Marcos
I think the problem is that the rules (or phrase pairs) are now cached
on a per thread basis. This is good for command-line Moses as it uses a
pool of threads, and having per-thread caches means that there is no
locking on the caches, as there used to be.
mosesserver, afaik, creates
Hi
I agree that it's a bad design, but I'm not clear if it can be fixed
without changing to a different xmlrpc library
cheers - Barry
On 06/03/14 18:40, Kenneth Heafield wrote:
Hi,
In my view, the threading design of the server is a bug. How about a
producer-consumer queue with
Hi,
The issue is that the xmlrpc library launches a thread for each client
connection. Nothing prevents these threads from then contacting a
central pool of decoder threads.
Kenneth
On 03/06/14 11:09, Barry Haddow wrote:
Hi
I agree that it's a bad design, but I'm not clear if it
Hi Mohadeseh,
have you tried to
apt-get update # update the apt-get repository with the info in the
sources.list file
apt-get upgrade # upgrade the system with the packages available in the
apt repository
apt-get -f install # fix the installation, removing broken
Leila Tavakoli leila.tavakoliii@... writes:
Hi,
I need to tune my English-Persian development set, and i use this command:
root at
ubuntu:/home/msho/Desktop/Shafagh/moses-default/scripts/training#
./mert-moses.pl /mnt/hgfs/H/dev/en/en.txt /mnt/hgfs/H/dev/ref/
Can you send the error log when you run the Moses binary with this
moses.ini file, and include the moses.ini file? The command line would
look like this:
$ /home/msho/Desktop/Shafagh/moses-default/bin/moses -f
/mnt/hgfs/H/Train150/model/moses.ini
On 01/30/2014 07:11 PM, Leila Tavakoli
Hi,
Try recompiling without spaces in the directory name. If that fixes
it, at least we'll know where the issue is.
Kenneth
On 09/27/13 11:41, subhash ganeshan wrote:
Hello,
Recently while installing moses on my system Ubuntu12.04, I ran
into some installation issues.
I recommend you follow this guide to get started with Moses:
http://www.statmt.org/moses/?n=Moses.Baseline
Once you understand it, then you need to collect some arabic-english
data to create a system
On 07/07/2013 22:16, Khetam sh wrote:
Dear,
I am so interested in using Moses tool on
Hi Khetam,
Maybe you can refer this link:
http://www.statmt.org/moses/?n=Moses.Baseline
It is very helpful.
Best Regards,
LANG Jun
On Sun, Jul 7, 2013 at 2:19 AM, Khetam sh khetam.alsha...@hotmail.comwrote:
Dear,
I am so interested in using Moses tool on Arabic English
Hi,
The Vietnamese National Project on Vietnamese Language and Speech
Processing (VLSP) has been working to compile some resources and
fundamental tools, including 10-sentence bilingual English-Vietnamese
corpus. You might have to fill a simple form to get that corpus from them:
hi ha
if you want to translate english to vietnamese, you need an
english-vietnamese corpus. You can find here:
http://opus.lingfil.uu.se/
other people on the moses mailing list might have other data they can
give you.
You can also use pivoting methods and triangulation to translate
Please follow the directions on
http://www.statmt.org/moses/?n=Moses.MailingLists
There should be no space after the equals sign.
./bjam --with-irstlm=/home/nadhii/moses/irstlm/
Kenneth
On 04/17/13 10:02, Nadhii Dagaga wrote:
I have been trying to install moses on ubuntu 12.10 by following
Hi,
the NIST evaluation sets are distributed through the LDC.
The IWSLT evaluation sets are available if you register to participate
in their evaluation campaign - I am not sure if they are available
otherwise, but you should get in touch with the organizers.
-phi
On Mon, Apr 15, 2013 at 7:39
Hi,
please use full paths for your corpus files.
-phi
On Sat, Apr 13, 2013 at 1:44 PM, Jabesa Daba jappy...@gmail.com wrote:
when i have modified the path and run the following command:
~/irstlm/bin/add-start-end.sh ~/corpus/news-commentary-v7.fr-en.true.en
news-commentary-v7.fr-en.sb.en
Hi,
It looks like you're writing to news-commentary-v7.fr-en.lm.en without
a .gz then trying to read news-commentary-v7.fr-en.lm.en.gz . If that
doesn't fix it, subscribe to the IRSTLM mailing list at
https://list.fbk.eu/sympa/subscribe/user-irstlm and direct your message
there.
Hi,
to use Moses, you will need a very large amount of translated text for training,
in the order of millions of words. If you have that, please consult the Moses
web site on details on how to train a system.
-phi
On Sat, Mar 23, 2013 at 5:13 AM, Eleni Teshome tesh.el...@gmail.com wrote:
Dear
Hi, Hieu
Thank you for your quick reply :-).
But then, if the decoder can handle any number of non-terminal, it means I
still don't understand why the rules I am generating do not seem to be used
properly by the decoder. I think I will try to make a minimal example of
the problem I have and come
hi fabien
On 08/11/2012 01:25, Fabien Cromières wrote:
Dear Moses community,
First, thanks for providing the Research community with such a nice
open-source tool.
Second, I have some rather involved questions about the moses-chart
decoder for syntax-based MT. Hopefully, someone having a
Thanks,
I own you the second time, Hieu Hoang!
On Sun, Nov 4, 2012 at 3:20 PM, Hieu Hoang hieuho...@gmail.com wrote:
hi cuong
i think you're running out of memory when it's trying to run word
alignment.
You should try and verify this by running the word alignment step by
itself and see if
Hi all,
I re-check the problem again. It just happens when I run Moses on a large
training corpus.
When I run on a smaller training corpus, no problem is happened.
I don't think there is any problem of server since I run on a 24-cores and
130G RAM server.
Everything is absolutely done including
Hi Cuong
I think you have a | character in your data. You need to escape this before
training. You can use the script
*escape-special-chars.perl*
*To do this*
Hieu
Sent from my flying horse
On 29 Oct 2012, at 03:21 AM, Cuong Hoang hoangcuong2...@gmail.com wrote:
Hi all,
I re-check the
Hi Mohadeseh
You're using a very old version of Moses, so I'd suggest you update to
the latest github version.
As to the Giza error, does this answer your question?
http://comments.gmane.org/gmane.comp.nlp.moses.user/7231
cheers - Barry
cheers - Barry
On 12/08/12 09:32, Mohadeseh
Hi Mohadeseh
If you use the giza option 'nodumps=0' then you should also get the
model 1 alignments. Check the GIZA++ help message to be sure.
I meant that your Moses version was out-of-date, not your GIZA++ version,
cheers - Barry
On 13/08/12 11:19, Mohadeseh Sheikhoveisi wrote:
Hi Barry
Hi Jun
It's normal for the phrase table to contain a lot of noise, and
generally the decoder can cope with this since it looks at various
sources of information, ie different phrase scores, language model,
reordering model. Moses also applies a cutoff when it loads translation
options, so
, August 03, 2012 4:03 PM
To: Tan, Jun
Cc: moses-support@mit.edu
Subject: Re: [Moses-support] help with filtering the noise enties in
phrase-table.
Hi Jun
It's normal for the phrase table to contain a lot of noise, and generally the
decoder can cope with this since it looks at various sources
Hi Mohadeseh
I think the problem is that you added this:
-giza-option
m1=5,m2=0,m3=0,m4=0,mh=0,t1=5,nodumps=0
You are telling GIZA not to run the higher IBM models, yet
train-model.perl is looking for their output. Try removing the
giza-option and it should work,
cheers - Barry
On 02/08/12
All entries in a phrase table are meaningless. There are merely a
mappings with probabilities (et al) between groupings of tokens in your
your source target data. I suggest better corpus preparation and
management is a better way to manage the content in the tables rather
than directly editing
is welcome.
Thanks,
Jun
From: Tom Hoar [mailto:tah...@precisiontranslationtools.com]
Sent: Friday, August 03, 2012 10:08 AM
To: Tan, Jun
Cc: moses-support@mit.edu
Subject: Re: [Moses-support] help with filtering the noise enties in
phrase-table.
All entries in a phrase table are meaningless
Hi Shweta
It looks like you're using an out-of-date version of moses. Try downloading
the latest from github
https://github.com/moses-smt/mosesdecoder
cheers - Barry
On Monday 20 Feb 2012 17:39:39 shweta porwal wrote:
Hi im building an smt web translation system using your given guidelines
Hi,
how did you create the configuration file moses.ini in the
first place? DPR-file is not a standard parameter.
-phi
2011/12/6 李惠 lh9...@126.com:
Dear Mr./Ms.
When I was compling the Moses and tested the phrase-model, some problems
arose. I quoted it as follows.
Hi Hung
The Makefile, XCode project Visual Studio are up-to-date. The eclipse
build has not been maintained so I'm going to delete it. Unless someone
want to take charge of it.
Therefore, the only build you can use is the Makefile if you're using Linux.
On 05/09/2010 11:28, hungnv54
hi hung
you should take a look at
http://www.statmt.org/moses/?n=Moses.FeatureFunctions
Some things might be out of date but most of the information on there
looks correct.
A stateless feature function should be quite easy to implement, I would
try follow a previous example such as
Hello;
I'm really appreciate your help.
Thank you
Haithem
From: berto...@fbk.eu
To: aflihait...@hotmail.com; moses-support@mit.edu
Date: Wed, 10 Mar 2010 10:29:16 +0100
Subject: Re: [Moses-support] help for mert-moses-new.pl
Dear Afli,
mert-moses-new.pl performs a case
hi jeffrey
not sure if you found the source of the problem yet. I've re-ran our
regression tests and there doesn't seem to be any problems. You should
double check the format of the lattice input you're feeding into the
decoder.
to see an example of the format, look in the file
Thank you for the reply.
There isn't the empty in parallel corpus.
And, i didn't use |.
I didn't understand well. But, many thanks.
2008/11/4 Ondrej Bojar [EMAIL PROTECTED]
Dear Yuta,
my guess is that it's some basic issue with your parallel corpus. Something
has just got out of sync.
The
El jue, 18-09-2008 a las 02:30 +0800, Nirav escribió:
Hi,
I would like to know that how to align the two files one is having
Unicode characters ( Indian regional language) and one is having ascii
text ( English),
also is there any changes needed to train and evaluate the model.
It should
Hi,
Thanks for the reply. Problem is script is not roman for the indian regional
language..even the punctuation marks are different...
so how do moses align sentence when it does not know the sentence
terminator.
also moses has a step of lowercasing...there is no concept of lowercasing in
El jue, 18-09-2008 a las 02:44 +0800, Nirav escribió:
Hi,
Thanks for the reply. Problem is script is not roman for the indian
regional language..even the punctuation marks are different...
so how do moses align sentence when it does not know the sentence
terminator.
Again, iirc,
Hi,
you are specifying the language model like this:
lmodel-file: 0 0 3 working-dir/lm/tam.lm
It is better to have full paths, because you are apparently
running the decoder in the wrong directory. This is a entry
in moses.ini (but you knew that).
-phi
On Thu, May 8, 2008 at 9:56 AM, Jespa
67 matches
Mail list logo