Dear Deca,
Are you sure that the irstlm library is saved in
/home/ced/moses/irstlm/lib/i486-pc-linux-gnu
Could you please send me the log file you get when compile irstlm?
best
Nicola
From: [EMAIL PROTECTED] [EMAIL PROTECTED] On Behalf Of Sandy [EMAIL
Dear Jespa
the right call of the configure script is the following
./configure --with-srilm=/root/jespa/project/language_model/srilm
Nicola
From: [EMAIL PROTECTED] [EMAIL PROTECTED] On Behalf Of Jespa [EMAIL PROTECTED]
Sent: Tuesday, February 19, 2008
Hi Linh,
MERT works fine with confusion networks.
I never uset the lattice as input for Moses.
You can transform lattices into confusion network
using the lattice-tool software distributed together with the SRILM-Toolkit.
Pay attention that you formats for confusion networks
used by lattice tool
moses-parallel.pl and mert-moses.pl were changed .
Now they works well with lattice inputs, too.
Notice that you do NOT need to specify
-decoder-flags -inputtype 2
the parameter
--inputtype 2
of mert-moses.pl is passed to the decoder automatically.
best,
Nicola
$fn.gz ! -s $fn.binphr.tgtdata) {
(or something similar) would fix that, I think.
Best,
Christian
On Thu, 28 Feb 2008, Nicola Bertoldi wrote:
moses-parallel.pl and mert-moses.pl were changed .
Now they works well with lattice inputs, too.
Notice that you do NOT need to specify
-decoder-flags
I already found the problem.
I will fix by tomorrow morning
Best
Nicola
From: [EMAIL PROTECTED] [EMAIL PROTECTED] On Behalf Of Severin Hacker [EMAIL
PROTECTED]
Sent: Monday, April 07, 2008 1:42 AM
To: moses-support@mit.edu
Subject: [Moses-support] Moses
Dear Vineet
one more thing about parallelization
1) with moses-parallel.pl you can produce in parallel
the translations of the many input sentences
and the corresponding nbest, as explained by Marcello.
At the moment you can NOT generate with parallelization
search and
Dear Jason,
actually no one did this enhancement of cmert
at least to my knowledge.
I also thoguht to modify the MERT part o Moses
to make it more flexible and efficient,
but unfortunately I never had time.
This is a possible project to work on during
the Second MT Marathon (next week in Berlin)
Dear Vineet ,
Probably you already check it,
but my impression is that there is a mismatch in the order of reference.
best
Nicola
On Sep 18, 2008, at 9:59 AM, Adam Lopez wrote:
I would just like to know if there is a significant difference
when scoring translations using multi-bleu.
With
The new MERT implementation is independent from the decoder.
You can use it without updating Moses
Unfortunately new implementation has not been tested on multiple
factors yet,
but hopefully it works fine.
Nicola
On Nov 13, 2008, at 1:09 PM, Miguel José Hernández Vidal wrote:
Hi Jun,
Is
The script implements the BELU score as described in the official
paper by Papineni
Papineni, Kishore, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2001.
Bleu: a method for
automatic evaluation of machine translation. Research Report RC22176,
IBM Research
Division, Thomas J. Watson Research
Hi Lane
this is what Philip answered some time ago about your question.
best
Nicola
Hi,
there are two choices, when a phrase pair is used
(a) it is scored by both phrase tables, and needs to exist in both
phrase tables
(b) it is taken from either of the two phrase tables (and scored only
No,
-nscores 5is a parameter of processPhraseTable
and refers to the number of scores (features)
associated to each phrase pair in the phrase table.
The standard setting of Moses training provides 5 scores.
Check the textual format of the phrase table
to know how many scores are in it.
I am answering him privately.
Nicola
On 5/12/09 3:26 PM, Fethi Bougares fethi.bouga...@imag.fr wrote:
Hello,
I am Working on Arabic To English Statistical Machine Translation.
AS you now Arabic is a rich morphological language.
So i use confusion network (as input type) to combine
many
Waleed,
if you are interested in a word-to-word alignment (instead of phrase-to-phrase
alignment)
you could give a look to this feature of Moses
http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc6
Pay attention that you have to re-train your phrase tables including the
word-to-word
in the sample from
http://www.statmt.org/moses/download/sample-models.tgz
the phrase table contains only one feature score
while when you train your own phrase table using the Moses training script
you produce a PT with 5 feature scores, becuase this is the default of Moses.
if you want to use
Dear Laurentia,
ss you can read somewhere in the moses manual
-the parameter -f tells you which is the source language
-the parameter -e tells you which is the target language
The names -f and -e do not relate with French and English specifically, but
to generic languages
assuming that your
Dear Zahurul
the newest release of IRSTLM (5.40.01) should solve your problem which is
probably related to the size.
Please download from here:
http://hlt.fbk.eu/en/irstlm
There is an official mailing list for IRSTLM, you can join from here
https://list.fbk.eu/sympa/subscribe/user-irstlm
The
Crhistof is right
the LM is used only to create a formally correct configuration file.
You can simply set any NON EMPTY file, to complete the training successfully.
Of course you have to modify the configfile with your good LM before translating
Nicola
hi Tom
I supposed there is a mismatch with the last working IRSTLM release
and your.
Where did you get yours?
Nicola
On Jun 9, 2010, at 6:36 AM, supp...@precisiontranslationtools.com
wrote:
Problem solved... I spoke too early.
Creating the symlink, $SRILM/bin/i686, enabled me to run
the usual way of using LMs with factors is to train separate LMs for
each factor
and specify which factor each LM refers to in the configuration file.
Look at the manual to know how to modify the config file.
best regards,
Nicola
On Jul 6, 2010, at 5:14 AM, yyjj...@163.com wrote:
hello
the empty line after each ngram-block is not mandatory in the ARPA
format
(see http://www.speech.sri.com/projects/srilm/manpages/ngram-format.
5.html)
and IRSTLM does not produce it.
best regards,
Nicola Bertoldi
On Oct 26, 2010, at 9:42 AM, supp...@precisiontranslationtools.com
supp
Dear Charles,
probably you are using an old release of IRSTLM
please download the new one (5.50.01) from FBK website
http://hlt.fbk.eu/en/irstlm
as suggested during the compilation of Moses
Hope this solve your problem
bet regards,
Nicola Bertoldi
On Nov 2, 2010, at 6:10 AM, Charles Chiu
Dear Charles,
just to know
which compiler are you using? which Operating system?
Nicola
On Nov 3, 2010, at 8:47 AM, Nicola Bertoldi wrote:
Dear Charles,
as mentioned by Tom,
we already discovered this bug which occurs only with some
compilers (probably from version 4.3) and/or some
from revision r3398 (2010-08-10)
mert-moses-new.pl is renamed to mert-moses.pl
and it is no more updated and maintained.
so please use mert-moses.pl
Nevertheless, I do not think this solve your pb
best
Nicola
On Nov 27, 2010, at 3:59 AM, Lane Schwartz wrote:
In the 2010-08-13 release, I
Dear Cyrine,
Consider these two possibilities:
- set the weights of these two features to 0.0 in the configuration
file (or on the command line)
- create an other phrase table with the three (interesting) values
and cahnge the configuration file accordingly
best
Nicola
On Dec 29, 2010, at
) recompile GIZA
3) call the Moses training script train-model.perl adding the following
parameter
--giza-extension UA3.final
This works for giza-pp-v1.0.2.tar.gz, but similar changes can be done for other
versions.
best regards,
Nicola Bertoldi
On Mar 17, 2011, at 5:17 PM, Barry Haddow wrote:
Hi
Dear Arda
This combination compiles
MOSES3878
IRSTLM 5.60.01
By the way,
we have just upload on the official IRSTM website
http://hlt.fbk.eu/en/irstlm
a new version 5.60.02, which the latest Moses revisions (should) compile with.
best regards,
Nicola
On Mar 31, 2011, at 3:07
(Apologies if you receive multiple copies. Please, distribute it among
potentially interested colleagues.)
CALL FOR PAPERS:
OPEN SOURCE TOOLS FOR MACHINE TRANSLATION
The Machine Translation Marathon 2011 is the sixth in a series of events
promoted by EuroMatrix and EuroMatrixPlus, which are
the problem was due to missing dependencies and
occurred only with parallel compilation (make -jN).
the revision 409 solves the problem.
best regards,
Nicola Bertoldi
On May 24, 2011, at 4:09 AM, Tom Hoar wrote:
We're updating DoMY with the newest Moses components. Our IRLSTM installation
It seems that your parallel corpus contains a line long 999 words (at least in
target)
So I suspect that something in the cleaning with clean-corpus-n.perl failed
As you know, if you use 40 as maximum sentence length you should get
only sentence shorter than (or equal to) 40.
Please check this
(Apologies if you receive multiple copies)
Please, distribute it among potentially interested colleagues.
CALL FOR PARTICIPATION
The Machine Translation Marathon 2011 is the sixth in a series of events
promoted by EuroMatrix and EuroMatrixPlus, which are EU research projects on
Machine
actually computes the symmetrized alignment according to a selected
policy specified through its parameters.
I hope my explanation is clear,
best regards,
nicola bertoldi
On Aug 29, 2011, at 8:14 AM, 蒋乾 wrote:
Hi all,
There is a command
za2bal.plhttp://za2bal.pl/ -d path-to-updated-tgt-to-src
Dear Grahan,
have you specified the parameter
weight-i
in the configuration file?
Nicola
On Aug 29, 2011, at 5:35 PM, Graham Neubig wrote:
Hi,
I am trying to run mert-moses.pl on lattice input (using the latest
version from SVN) and am getting the following error after the first
iteration
Use the full path for the input file and for the output file
If this does not work,
please run
sh -x ./build-lm.sh -i Texte_EN.txt -n 3 -o lm -k 3 --verbose
collect both STDERR and STDOUT
and send all to me together with your input file (if not too big)
best regards,
Nicola Bertoldi
On Sep
Hi all,
do you know if it is possible (and how) the conversion of a phrase table (or
reordering table)
from its binary format to the textual format?
Nicola
___
Moses-support mailing list
Moses-support@mit.edu
Dear Jehan,
IRSTLM is able to load the ARPA-format language model
In order to constrain Moses to use IRSTLM toolokit (instead of SRILM or
others),
change the first field (1 for IRSTLM or 0 for SRILM)in the configuration
file
best regards,
Nicola Bertoldi
On Sep 26, 2011, at 10:32 AM
The actual error message is a bit above in the log
please send me the full log
which version/revision of Moses and IRSTLM are you using?
Nicola
On Sep 26, 2011, at 11:49 AM, itua ijagbone wrote:
Hi, i want to compile moses , when i get to ./configure
Hi Neda
There is also a parameter of mert-moses.pl --predictable-seed(see
the help)
which makes MERT deterministic.
In general, MERT procedure relies on an initial seedseed_t at each
iteration t for the creation of the (20 by default) random starting points
for the
Hi Hieu
On Dec 1, 2011, at 8:34 AM, Hieu Hoang wrote:
hi all
can anyone tell me if creating LM with the IRST toolkit is integrated into
the EMS yet?
I let anyone else to answer this point.
if not, is this the entirety of what has to be run?
cat $CORPUSFILE |
Dear Sylvain,
I am starting to answer the question in this thread.
- Most recent release of IRSTLMis 5.70.04 and can be downloaded from
SourceForge
- The IRSTLM user guide can be found in the SourceForge website:
https://sourceforge.net/apps/mediawiki/irstlm/index.php?title=Main_Page
it seems (line Executing of the step 3)
that the root-dir (where the code is available) is not expanded to its actual
value
Instead of
-root-dir/training/symal/giza2bal.pl
you should have something like
/path-to-code//training/symal/giza2bal.pl
This can be due to a wrong installation of the
I would only put in evidence that irstlm is not responsible for the very large
loading time .
In fact, it starts and ends in less than 1 second (start at 49 seconds and end
at the same time)
So probably as Barry mention, you have to use binarized version of the phrase
and reordering tables
In
/n' | moses -f moses.ini
-xml-input inclusive
BEST TRANSLATION: south [1] [total=-0.899] 0.000, -1.000, 0.000, -10.828,
0.000, 0.000, 0.000, 0.000, 0.000, -0.313, -0.845, -0.249, -0.249, 1.000
south
best regards,
Nicola Bertoldi
On Mar 9, 2012, at 7:44 PM, Feifan Liu wrote:
Dear All,
I
, Nicola Bertoldi
berto...@fbk.eumailto:berto...@fbk.eu wrote:
Dear Philley,
let us assume that you use inclusive options and that you suggest the
translation trg
for the source src with probability P with the following xml markup:
np translation=trg prob=Psrc/np
This is almost equivalent
Dear Pratyush,
the format of the LM metafile
(/home/pbanerjee/smt-work/incremental_LM_Exp/mt/l_models/interp.wt.fina)
is the following
LMINTERPOLATION 2
0.439053 /home/pbanerjee/smt-work/incremental_LM_Exp/mt/l_models/forum.lm
0.560947
:
the Moses configuration file should be changed accordingly (with the correct
path)
1 0 5 train.irstlm.blm
You spend some time to binarize the LM once, but then loading it in Moses will
be very fast.
best
Nicola Bertoldi
On Apr 11, 2012, at 8:58 AM, Loki Cheng wrote:
Hi, everyone, I want to use
Dear Somayeh,
I am answering to the same email you sent to the IRSTLM support list
user-irs...@list.fbk.eumailto:user-irs...@list.fbk.eu
because it is more appropriate for you problem.
best
Nicola
On Apr 20, 2012, at 11:35 AM, somayeh bakhshaei wrote:
Hello all,
I am trying to use caching
Hi Tomas,
IRSTLM actually returns 0 if it succeeds,
It returns values greater than 0, if something fails and if the usage info are
required.
Which is the cmd actually executed by train-recaser.perl?
Have you tried to run this command separately in a shell?
What does it return?
The problem
Dear Mauro,
could you please send me the command line you use?
Probably, there is a mismatch in the versions. I would check immediately.
Which version of IRSTLM and MOSES did you actually installed?
Meanwhile could you try to edit the configuration file and remove the keyword
LMINTERPOLATION
me the piece of your code related to dictionary,
as well as the log of your debugging?
best regards,
Nicola Bertoldi
(IRSTLM development team)
On Aug 6, 2012, at 3:52 PM, Filip Petkovski wrote:
Hi,
I am using IRSTLM for making a language model and I got a segmentation fault
when I was trying
Hi Nick,
if you intend to translate case-sensitive input text
and your LM combination is either liner or log-linear,
the answer is YES.
You can use IRSTLM package.
There is the possibility to load (together with a standard uncased LM) a map
which transforms the words of an ngram before querying
Hi Barry, Hi Prashant
could the implementation of parallel data structure
be a nice project for the forthcoming MT Marathon?
Which characteristics should it have?
I have some ideas about it
Nicola
On Aug 24, 2012, at 5:25 PM, Barry Haddow wrote:
Hi Prashant
If you want to update the
Moses useslog_e
while Kenlm useslog_10
log_e(P) = log_e(10) log_10(P)
-17.715 = log_e(10)*-7.69363
Am I right?
Nicola
On Oct 25, 2012, at 4:09 PM, Adam Teichert wrote:
Hi all,
After decoding, I get the output I am . from moses. I expected
the lm feature of the winning
From the FBK community...
as already mentioned by ken,
tlm computes correctly the Improved Kneser-Ney method (-lm=msb)
tlm can keep the singletons: set parameter -ps=no
As concerns as OOV words tlm computes the probability of the OOV as it were a
class of all possible unknown words.
In
BLEU scores on a test set for IRSTLM and
SRILM. I have to do some reading on this dub issue and see what happens.
W dniu 08.11.2012 09:20, Nicola Bertoldi pisze:
From the FBK community...
as already mentioned by ken,
tlm computes correctly the Improved Kneser-Ney method (-lm=msb)
tlm
?
Please try also to run the build-lm.sh command with this additional
parameters -k 10 -verbose,
collect the stdout and stderr and send it me.
Nicola Bertoldi
On behalf of IRSTLM team
On Apr 2, 2013, at 2:19 PM, Hieu Hoang wrote:
i'm not an expert with irstlm but you should double check you
Dear Nelson
I developed a phrase table (based on a cache) which can be automatically fed a
new entries at run-time
The insertion of new entries is performed by adding to the input command line a
xml-like input
'dlt cblm=source_phrase|||target_phrase/
similar to that for the local suggestion of
regards,
Nicola Bertoldi
On May 3, 2013, at 12:47 AM, Pradeep Dasigi wrote:
Hello,
I have been trying to use interpolated LM produced by IRSTLM (-v 5.80.02 ) in
Moses (-v 1.0). interpolate-lm of IRSTLM successfully produces an output file
(lmlist.final) in the following format:
LMINTERPOLATION
Maybe this can help to solve the issue
I also noted that XMLRPC libraries is not statically link
this is what I got running ldd on moses first and mosesserve
ldd moses
linux-vdso.so.1 = (0x7fff3f3a1000)
librt.so.1 = /lib64/librt.so.1 (0x003620a0)
It seems that the BOOST library libboost_unit_test_framework-mt.dylib is
missing
I also use a Mac OSX (10.6) with BOOST 1.51, compiled without multithreaded
support (-mt)
and hence in my BOOST the library is called
libboost_unit_test_framework.dylib
i.e. without suffix -mt
and the testing
I don't see any reason why a lattice should contain an EPSILON edge.
In a confusion network, EPSILON are needed to allow the translation of input of
different lengths.
The sausage structure of the CN imposes the same amount of source words,
and the EPSILONs overcome this constraint.
This is not
-factor 1 ?
Nicola Bertoldi
On Oct 10, 2013, at 1:05 PM, Hieu Hoang wrote:
Hi all
Achim and I have been working on adding support for placeholders into Moses.
That is, replacing a number, date, or named entity with a symbol eg. @num@,
-date-, =named-entity=. We think it would be especially
Hi James,
yes, I already succeeded in installing randLM and SRILM on both MacOSx and
Linux Redhat. it was very simple following their documentations.
and
yes, it is possible to compile Moses with any LM library you like
adding zero, one or more of the following parameters
Hi Xheng
this is possible through the parameter -n-best-list
see for details
http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc10
Nicola
On Jan 21, 2014, at 5:28 PM, Zheng Yuan wrote:
Hi,
The default settings of the Moses decoder only output the best translation
(only one). I am
Hi Janez,
Seth syggested you the right fix
I just checked the IRSTLM documentation
http://sourceforge.net/apps/mediawiki/irstlm/index.php?title=Estimating_gigantic_models
and the correct notation is reported there.
Could you please tell me from where do you get the wrong information
So that I
Hi Nima,
at FBK, we have recently develop a new (beta) version of MGIZA++ which enables
online word alignment
Maybe you could use it directly or grasp the way we load the tables/models
do not hesitate to ask us any clarification.
You can find it here
http://hlt.fbk.eu/technologies/onlinemgiza
Hi Zheng
IRSTLM is able to read and manage google n-gram.
Nicola
On 03/18/14 20:39, Hieu Hoang wrote:
Moses supports RandLM and neural network LM which can handle very large
amounts of data, I think.
I'm not sure if IRSTLM or KenLM can handle Google ngram data, but I know
they can handle
Hi Maria
first I would like to ask (for next time)
to write to the IRSTLM mailing list (user-irs...@list.fbk.eu)
to any problem related to IRSTLM only
Now I am going to reply you privately.
Nicola
On Jun 20, 2014, at 6:47 AM, Maria Marpaung wrote:
Hello, I need help.
I'm working on my
Hi Thomas
this is clearly a problem related to IRSTLM
I would kindly ask you to open a ticket in the IRSTLM Github Repo adding as
much info as possible
Please also add the actual command you run
I (as IRSTLM developer) will reply asap
Nicola
> On 20 May 2016, at 11:31, Tomasz Gawryl
proviamo a partecipare al task English-Czech?
Nicola
> On 11 Jun 2019, at 22:14, marco turchi wrote:
>
> [Apologies for multiple posting]
>
> The 16th International Workshop on Spoken Language Translation
> IWSLT 2019 – First Call for Participation
>
> November 2nd – 3rd, 2019 – Hong Kong
>
71 matches
Mail list logo