++

---------- Forwarded message ---------
From: Namrata Hadimani <namrata.hadim...@mycit.ie>
Date: Fri, 23 Apr 2021 at 00:39
Subject: Re: [Moses-support] reg. moses installation
To: Kenneth Heafield <mo...@kheafield.com>


Hi Kenneth,

Actually I am facing error in creating the Tokenisation itself. I tried to
run through the tokeniser.perl script but it didn't help me . There is no
log generated for this functionality.
Could you guide me more.

*Below are the scripts I ran :*

~/mosesdecoder/scripts/tokenizer/tokenizer.perl -l en \
    < ~/corpus/training/news-commentary-v8.fr-en.en    \
    > ~/corpus/news-commentary-v8.fr-en.tok.en

*And I am getting this as a response :*

Tokenizer Version 1.1
Language: en
Number of threads: 1

*After this Step, the tokens are not created and the file is Empty. How
will I fix this problem?*

*Thanks in Advance*

Regards,
Namrata Hadimani



On Thu, 22 Apr 2021 at 23:45, Kenneth Heafield <mo...@kheafield.com> wrote:

> Your training corpus is empty.
>
> cat ~/corpus/news-commentary-v8.fr-en.true.en
>
> On 4/22/21 9:50 PM, Namrata Hadimani wrote:
> > Hi All,
> >
> > I am trying to perform Language Model Training using the below command
> >
> >  ~/mosesdecoder/bin/lmplz -o 3
> <~/corpus/news-commentary-v8.fr-en.true.en >
> news-commentary-v8.fr-en.arpa.en
> >
> >
> > followed similar to the link below :
> > http://www.statmt.org/moses/?n=Moses.Baseline
> > <http://www.statmt.org/moses/?n=Moses.Baseline>
> >
> > But I faced this issue  :
> >  === 1/5 Counting and sorting n-grams ===
> > Reading /home/namrata/smt/corpus/news-commentary-v8.fr-en.true.en
> >
> ----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100
> > tcmalloc: large alloc 3135389696 bytes == 0x144e000 @
> > tcmalloc: large alloc 10451279872 bytes == 0xbc272000 @
> > Unigram tokens 0 types 3
> > === 2/5 Calculating and sorting adjusted counts ===
> > Chain sizes: 1:36 2:4734547456 3:8877277184
> > tcmalloc: large alloc 8877277184 bytes == 0x144e000 @
> > tcmalloc: large alloc 4734550016 bytes == 0x32ba4e000 @
> > terminate called after throwing an instance of
> > 'lm::builder::BadDiscountException'
> >   what():  /home/namrata/smt/mosesdecoder/lm/builder/adjust_counts.cc:53
> > in void
> > lm::builder::{anonymous}::StatCollector::CalculateDiscounts(const
> > lm::builder::DiscountConfig&) threw BadDiscountException because `s.n[j]
> > == 0'.
> > Could not calculate Kneser-Ney discounts for 1-grams with adjusted count
> > 2 because we didn't observe any 1-grams with adjusted count 1; Is this
> > small or artificial data?
> > Try deduplicating the input.  To override this error for e.g. a
> > class-based model, rerun with --discount_fallback
> >
> > So I tried changing my command to the following as
> > :  ~/mosesdecoder/bin/lmplz -o 3
> > <~/corpus/news-commentary-v8.fr-en.true.en >
> > news-commentary-v8.fr-en.arpa.en *--**discount_fallback*
> >
> > After this When i run the command : ~/mosesdecoder/bin/build_binary \
> >
> >    news-commentary-v8.fr-en.arpa.en \
> >    news-commentary-v8.fr-en.blm.en
> >
> >
> >
> > I am facing error like:  lm/vocab.cc:324 in void
> > lm::ngram::MissingSentenceMarker(const lm::ngram::Config&, const char*)
> > threw SpecialWordMissingException.
> > The ARPA file is missing </s> and the model is configured to reject
> > these models.  Run build_binary -s to disable this check. Byte: 66
> > ERROR
> >
> > Could anyone help me out with this, please.
> >
> > Regards,
> > Namrata Hadimani
> >
> > On Thu, 22 Apr 2021 at 18:32, Namrata Hadimani
> > <namrata.hadim...@mycit.ie <mailto:namrata.hadim...@mycit.ie>> wrote:
> >
> >     Hi Hieu,
> >
> >     Thanks for the help, I am able to successfully compile the moses
> >     ToolKit.
> >
> >     Regards,
> >     Namrata Hadimani
> >
> >     On Thu, 22 Apr 2021 at 17:23, Hieu Hoang <hieuho...@gmail.com
> >     <mailto:hieuho...@gmail.com>> wrote:
> >
> >         i've just successfully compiled moses-4.0 on ubuntu 20.04 with
> >         boost 1.71.
> >
> >         there's something wrong with your boost installation
> >
> >         On 4/22/2021 3:09 AM, ram anirudh cherukupally wrote:
> >>         There is atleast 60 GB space, so I think it is not space
> >>         issue. Has moses-4.0 been tested for compilation using boost
> >>         1.71? Do you recommend using boost 1.64  (as exemplified in
> >>         Moses manual?)
> >>         Thank you
> >>
> >>         On Thu, Apr 22, 2021 at 1:15 PM Hieu Hoang
> >>         <hieuho...@gmail.com <mailto:hieuho...@gmail.com>> wrote:
> >>
> >>             there seems to be a problem with the boost library. Is the
> >>             disk full? Perhaps you should re-install boost
> >>
> >>             On 4/21/2021 10:39 PM, ram anirudh cherukupally wrote:
> >>>             Dear Moses-Support,
> >>>
> >>>             Please find the build.log.gz attached as per the
> >>>             instructions when the build failed.
> >>>             My system specs:
> >>>
> >>>             OS: Ubuntu 20.04
> >>>             RAM: 8 GB
> >>>             libboost-dev version: 1.71.0.0ubuntu2
> >>>
> >>>             Command used for compiling moses: ./bjam -j4
> >>>
> >>>             Thanks and Regards
> >>>
> >>>             --
> >>>             CH Ram Anirudh
> >>>
> >>>
> >>>             _______________________________________________
> >>>             Moses-support mailing list
> >>>             Moses-support@mit.edu <mailto:Moses-support@mit.edu>
> >>>             http://mailman.mit.edu/mailman/listinfo/moses-support <
> http://mailman.mit.edu/mailman/listinfo/moses-support>
> >>
> >>             --
> >>             Hieu Hoang
> >>             http://statmt.org/hieu <http://statmt.org/hieu>
> >>
> >>
> >>
> >>         --
> >>         CH Ram Anirudh
> >>
> >         --
> >         Hieu Hoang
> >         http://statmt.org/hieu <http://statmt.org/hieu>
> >
> >         _______________________________________________
> >         Moses-support mailing list
> >         Moses-support@mit.edu <mailto:Moses-support@mit.edu>
> >         http://mailman.mit.edu/mailman/listinfo/moses-support
> >         <http://mailman.mit.edu/mailman/listinfo/moses-support>
> >
> >
> > _______________________________________________
> > Moses-support mailing list
> > Moses-support@mit.edu
> > http://mailman.mit.edu/mailman/listinfo/moses-support
> >
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to