Is this installation related?
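
Before suspecting the installation, it may be worth checking that the file the tokenizer reads from actually exists and has content, since an empty input would also give an empty output with no error. A minimal sketch, assuming a POSIX shell; check_corpus is a hypothetical helper name, not part of Moses:

```shell
# Hypothetical helper (not part of Moses): report whether a corpus file
# exists and is non-empty.
# Returns 0 if the file has content, 1 if it is empty, 2 if it is missing.
check_corpus() {
    f="$1"
    if [ ! -e "$f" ]; then
        echo "missing: $f"
        return 2
    fi
    if [ ! -s "$f" ]; then
        echo "empty: $f"
        return 1
    fi
    # wc -l reads from stdin so that only the line count is printed
    echo "ok: $f ($(wc -l < "$f") lines)"
    return 0
}
```

It could be run on the tokenizer input first, e.g. check_corpus ~/corpus/training/news-commentary-v8.fr-en.en, and then on the .tok.en output (paths as quoted below; adjust to your setup).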
On Fri, 23 Apr 2021 at 12:39 AM, Namrata Hadimani <namrata.hadim...@mycit.ie>
wrote:

> Hi Kenneth,
>
> Actually, I am getting stuck at the tokenisation step itself. I tried
> running the tokenizer.perl script, but it didn't help. No log is
> generated for this step.
> Could you guide me further?
>
> *Below are the scripts I ran :*
>
> ~/mosesdecoder/scripts/tokenizer/tokenizer.perl -l en \
>     < ~/corpus/training/news-commentary-v8.fr-en.en    \
>     > ~/corpus/news-commentary-v8.fr-en.tok.en
>
> *And I am getting this as a response :*
>
> Tokenizer Version 1.1
> Language: en
> Number of threads: 1
>
> *After this step, no tokens are created and the output file is empty. How
> can I fix this problem?*
>
> *Thanks in Advance*
>
> Regards,
> Namrata Hadimani
>
>
>
> On Thu, 22 Apr 2021 at 23:45, Kenneth Heafield <mo...@kheafield.com>
> wrote:
>
>> Your training corpus is empty.
>>
>> cat ~/corpus/news-commentary-v8.fr-en.true.en
>>
>> On 4/22/21 9:50 PM, Namrata Hadimani wrote:
>> > Hi All,
>> >
>> > I am trying to perform Language Model Training using the below command
>> >
>> >  ~/mosesdecoder/bin/lmplz -o 3 < ~/corpus/news-commentary-v8.fr-en.true.en > news-commentary-v8.fr-en.arpa.en
>> >
>> >
>> > following the steps at the link below:
>> > http://www.statmt.org/moses/?n=Moses.Baseline
>> >
>> > But I ran into this issue:
>> >  === 1/5 Counting and sorting n-grams ===
>> > Reading /home/namrata/smt/corpus/news-commentary-v8.fr-en.true.en
>> > ----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100
>> > tcmalloc: large alloc 3135389696 bytes == 0x144e000 @
>> > tcmalloc: large alloc 10451279872 bytes == 0xbc272000 @
>> > Unigram tokens 0 types 3
>> > === 2/5 Calculating and sorting adjusted counts ===
>> > Chain sizes: 1:36 2:4734547456 3:8877277184
>> > tcmalloc: large alloc 8877277184 bytes == 0x144e000 @
>> > tcmalloc: large alloc 4734550016 bytes == 0x32ba4e000 @
>> > terminate called after throwing an instance of
>> > 'lm::builder::BadDiscountException'
>> >   what():  /home/namrata/smt/mosesdecoder/lm/builder/adjust_counts.cc:53
>> > in void
>> > lm::builder::{anonymous}::StatCollector::CalculateDiscounts(const
>> > lm::builder::DiscountConfig&) threw BadDiscountException because `s.n[j]
>> > == 0'.
>> > Could not calculate Kneser-Ney discounts for 1-grams with adjusted count
>> > 2 because we didn't observe any 1-grams with adjusted count 1; Is this
>> > small or artificial data?
>> > Try deduplicating the input.  To override this error for e.g. a
>> > class-based model, rerun with --discount_fallback
>> >
>> > So I changed my command to the following:
>> >  ~/mosesdecoder/bin/lmplz -o 3 < ~/corpus/news-commentary-v8.fr-en.true.en > news-commentary-v8.fr-en.arpa.en --discount_fallback
>> >
>> > After this, when I run the command:
>> >
>> >  ~/mosesdecoder/bin/build_binary \
>> >    news-commentary-v8.fr-en.arpa.en \
>> >    news-commentary-v8.fr-en.blm.en
>> >
>> >
>> >
>> > I get the following error: lm/vocab.cc:324 in void
>> > lm::ngram::MissingSentenceMarker(const lm::ngram::Config&, const char*)
>> > threw SpecialWordMissingException.
>> > The ARPA file is missing </s> and the model is configured to reject
>> > these models.  Run build_binary -s to disable this check. Byte: 66
>> > ERROR
>> >
>> > Could anyone help me out with this, please?
>> >
>> > Regards,
>> > Namrata Hadimani
>> >
>> > On Thu, 22 Apr 2021 at 18:32, Namrata Hadimani
>> > <namrata.hadim...@mycit.ie <mailto:namrata.hadim...@mycit.ie>> wrote:
>> >
>> >     Hi Hieu,
>> >
>> >     Thanks for the help, I was able to compile the Moses toolkit
>> >     successfully.
>> >
>> >     Regards,
>> >     Namrata Hadimani
>> >
>> >     On Thu, 22 Apr 2021 at 17:23, Hieu Hoang <hieuho...@gmail.com
>> >     <mailto:hieuho...@gmail.com>> wrote:
>> >
>> >         i've just successfully compiled moses-4.0 on ubuntu 20.04 with
>> >         boost 1.71.
>> >
>> >         there's something wrong with your boost installation
>> >
>> >         On 4/22/2021 3:09 AM, ram anirudh cherukupally wrote:
>> >>         There is at least 60 GB of space, so I don't think it is a
>> >>         space issue. Has moses-4.0 been tested for compilation using boost
>> >>         1.71? Do you recommend using boost 1.64 (as exemplified in the
>> >>         Moses manual)?
>> >>         Thank you
>> >>
>> >>         On Thu, Apr 22, 2021 at 1:15 PM Hieu Hoang
>> >>         <hieuho...@gmail.com <mailto:hieuho...@gmail.com>> wrote:
>> >>
>> >>             there seems to be a problem with the boost library. Is the
>> >>             disk full? Perhaps you should re-install boost
>> >>
>> >>             On 4/21/2021 10:39 PM, ram anirudh cherukupally wrote:
>> >>>             Dear Moses-Support,
>> >>>
>> >>>             Please find the build.log.gz attached as per the
>> >>>             instructions when the build failed.
>> >>>             My system specs:
>> >>>
>> >>>             OS: Ubuntu 20.04
>> >>>             RAM: 8 GB
>> >>>             libboost-dev version: 1.71.0.0ubuntu2
>> >>>
>> >>>             Command used for compiling moses: ./bjam -j4
>> >>>
>> >>>             Thanks and Regards
>> >>>
>> >>>             --
>> >>>             CH Ram Anirudh
>> >>>
>> >>>
>> >>>             _______________________________________________
>> >>>             Moses-support mailing list
>> >>>             Moses-support@mit.edu <mailto:Moses-support@mit.edu>
>> >>>             http://mailman.mit.edu/mailman/listinfo/moses-support <
>> http://mailman.mit.edu/mailman/listinfo/moses-support>
>> >>
>> >>             --
>> >>             Hieu Hoang
>> >>             http://statmt.org/hieu <http://statmt.org/hieu>
>> >>
>> >>
>> >>
>> >>         --
>> >>         CH Ram Anirudh
>> >>
>> >         --
>> >         Hieu Hoang
>> >         http://statmt.org/hieu <http://statmt.org/hieu>
>> >
>> >
>> >
>> >
>>
>
