Hi I've seen this error before. The short answer is that you need to use a gzipped version of the corpus.
The reason is that randlm uses gzip to decompress/compress when you have a gzipped corpus, which is fine because gzip takes a -d argument for decompressing. If presented with a non-gzipped version of the corpus, randlm attempts to fake gzip with cat, which fails because cat doesn't accept -d. This has come up on the mailing list before, as far as I recall. regards Barry On Thursday 19 February 2009 13:53, Michael Zuckerman wrote: > Hi, > > We are trying to run RandLM on our files. We use the command: > $ ./buildlm -struct BloomMap -falsepos 8 -values 8 -output-prefix model > -input-type corpus < ../../europarl.lower.token.en > > And we get the following errors: > cat: invalid option -- d > Try `cat --help' for more information. > rm ./model.tokens > buildlm: RandLMStats.cpp:312: virtual bool > randlm::CountStats::observe(const randlm::Word*, randlm::Value, int): > Assertion `len > 0' failed. > Aborted > > Are you familiar with these errors ? Do you have an idea about how to solve > them ? > > Thanks, > Michael. -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
