Hi 

I've seen this error before. The short answer is that you need to use a 
gzipped version of the corpus.

The reason is that randlm uses gzip to decompress/compress when you have a 
gzipped corpus, which is fine because gzip takes a -d argument for 
decompressing. If presented with a non-gzipped version of the corpus, randlm 
attempts to fake gzip with cat, which fails because cat doesn't accept -d. 

This has come up on the mailing list before, as far as I recall.

regards
Barry

On Thursday 19 February 2009 13:53, Michael Zuckerman wrote:
> Hi,
>
> We are trying to run RandLM on our files. We use the command:
> $ ./buildlm -struct BloomMap -falsepos 8 -values 8 -output-prefix model
> -input-type corpus < ../../europarl.lower.token.en
>
> And we get the following errors:
> cat: invalid option -- d
> Try `cat --help' for more information.
> rm ./model.tokens
> buildlm: RandLMStats.cpp:312: virtual bool
> randlm::CountStats::observe(const randlm::Word*, randlm::Value, int):
> Assertion `len > 0' failed.
> Aborted
>
> Are you familiar with these errors ? Do you have an idea about how to solve
> them ?
>
> Thanks,
>      Michael.

-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to