that might be it.  but i seem to have it working here, using a
non-gzipped version of Europarl.

in any case, Michael:  tell us if it works when the corpus is gzipped

Miles

2009/2/19 Barry Haddow <[email protected]>:
> Hi
>
> I've seen this error before. The short answer is that you need to use a
> gzipped version of the corpus.
>
> The reason is that randlm uses gzip to decompress/compress when you have a
> gzipped corpus, which is fine because gzip takes a -d argument for
> decompressing. If presented with a non-gzipped version of the corpus, randlm
> attempts to fake gzip with cat, which fails because cat doesn't accept -d.
>
> This has come up on the mailing list before, as far as I recall.
>
> regards
> Barry
>
> On Thursday 19 February 2009 13:53, Michael Zuckerman wrote:
>> Hi,
>>
>> We are trying to run RandLM on our files. We use the command:
>> $ ./buildlm -struct BloomMap -falsepos 8 -values 8 -output-prefix model
>> -input-type corpus < ../../europarl.lower.token.en
>>
>> And we get the following errors:
>> cat: invalid option -- d
>> Try `cat --help' for more information.
>> rm ./model.tokens
>> buildlm: RandLMStats.cpp:312: virtual bool
>> randlm::CountStats::observe(const randlm::Word*, randlm::Value, int):
>> Assertion `len > 0' failed.
>> Aborted
>>
>> Are you familiar with these errors ? Do you have an idea about how to solve
>> them ?
>>
>> Thanks,
>>      Michael.
>
> --
> The University of Edinburgh is a charitable body, registered in
> Scotland, with registration number SC005336.
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>



-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to