Done:
http://cs.jhu.edu/~post/tmp/ru.kenlm
4106251755 bytes, sha1sum: 5c894e24dafa42bc44a5bb6822812d6234eda791
Let me know when you have it so I can delete it.
matt
> On Jul 15, 2016, at 4:42 PM, Matt Post <[email protected]> wrote:
>
> All right, started trying to recompile. If you have a machine with > 256 GB
> of memory, it might be more efficient for me to give you the raw ARPA file
> and for you to compile it. We'll see how it goes. Ping me in a day if you
> don't hear from me.
>
> matt
>
>
>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980)
>> <[email protected]> wrote:
>>
>> Yes please! :)
>>
>> Sent from my iPhone
>>
>>> On Jul 15, 2016, at 1:39 PM, Matt Post <[email protected]> wrote:
>>>
>>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM
>>> compiles of it failed in the past, but I'll try again. I expect it to be
>>> about 8 GB when that's done. Do you want it?
>>>
>>> matt
>>>
>>>
>>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980)
>>>> <[email protected]> wrote:
>>>>
>>>> Hey Folks,
>>>>
>>>> Anyone have a Russian Language Model for Joshua? Lewis was working on
>>>> one, not sure if he has it but just broadening the question.
>>>>
>>>> Cheers,
>>>> Chris
>>>>
>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>> Chris Mattmann, Ph.D.
>>>> Chief Architect
>>>> Instrument Software and Science Data Systems Section (398)
>>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>> Office: 168-519, Mailstop: 168-527
>>>> Email: [email protected]
>>>> WWW: http://sunset.usc.edu/~mattmann/
>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>> Director, Information Retrieval and Data Science Group (IRDS)
>>>> Adjunct Associate Professor, Computer Science Department
>>>> University of Southern California, Los Angeles, CA 90089 USA
>>>> WWW: http://irds.usc.edu/
>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>