I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM compiles of it failed in the past, but I'll try again. I expect it to be about 8 GB when that's done. Do you want it?
matt > On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) > <[email protected]> wrote: > > Hey Folks, > > Anyone have a Russian Language Model for Joshua? Lewis was working on > one, not sure if he has it but just broadening the question. > > Cheers, > Chris > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: [email protected] > WWW: http://sunset.usc.edu/~mattmann/ > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Director, Information Retrieval and Data Science Group (IRDS) > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > > >
