Can you make this public for good? Or is it the size which is the issue? Is this build using master branch Matt? I am having issues building models with masterŠ I¹ll post my issues on another thread.
Dr. Lewis John McGibbney Ph.D., B.Sc. Data Scientist II Computer Science for Data Intensive Applications Group 398M Jet Propulsion Laboratory California Institute of Technology 4800 Oak Grove Drive Pasadena, California 91109-8099 Mail Stop : 158-256C Tel: (+1) (818)-393-7402 Cell: (+1) (626)-487-3476 Fax: (+1) (818)-393-1190 Email: [email protected] Dare Mighty Things On 7/16/16, 1:09 PM, "Matt Post" <[email protected]> wrote: >Done: > > http://cs.jhu.edu/~post/tmp/ru.kenlm > 4106251755 bytes, sha1sum: 5c894e24dafa42bc44a5bb6822812d6234eda791 > >Let me know when you have it so I can delete it. > >matt > > >> On Jul 15, 2016, at 4:42 PM, Matt Post <[email protected]> wrote: >> >> All right, started trying to recompile. If you have a machine with > >>256 GB of memory, it might be more efficient for me to give you the raw >>ARPA file and for you to compile it. We'll see how it goes. Ping me in a >>day if you don't hear from me. >> >> matt >> >> >>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) >>><[email protected]> wrote: >>> >>> Yes please! :) >>> >>> Sent from my iPhone >>> >>>> On Jul 15, 2016, at 1:39 PM, Matt Post <[email protected]> wrote: >>>> >>>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM >>>>compiles of it failed in the past, but I'll try again. I expect it to >>>>be about 8 GB when that's done. Do you want it? >>>> >>>> matt >>>> >>>> >>>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) >>>>><[email protected]> wrote: >>>>> >>>>> Hey Folks, >>>>> >>>>> Anyone have a Russian Language Model for Joshua? Lewis was working on >>>>> one, not sure if he has it but just broadening the question. >>>>> >>>>> Cheers, >>>>> Chris >>>>> >>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>>> Chris Mattmann, Ph.D. >>>>> Chief Architect >>>>> Instrument Software and Science Data Systems Section (398) >>>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>>>> Office: 168-519, Mailstop: 168-527 >>>>> Email: [email protected] >>>>> WWW: http://sunset.usc.edu/~mattmann/ >>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>>> Director, Information Retrieval and Data Science Group (IRDS) >>>>> Adjunct Associate Professor, Computer Science Department >>>>> University of Southern California, Los Angeles, CA 90089 USA >>>>> WWW: http://irds.usc.edu/ >>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>> >> >
