I don't know, but you can try asking the irst guys on their mailing list:
https://list.fbk.eu/sympa/subscribe/user-irstlm
I'll be interested in finding out too
On 6 December 2012 02:01, HOANG Cong Duy Vu <[email protected]> wrote:
> Hi everyone,
>
> I would like to build large LMs from the Google Web1T
> 5-gram<http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13>
> .
> I tried to use the goograms2ngrams.pl script from IRSTLM toolkit to
> extract raw n-gram counts but don't know how to build LMs (e.g. arpa file)
> from those count files.
>
> Does anyone use to deal with it? Please advise me.
>
> Thanks in advance!
>
> --
> Cheers,
> Vu
>
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support