o what you
want) and therefore the vocab file.
Kenneth
On 06/04/16 12:11, Graeme Kidd wrote:
>
> Thanks, that's given me a good starting point. The next problem is
> that the dump_trie program expects a vocab file which isn't provided.
> Any idea how I could create one?
>
>
>
Thanks, that’s given me a good starting point. The next problem is that the
dump_trie program expects a vocab file which isn’t provided. Any idea how I
could create one?
Thanks again,
Graeme
From: Kenneth Heafield [mailto:mo...@kheafield.com]
Sent: 04 June 2016 08:00
To: Graeme Kidd
Hi,
This is still all very new to me so apologies if this is not the correct
place to ask this questions.
I am wanting to take the English Trie Language Model (5.5TB) created from
the Common Crawl data set:
http://data.statmt.org/ngrams/lm/en.trie
Then extract all n-grams that contain