subject:"\[ngram\] Using huge\-count.pl with lots of files"

Re: [ngram] Using huge-count.pl with lots of files

2018-04-15 Thread Ted Pedersen tpede...@d.umn.edu [ngram]

I guess my first thought would be to see if there is a simple way to compute the input you are providing to huge count into fewer files. If you have a lot of files that start with the letter 'a', for example, you could concatentate them all together via a (Linux) command like cat a* >

[ngram] Using huge-count.pl with lots of files

2018-04-15 Thread catherine.dejage...@gmail.com [ngram]

I am trying to get the bigram counts aggregated across a lot of files. However, when I ran huge-count.pl using the list of files as an input, I got the error "Argument list too long". What would you recommend for combining many files, when there are too many files to just run huge-count.pl as