I guess my first thought would be to see if there is a simple way to
compute the input you are providing to huge count into fewer files. If you
have a lot of files that start with the letter 'a', for example, you could
concatentate them all together via a (Linux) command like
cat a* >
I am trying to get the bigram counts aggregated across a lot of files. However,
when I ran huge-count.pl using the list of files as an input, I got the error
"Argument list too long". What would you recommend for combining many files,
when there are too many files to just run huge-count.pl as