Re: [ngram] Using huge-count.pl with lots of files

2018-04-16 Thread Ted Pedersen tpede...@d.umn.edu [ngram]
> -- > *From:* ngram@yahoogroups.com on behalf of Ted > Pedersen tpede...@d.umn.edu [ngram] > *Sent:* 15 April 2018 23:41:36 > *To:* ngram@yahoogroups.com > *Subject:* Re: [ngram] Using huge-count.pl with lots of files > > > > I guess my first

Re: [ngram] Using huge-count.pl with lots of files

2018-04-16 Thread Serge Sharoff s.shar...@leeds.ac.uk [ngram]
Re: [ngram] Using huge-count.pl with lots of files I guess my first thought would be to see if there is a simple way to compute the input you are providing to huge count into fewer files. If you have a lot of files that start with the letter 'a', for example, you could concatentate t

Re: [ngram] Using huge-count.pl with lots of files

2018-04-15 Thread Ted Pedersen tpede...@d.umn.edu [ngram]
I guess my first thought would be to see if there is a simple way to compute the input you are providing to huge count into fewer files. If you have a lot of files that start with the letter 'a', for example, you could concatentate them all together via a (Linux) command like cat a* > myafiles.txt