hello, I have a file of 4 Giga bytes, I tried extract bigrams with huge-count.pl but it gives me an error message at split-data.pl, line 268 "memory problem" I use Fedora 10. why? I treat the TREC collection WT10G 10 GO "after preprocessing I get the file of 4 GO" the number of file is 2,000,000 Is there another possibility to do? thank you Arezki