> I have an idea to unique and combine the k-mers of many (extremely > large) FASTA files for a project.
Coming a bit late to the party (from vacation, and also having some, uh, other things to worry about in my life), but did you have a look at: http://blog.malde.org/posts/frequency-counting.html http://blog.malde.org/posts/k-mer-counting.html There's code as well, if you (or anybody else)'re interested. -k -- If I haven't seen further, it is by standing in the footprints of giants