Hello,

Would it be possible to include an option that first goes through all of the strings, running a sliding window along each one, to find all of the unique k-mers actually present in the dataset? Only those k-mers would then become columns. This would avoid a sparse matrix with many columns of all-zero counts when a larger value of width is specified.
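
To make the request concrete, here is a rough sketch of the two-pass idea. It is written in Python purely for illustration and is not a proposed implementation for the package. The function name `count_observed_kmers` and its return format are hypothetical; only the `width` argument comes from the description above.

```python
from collections import Counter


def count_observed_kmers(sequences, width):
    """Count k-mers per sequence, keeping only k-mers seen in the data.

    Hypothetical helper for illustration. Returns (columns, matrix), where
    `columns` is the sorted list of observed k-mers and `matrix` has one
    row of counts per input sequence.
    """
    # Pass 1: slide a window of size `width` along every string and count
    # the k-mers it contains. Strings shorter than `width` yield no k-mers.
    per_sequence = [
        Counter(seq[i:i + width] for i in range(len(seq) - width + 1))
        for seq in sequences
    ]

    # The columns are the union of k-mers observed anywhere in the dataset,
    # rather than all possible k-mers of that width.
    columns = sorted(set().union(*per_sequence))

    # Pass 2: build the dense count matrix over the observed columns only.
    # A Counter returns 0 for a k-mer that a given sequence does not contain.
    matrix = [[counts[kmer] for kmer in columns] for counts in per_sequence]
    return columns, matrix


columns, matrix = count_observed_kmers(["ACGTAC", "CGTTGA"], width=3)
print(columns)
for row in matrix:
    print(row)
```

For these two toy DNA strings and a width of 3, the result has 7 columns instead of the 4^3 = 64 columns needed to cover every possible 3-mer, and no column is all zeros.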
--------------------------------------
Dario Strbenac
PhD Student
University of Sydney
Camperdown NSW 2050
Australia

_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel