Ernesto.
I agree, I think pytables would be a good solution for his full (with gaps)
multiple sequence alignment.
Regarding your problem, hopefully others with more experience can help with
optimization. Personally I've had troubles in similar situations where I
had large amounts of varia
Hi Richard,
>This looks, in part, a way to store a huge multiple sequence
alignment with
>a reference sequence (the first character in ( ) being the DNA base
in a
>reference DNA molecule, but due to the inequal lengths in each VLA,
it would
>seem that gaps are not stored, or stored else
I'm going to stab at understanding your problem. Correct me where
I'm wrong.
On Sat, Dec 12, 2009 at 12:45 PM, Ernesto wrote:
As I wrote I start with an input file. It contains a string of
variable length (10e7-10e8). This string consists of four different
characters (A,C,G,T), the bases of a