Hi Everyone,

I'm working with MAUVE. I'm interested in certain regions that are present
in one of my genomes, but no on the others in the alignment. What I did is
that I used the backbone file, and pick the regions that have coordinates
for that genome (cero I assumed was absent of the region). Additionally, I
made a script to use the coordinates in the backbone file to extract from
the genome the sequences of interest. I have several assumptions about the
way the coordinates on the backbone file reflect the information in the
genomes. However, I have been getting weird sequences, this lead me to ask
you several questions, hoping you could help me.

1. When I have a fragmented genome, I have a multi-fasta file representing
this genome. When I want to extract the sequences using the coordinates
from the backbone file, what assumptions do I have to make to deal with the
different contigs. How does Mauve make the counting over a multifasta
file?. Is Mauve counting the letters in the headers? Do the coordinates
re-start each time that a header appears, I mean, for each contig in my
fasta file, Mauve starts from 1 again?

2. How do I deal with the negative coordinates? Do I extract the sequences,
an then make the reverse of that sequence?  or Do I extract those
coordinates of the whole genome reversed? Do I have to start counting from
the end of the sequence, to the begining?

3. Is there available an utility that helps me with these issues?


Thanks very much for your attention!
-- 

Laura Perlaza-Jiménez
Graduate Research Assistant
Mycology and Plant Pathology Laboratory (LAMFU)
Universidad de Los Andes. Bogotá, Colombia.
------------------------------------------------------------------------------
_______________________________________________
Mauve-users mailing list
Mauve-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/mauve-users

Reply via email to