Hi Malene, thanks for writing...

On Tue, 2010-11-30 at 04:09 +0100, Malene Bek-Thomsen wrote: 
> Hi Mauve users
> 
> I’m a new user of Mauve and have problems aligning 50 bacterial genomes – a 
> mixture of complete genomes and incomplete genomes. It works very well with 
> the complete genomes, but the problem arrives when I want to add the 
> incomplete genomes consisting of multiple contigs. For each incomplete genome 
> I have downloaded a .gbff file from genbank containing all the contigs of 
> that genome in one file. I then thought I would be able to load the file into 
> Mauve and that Mauve would concatenate the contigs and align these incomplete 
> genomes with the complete genomes. However, when I add the incomplete genomes 
> in form of the .gbff file I end up with the following message: ERROR! gap 
> character encountered at genome sequence position 346579 Input sequences must 
> be unaligned and ungapped! Exited with error code: -1073741819. The gap 
> character found by Mauve seems to be the new entry –the  beginning of the 
> next contig. I have looked at the .gbff file in text format and it looks as 
> expected for a genbank file. Can anyone tell me what I do wrong?
> 

Glad to hear things are working well with your complete genomes.  I
believe the trouble you are encountering with the unfinished genomes
might be related to the file name extension.  Mauve uses file name
extensions to guess the file format and only recognizes .gb and .gbk as
genbank format.  So if your .gbff files conform to the GenBank flat-file
format well enough that Mauve can parse them, then renaming them to .gb
or .gbk should solve the problem.  If that doesn't solve it then please
send me a sample of your data off-list so I can verify correct
formatting.

On a related note, I would highly recommend ordering the contigs in your
draft genomes if you haven't already.  Ideally that would be done with
some kind of high resolution genetic map but if that's not available
there are various software tools to order contigs based on reference
genomes, including one in Mauve and written by Anna Rissman called the
Mauve Contig Mover.  GUI and command-line usage instructions are here:
http://asap.ahabs.wisc.edu/mauve-aligner/mauve-user-guide/reording-contigs-in-draft-genomes.html
Other software tools for the task include Projector 2, Baccardi, OSLay,
ABACAS, and probably more that I don't know about.

I recommend ordering the contigs because it will allow progressiveMauve
to run more efficiently (faster, less memory) and may even result in
higher quality alignments.

Cheers,
-Aaron


------------------------------------------------------------------------------
Increase Visibility of Your 3D Game App & Earn a Chance To Win $500!
Tap into the largest installed PC base & get more eyes on your game by
optimizing for Intel(R) Graphics Technology. Get started today with the
Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs.
http://p.sf.net/sfu/intelisp-dev2dev
_______________________________________________
Mauve-users mailing list
Mauve-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/mauve-users

Reply via email to