Raymond, you've probably figured this out already, but since I was looking
for discussion of these files, I'll answer your email. The SNP file doesn't
address gaps, doesn't depend on the contig mover. The three fields for
every genome represent the contig name (the fasta header line of the
sequence containing the position of the SNP), the position in that
particular contig (i.e. 1-based position of the SNP in that particular
fasta entry), and the position in the whole genome (i.e. if you
concatenated all contigs in order together into one sequence, what would be
the coordinate of the SNP). This last obviously applies to a "draft" or
multi-chromosomal genome that contains multiple fasta entries in one file;
if you're dealing with a single chromosome, then the "position in contig"
and "genome-wide position" will match.
~Joe
On Sat, Feb 28, 2015 at 4:16 AM, Raymond Wan <r...@cuhk.edu.hk> wrote:
>
> Dear Aaron (or anyone else who knows),
>
> I've noticed that the SNP file format has changed in version 2.4.0
> compared to
> what's mentioned here:
>
> http://darlinglab.org/mauve/user-guide/files.html
>
> The first column is still "SNP pattern". But what use to be coordinates
> for
> each genome has now tripled. For each genome there is now:
>
> sequence_1_Contig sequence_1_PosInContg sequence_1_GenWidePos1 ...
>
> Would I be correct in saying that:
>
> 1) *_Contig is null if it contributes a gap to the SNP pattern.
> Otherwise, it
> gives the range of the contig.
>
> 2) *_PosInContg is 0 if it contributes a gap to the SNP pattern;
> otherwise, it
> is the position within that contig.
>
> 3) *_GenWidePos1 is 0 if it contributes a gap to the SNP pattern;
> otherwise it
> is the position within that genome.
>
> I'm thinking aloud a bit. :-) But it sounds like the old SNP file is just
> column 3. Is that correct?
>
> However, I'm a bit puzzled what contig means in this context. Does it
> refer to
> usage of the Mauve Contig Mover? That is, if MCM isn't used, then columns
> (2)
> and columns (3) are identical in value? (This is what I'm noticing and I'm
> trying to understand why...I think this is the reason, since I haven't used
> MCM.)
>
> Thank you!
>
> Ray
>
>
>
>
> -------------------------------------------------
> This e-mail is sent by CUHK WebMail http://webmail.cuhk.edu.hk
>
>
>
> ------------------------------------------------------------------------------
> Dive into the World of Parallel Programming The Go Parallel Website,
> sponsored
> by Intel and developed in partnership with Slashdot Media, is your hub for
> all
> things parallel software development, from weekly thought leadership blogs
> to
> news, videos, case studies, tutorials and more. Take a look and join the
> conversation now. http://goparallel.sourceforge.net/
> _______________________________________________
> Mauve-users mailing list
> Mauve-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/mauve-users
>
--
Joseph Fass
Bioinformatics Data Analyst
UC Davis Genome Center - Bioinformatics Core
http://bioinformatics.ucdavis.edu/
jnf...@ucdavis.edu
phone ~ 530.752.2698
------------------------------------------------------------------------------
What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic
patterns at an interface-level. Reveals which users, apps, and protocols are
consuming the most bandwidth. Provides multi-vendor support for NetFlow,
J-Flow, sFlow and other flows. Make informed decisions using capacity planning
reports.http://sdm.link/zohodev2dev
_______________________________________________
Mauve-users mailing list
Mauve-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/mauve-users