Hi,

Sometimes we get reads to align that have been trimmed to zero length and
I'm wondering how these should be represented in SAM format.

Here's a pair as reported by Novoalign that had been trimmed by cutadapt
and one read of the pair is zero length

READID    77    *    0    0    *    *    0    0        *    PG:Z:novoalign
READID    141    *    0    0    *    *    0    0
GTGTAGATCTCGGTGGTCGCCGTATCATTAAAAAAAAAAGGGG
EEDDB:=<;A9/=C=@A;:<,1:<?@.0<./;;;AC.;;5@::    PG:Z:novoalign

The first read of the pair has a zero length SEQ field.

This pair fails with a parse error in Samtools Version: 1.2 (using htslib
1.2.1) but is accepted by Samtools Version: 0.1.19-44428cd.

What is a valid SAM record for a zero length read?

Thanks, Colin
------------------------------------------------------------------------------
Developer Access Program for Intel Xeon Phi Processors
Access to Intel Xeon Phi processor-based developer platforms.
With one year of Intel Parallel Studio XE.
Training and support from Colfax.
Order your platform today. http://sdm.link/xeonphi
_______________________________________________
Samtools-help mailing list
Samtools-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/samtools-help

Reply via email to