A solution would be to go through SAM, add the extra tag and convert back to 
BAM:

samtools view -H old.bam > tmp.sam
samtools view old.bam | awk '{printf "%s\tOQ:Z:%s\n", $0, $11}' >> tmp.sam &&
samtools view -bo new.bam tmp.sam
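
With over 1000 files it may be preferable to stream the records rather than write a temporary SAM file for each one. The following is a minimal sketch under a few assumptions, not a tested production script: the function names add_oq_tag and convert_bam are made up for illustration, header lines are recognised by a leading "@", and records that already carry an OQ tag are passed through unchanged so the tag is never duplicated.

```shell
# Append an OQ:Z: tag holding a copy of column 11 (QUAL) to every
# alignment record read from stdin. Header lines (starting with "@")
# and records that already carry an OQ tag are passed through unchanged.
add_oq_tag() {
  awk 'BEGIN { FS = OFS = "\t" }
       /^@/ { print; next }
       {
         for (i = 12; i <= NF; i++)
           if (substr($i, 1, 5) == "OQ:Z:") { print; next }
         print $0, "OQ:Z:" $11
       }'
}

# Hypothetical wrapper: stream one BAM through the filter and write a
# new BAM, with no intermediate SAM file on disk.
convert_bam() {
  in_bam=$1
  out_bam=$2
  samtools view -h "$in_bam" | add_oq_tag | samtools view -b -o "$out_bam" -
}
```

A loop such as `for f in *.bam; do convert_bam "$f" "with_oq/$f" || break; done` would then process a whole directory; the with_oq output directory is only an example name and must exist beforehand.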


From: Justin Jeyakani <gnanak...@gis.a-star.edu.sg>
Sent: 20 April 2020 14:09
To: samtools-help@lists.sourceforge.net
Subject: [Samtools-help] read and write a bam: but duplicate the quality column 
(11) and write one with "OQ:Z:" flag [EXT]

Hi All,

I have over 1000 BAM files that I need to read and write back out as BAM 
files. Each output BAM should contain the header and the alignments, but with 
column 11 (quality) repeated: the second copy should be stored as an "OQ:Z:" 
tag. The script/package/tool should be robust. I would appreciate a script or 
tool that does this. Thanks in advance.

**eg:**
**input bam::**

    E00579:50:HK2VJALXX:6:1220:15300:41040  2115    chr1    9999    0       
90H60M  chr5    18606598        0       
GATAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAAC    
JJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJFJFJJJJFJJJJJFFJJJJJJJJJFJ<A    
SA:Z:chr5,18606834,-,51S99M,37,0;     MD:Z:60 PG:Z:MarkDuplicates     
RG:Z:HK2VJALXX.6        NM:i:0  AS:i:60 XS:i:58

    E00579:50:HK2VJALXX:6:1212:4066:24884   113     chr1    9999    0       
34S60M56S       chr5    18606897        0       
CCTAGAACAGCTCTTCCTTTATTTTCTTTTTCTGGATAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACAGAGATAACTATTGATACAACACCTTCATGACCCTAAGGTACTATCATAGAGTTCT    
<<-FA-7<AFF<<<7<FJAF-JFJFF-J<JJAAJFFF-AAJJJJFJJJJJAJJJJJJFAJJJJJJJJFJJJJJJJJJJF<FJJJFJJJJFJJFJFJJJJJJJFJJJJJJJJJJFJJJJFJJJJJJJJFJJJJJJJJJJJJJJJJJFFFAA
  SA:Z:chr5,18606769,+,58M92S,0,0;chr5,18606834,+,107S43M,0,0;    MD:Z:60 
PG:Z:MarkDuplicates     RG:Z:HK2VJALXX.6   NM:i:0     AS:i:60 XS:i:59


**output bam::**

    E00579:50:HK2VJALXX:6:1220:15300:41040  2115    chr1    9999    0       
90H60M  chr5    18606598        0       
GATAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAAC    
JJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJFJFJJJJFJJJJJFFJJJJJJJJJFJ<A    
SA:Z:chr5,18606834,-,51S99M,37,0;     MD:Z:60 PG:Z:MarkDuplicates     
RG:Z:HK2VJALXX.6        NM:i:0  AS:i:60 XS:i:58 
**OQ:Z:JJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJFJFJJJJFJJJJJFFJJJJJJJJJFJ<A**

    E00579:50:HK2VJALXX:6:1212:4066:24884   113     chr1    9999    0       
34S60M56S       chr5    18606897        0       
CCTAGAACAGCTCTTCCTTTATTTTCTTTTTCTGGATAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACAGAGATAACTATTGATACAACACCTTCATGACCCTAAGGTACTATCATAGAGTTCT    
<<-FA-7<AFF<<<7<FJAF-JFJFF-J<JJAAJFFF-AAJJJJFJJJJJAJJJJJJFAJJJJJJJJFJJJJJJJJJJF<FJJJFJJJJFJJFJFJJJJJJJFJJJJJJJJJJFJJJJFJJJJJJJJFJJJJJJJJJJJJJJJJJFFFAA
  SA:Z:chr5,18606769,+,58M92S,0,0;chr5,18606834,+,107S43M,0,0;    MD:Z:60 
PG:Z:MarkDuplicates     RG:Z:HK2VJALXX.6   NM:i:0     AS:i:60 XS:i:59  
**OQ:Z:<<-FA-7<AFF<<<7<FJAF-JFJFF-J<JJAAJFFF-AAJJJJFJJJJJAJJJJJJFAJJJJJJJJFJJJJJJJJJJF<FJJJFJJJJFJJFJFJJJJJJJFJJJJJJJJJJFJJJJFJJJJJJJJFJJJJJJJJJJJJJJJJJFFFAA**




