[BioC] GenomicAlignments and QNAME collision

Valerie Obenchain vobencha at fhcrc.org
Thu May 8 17:54:12 CEST 2014


Hi Stefano,

No, the current mate-pairing doesn't handle the trailing values. I will 
implement this and post back when it's done.

For reference, where did you download your bam files or what application 
outputs QNAMEs in this format? I'd like to have some for test data.


Thanks.
Valerie


On 05/08/14 08:14, Stefano Calza wrote:
> Hi everybody
>
>
> I am using GenomicAlignments package to read RNAseq pair-end data. The
> problem is that readGAlignmentPairsFromBam, after setting asMates=TRUE
> in BamFile, returns 0 mates.
>
> The reason is that mates have different QNAMEs. Eg:
>
> UNC15-SN850:240:D148CACXX:3:1308:19719:99367/1
> UNC15-SN850:240:D148CACXX:3:1308:19719:99367/2
>
> that is the two mates have /1 or /2 at the end.
>
> I wrote a Python (and a cpp) program to fix it...but this takes still
> quite a substantial amount of time on big files.
>
> Does the mating algorithm allow for this? If so how?
>
> Regards
>
> Stefano
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor


-- 
Valerie Obenchain
Program in Computational Biology
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N, Seattle, WA 98109

Email: vobencha at fhcrc.org
Phone: (206) 667-3158



More information about the Bioconductor mailing list