[BioC] faster way to get differential calls from pileup?

Martin Morgan mtmorgan at fhcrc.org
Sat Oct 16 22:33:11 CEST 2010


On 10/16/2010 11:54 AM, Hollis Wright wrote:
> Hi, all; I've got a pair of lanes of exome sequencing data; we've  
> generated pileup files from samtools and we're interested in looking  
> for discordant calls for quality control or snp discovery. As best I  
> can figure out the way to do this involves doing a findOverlaps and  
> the programatically iterating through the match matrix to get the  
> matching positions and check for differences. However, the overlap  
> finding takes several hours, and since we anticipate there being many  

This sounds like it's taking longer than findOverlaps should be taking;
perhaps you are running out of memory (so process in batches, e.g., by
chromosome) or doing something inefficiently. What does your code look
like (simplified, if possible...)

Martin

> lanes in the future I'm curious if there's a faster or better way to  
> go about this sort of process. Thanks...
> 
> Hollis Wright
> 
> Sent from my iPhone
> 
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor


-- 
Computational Biology
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109

Location: M1-B861
Telephone: 206 667-2793



More information about the Bioconductor mailing list