[BioC] limma reading genpix files

Gordon Smyth smyth at wehi.edu.au
Tue Apr 13 23:43:38 CEST 2004


Dear Dave,

Thanks for the diagnosis. You are quite correct that read.maimages() does 
assume that all the gpr files in a batch have headers of the same number of 
lines, and this assumption should be relaxed. As it happens, the assumption 
has always been true for genepix data sets that I have seen.

Gordon

At 03:53 AM 14/04/2004, David Nelson wrote:
>Hello all,
>
>I've just started plowing thru some two-color arrays and ran across the 
>same bug.
>
>The problem I found is due to an efficiency hack that doesn't always work: 
>read.maimages assumes that all the genepix files it reads have the same 
>number of header records. That is, rather than taking the number of header 
>records directly from each .gpr file, it counts the number of lines until 
>the header line in the *first file only*, and then assumes that all the 
>rest of the files have the same number of records.
>
>If not, then the skip= parameter is incorrect, read.table() starts in the 
>wrong place, and results are, as they say, unpredictable.
>
>The "right" way to do this is to read the first lines of each .gpr file, 
>get the number of header records, and then use read.table with the right 
>number of header records. A quick hack is just to search for the header in 
>each file.
>
>I've attached a modification to read.maimages() that does exactly that, at 
>least the three times I've tried it :-)
>
>
>Cheers,
>
>Dave Nelson



More information about the Bioconductor mailing list