[BioC] Reading Agilent files using read.maimages

Steve Taylor staylor at molbiol.ox.ac.uk
Fri Aug 17 16:32:16 CEST 2007


Hi Misha,

> 
> The files you downloaded from ArrayExpress are in fact in its own  
> format. For this experiment, in case you might find it useful, we  
> placed the original Agilent raw data files as given to us by the  
> submitter at ftp://ftp.ebi.ac.uk/pub/databases/microarray/data/ 
> experiment/MEXP/E-MEXP-668/E-MEXP-668.original-raw-data.tar.gz.

Yes, that would have been useful to know. It would be nice if this was clearer in ArrayExpress, since it is confusing if you don't know the format...

Regards,

Steve

> 
> Hope this helps.
> 
> --Misha K.
> Microarray Informatics Team, EBI
> 
> On 16 Aug 2007, at 14:35, Steve Taylor wrote:
> 
>> Hi,
>>
>> I removed the prefixes from the file headers and they eventually  read 
>> in.
>>
>> I did get:
>>
>> Read ./E-MEXP-668-raw-data-921408043.txt
>> Read ./E-MEXP-668-raw-data-921408048.txt
>> Read ./E-MEXP-668-raw-data-921408053.txt
>> Read ./E-MEXP-668-raw-data-921408058.txt
>> Error in RG[[a]][, i] <- obj[, columns[[a]]] :
>>          number of items to replace is not a multiple of  replacement 
>> length
>>
>> which I think is due to inconsistent numbers of columns in some  files 
>> (and read.maimages doesn't seem to cope with), but I managed  to get 
>> round this by reording the targets file. I have got the data
>> to normalise now.
>>
>> Thank you all very much for helping me on this problem.
>>
>> Steve
>>
>>
>>> I have download the files and it seems that each file has its own
>>> header. By this I mean that a prefix has been add to each standard  
>>> column.
>>> For example, for file "E-MEXP-668-raw-data-921408043.txt" every  column
>>> start with "H_C1 A_mock A and B/" whereas in file
>>> "E-MEXP-668-raw-data-921408128.txt" it is "H_C52 B_mock A and B/". In
>>> order to make the read.maimages work you should remove those  
>>> prefixes. I
>>> have changed it for a couple of slides and it worked for me.
>>>
>>> targets <-read.("AE.csv", path=".",sep=",")
>>> RG <- read.maimages(targets$FileName[c 
>>> (1,16)]),path=".",source="agilent")
>>> Read ./E-MEXP-668-raw-data-921408043.txt
>>> Read ./E-MEXP-668-raw-data-921408118.txt
>>>
>>>> str(RG$G)
>>>
>>>
>>>  num [1:22153, 1:2] 960.4  73.1  83.3 648.1 132.9 ...
>>>  - attr(*, "dimnames")=List of 2
>>>   ..$ : NULL
>>>   ..$ : chr [1:2] "E-MEXP-668-raw-data-921408043"
>>> "E-MEXP-668-raw-data-921408118"
>>>
>>> Hope it helps,
>>> Nolwenn
>>>
>>> Scott Reagan Franklin wrote:
>>>
>>>> Steve:
>>>>
>>>> It should be noted that when you are using read.maimages with
>>>> source="agilent" and you are using the actual Agilent Raw Data file,
>>>> you will, by default, have RG$R and RG$G equal to the rMeanSignal  and
>>>> gMeanSignal.  In order to specify that they be equal to the
>>>> rProcessedSignal and gProcessedSignal you need to add an option to
>>>> read.maimages:
>>>>
>>>> RG<-read.maimages(targets$FileName,path=".",columns=list 
>>>> (R="rProcessedSignal",G="gProcessedSignal));
>>>>
>>>> You can also specify which background signal is used as well by
>>>> including in Rb and Gb in the list above.  By default Rb and Gb are
>>>> chosen as the median background signals.
>>>>
>>>



More information about the Bioconductor mailing list