[BioC] Multifactorial analysis with RMA and LIMMA of Affymetrix microarrays

Tue Aug 10 09:12:31 CEST 2004

Here is the answer

HTH

Jordi

BioC] Multifactorial analysis with RMA and LIMMA of Affymetrix microarrays
New Search:

From: Gordon Smyth (smyth at landfield.com)
Date: Thu Mar 18 2004 - 02:14:43 EST

Next message: Rafael A. Irizarry: "[BioC] Subsetting Affybatch objects by 
gene list."
Previous message: Helen Cattan: "[BioC] limma, background correction and 
duplicate spots"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

--------------------------------------------------------------------------------

At 02:48 AM 18/03/2004, Jordi Altirriba Guti?rrez wrote:
>Thank you very much Gordon for your quick answer! My phenoData is:
>>pData(eset)
>DIABETES TREATMENT DNT1 TRUE FALSE DNT2 TRUE FALSE DNT3 TRUE FALSE DT1 TRUE 
>TRUE DT2 TRUE TRUE DT3 TRUE TRUE SNT1 FALSE FALSE SNT2 FALSE FALSE SNT3 
>FALSE FALSE ST1 FALSE TRUE ST2 FALSE TRUE ST3 FALSE TRUE
>
>(DNT=Diabetic untreated, DT=Diabetic treated, SNT=Health treated, ST=Health 
>untreated)
>
>I want to know the genes characteristics of the diabetes, the treatment and 
>the treatment + diabetes. Moreover when I analyse my data with SAM and I 
>compare Health treated vs the Health untreated I don't see many 
>differences, but when I compare the Diabetic treated vs the Diabetic 
>treated I see a lot of differences, so is correct to apply a 2 x 2 
>factorial design?

You simply need to fit a model which contains four coefficient which
distinguish your four groups. The classical 2x2 model is just one
particular parametrization you can use:

design <- model.matrix( ~ DIABETES*TREATMENT, data=pData(eset))
fit <- lmFit(eset, design)

>Is LIMMA the correct tool to answer my questions? If it is the correct 
>tool, how can I do a factorial design matrix (if to do a factorial design 
>is correct)? (Robert Gentleman has suggested me to use the factDesign).

You're just fitting a linear model, so the above calculation is exactly
equivalent to what factDesign does, although probably a bit faster. I would
use limma myself because it allows you go on to do empirical Bayes
moderation of the residual standard deviations etc, which I think it
important, but Robert may be able to make a further case for factDesign.

Cheers
Gordon

>Thank you very much for your time, patience and your suggestions. Yours 
>sincerely,

--------------------------------------------------------------------------------

Next message: Rafael A. Irizarry: "[BioC] Subsetting Affybatch objects by 
gene list."
Previous message: Helen Cattan: "[BioC] limma, background correction and 
duplicate spots"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

--------------------------------------------------------------------------------

This archive was generated by hypermail 2.0.0 : Fri Jul 16 2004 - 13:13:33 
EDT

>From: Peter Lee <peter.d.lee at mcgill.ca>
>To: Jordi Altirriba Gutiérrez <altirriba at hotmail.com>
>CC: bioconductor at stat.math.ethz.ch
>Subject: Re: [BioC] Multifactorial analysis with RMA and LIMMA of 
>Affymetrix microarrays
>Date: Thu, 5 Aug 2004 14:42:12 -0400
>
>What eventually was the correct design matrix for this dataset?
>
>Peter
>
>On Mar 17, 2004, at 10:48 AM, Jordi Altirriba Gutiérrez wrote:
>
>>Thank you very much Gordon for your quick answer!
>>My phenoData is:
>>>pData(eset)
>>     DIABETES TREATMENT
>>DNT1     TRUE       FALSE
>>DNT2     TRUE       FALSE
>>DNT3     TRUE       FALSE
>>DT1      TRUE        TRUE
>>DT2      TRUE        TRUE
>>DT3      TRUE        TRUE
>>SNT1    FALSE       FALSE
>>SNT2    FALSE       FALSE
>>SNT3    FALSE       FALSE
>>ST1     FALSE        TRUE
>>ST2     FALSE        TRUE
>>ST3     FALSE        TRUE
>>
>>(DNT=Diabetic untreated, DT=Diabetic treated, SNT=Health treated,  
>>ST=Health untreated)
>>
>>I want to know the genes characteristics of the diabetes, the  treatment 
>>and the treatment + diabetes. Moreover when I analyse my  data with SAM 
>>and I compare Health treated vs the Health untreated I  don't see many 
>>differences, but when I compare the Diabetic treated vs  the Diabetic 
>>treated I see a lot of differences, so is correct to  apply a 2 x 2 
>>factorial design?
>>Is LIMMA the correct tool to answer my questions? If it is the correct  
>>tool, how can I do a factorial design matrix (if to do a factorial  design 
>>is correct)? (Robert Gentleman has suggested me to use the  factDesign).
>>Thank you very much for your time, patience and your suggestions.
>>Yours sincerely,
>>
>>
>>>From: Gordon Smyth <smyth at wehi.edu.au>
>>>To: Jordi Altirriba Gutiérrez <altirriba at hotmail.com>
>>>CC: bioconductor at stat.math.ethz.ch
>>>Subject: Re: [BioC] Multifactorial analysis with RMA and LIMMA of   
>>>Affymetrix microarrays
>>>Date: Wed, 17 Mar 2004 11:32:16 +1100
>>>
>>>At 07:55 AM 17/03/2004, Jordi Altirriba Gutiérrez wrote:
>>>>(Sorry, but I've had some problems with the HTML)
>>>>Hello all!
>>>>I am a beginner user of R and Bioconductor, sorry if my questions  have 
>>>>already been discussed previously.
>>>>I am studying the effects of a new hypoglycaemic drug for the  treatment 
>>>>of diabetes and I have done this classical study:
>>>>4 different groups:
>>>>1.- Healthy untreated
>>>>2.- Healthy treated
>>>>3.- Diabetic untreated
>>>>4.- Diabetic treated
>>>>With 3 biological replicates of each group, therefore I have done 12  
>>>>arrays (Affymetrix).
>>>>I have treated the raw data with the package RMA of Bioconductor  
>>>>according to the article ”Exploration, normalization and summaries  of 
>>>>high density oligonucleotide array probe level data”  (Background=RMA, 
>>>>Normalization=quantiles, PM=PMonly,  Summarization=medianpolish).
>>>>I am currently trying to analyse the object eset with the package  LIMMA 
>>>>of Bioconductor. I want to know what genes are differentially  expressed 
>>>>due to diabetes,  to the treatment and to the combination  of both 
>>>>(diabetes + treatment), being therefore an statistic  analysis similar 
>>>>to a two-ways ANOVA).
>>>>So, my questions are:
>>>>1.- I have created a PhenoData in RMA, will the covariates of the  
>>>>PhenoData have any influence in the analysis of LIMMA?
>>>
>>>Not automatically.
>>>
>>>>2.- Are these commands correct to get these results? (see below) In  the 
>>>>command TopTable, the output of coef=1 are the genes  characteristics of 
>>>>diabetes?
>>>
>>>No.
>>>
>>>>3.- If I do not see any effect of the treatment in the healthy  
>>>>untreated rats should I design the matrix differently? Something  
>>>>similar to a one-way-ANOVA, considering differently the four groups:
>>>>( > design<-model.matrix(~ -1+factor(c(1,1,1,2,2,2,3,3,3,4,4,4)))  ).
>>>
>>>This design matrix would be very much better, i.e., it would be a  
>>>correct matrix. You could then use contrasts to test for differences  and 
>>>interaction terms between your four groups, and that would do the  job.
>>>
>>>If you tell us what's in your phenoData slot, i.e., type pData(eset),  
>>>then we might be able to suggest another approach analogous to the  
>>>classical two-way anova approach.
>>>
>>>Gordon
>>>
>>>>5.- Any other idea?
>>>>Thank you very much for your time and your suggestions.
>>>>Yours sincerely,
>>>>
>>>>Jordi Altirriba (PhD student, Hospital Clínic  IDIBAPS, Barcelona,  
>>>>Spain)
>>>>
>>>>>design<- 
>>>>>cbind("disease"=c(1,1,1,1,1,1,0,0,0,0,0,0),"treatment"=c(0,0,0,1,1,1 
>>>>>,0,0,0,1,1,1))
>>>>>fit<-lmFit(eset,design)
>>>>>contrast.matrix<-cbind("diabetes"=c(1,0),"drug"=c(0,1),"diabetes- 
>>>>>drug"=c(1,1))
>>>>>rownames(contrast.matrix)<-colnames(design)
>>>>>design
>>>>      disease treatment
>>>>[1,]        1           0
>>>>[2,]        1           0
>>>>[3,]        1           0
>>>>[4,]        1           1
>>>>[5,]        1           1
>>>>[6,]        1           1
>>>>[7,]        0           0
>>>>[8,]        0           0
>>>>[9,]        0           0
>>>>[10,]        0           1
>>>>[11,]        0           1
>>>>[12,]        0           1
>>>>>contrast.matrix
>>>>            diabetes drug diabetes-drug
>>>>diabetes      1         0             1
>>>>tratamiento   0         1             1
>>>>>fit2<-contrasts.fit(fit,contrast.matrix)
>>>>>fit2<-eBayes(fit2)
>>>>>clas<-classifyTests(fit2)
>>>>>vennDiagram(clas)
>>>>>topTable(fit2,number=100,genelist=geneNames(eset),coef=1,adjust="fdr ")
>>>>                     Name        M        t      P.Value        B
>>>>11590          1387471_at 19.32575 9.442926 2.743122e-17 29.95299
>>>>2500           1369951_at 19.24748 9.404683 2.743122e-17 29.66737
>>>>10652          1384778_at 19.22968 9.395981 2.743122e-17 29.60254
>>>>

>>>>>sink("limma-diabetes.txt")
>>>>>topTable(fit2,number=1000,genelist=geneNames(eset),coef=1,adjust="fd 
>>>>>r")
>>>>>sink()
>>>
>>
>>_________________________________________________________________
>>Reparaciones, servicios a domicilio, empresas, profesionales... Todo  en 
>>la guía telefónica de QDQ. http://qdq.msn.es/msn.cfm
>>
>>_______________________________________________
>>Bioconductor mailing list
>>Bioconductor at stat.math.ethz.ch
>>https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor
>>
>

más opciones. http://www.msn.es/Viajes/