[BioC] calcNormFactors - normalization

Lana Schaffer schaffer at scripps.edu
Sat May 7 01:57:28 CEST 2011


Mark,
My gene library contains only 141 genes.  Is this low number
Alright in this model?
The length on the genes are not accounted for in this package edgeR?
Lana

-----Original Message-----
From: Mark Robinson [mailto:mrobinson at wehi.EDU.AU] 
Sent: Friday, May 06, 2011 4:55 PM
To: Lana Schaffer
Cc: 'bioconductor at r-project.org'
Subject: Re: [BioC] calcNormFactors - normalization

Hi Lana,

The factor (offset) that gets used in the statistical model is actually the *product* of lib.size and norm.factors, so the lower depth of library HCV_100d_2 is taken into account.

Mark

On 2011-05-07, at 9:49 AM, Lana Schaffer wrote:

> Greetings,
> Using d <- calcNormFactors(d)
> I get the following normalization factors.
> Why are the factors so similar when the the 4th count is 1/20 the counts as the rest?
> 
>> d$samples
>           group lib.size norm.factors
> HCV_45d_1    d45  7812615    1.0471701
> HCV_45d_2    d45  9728373    1.0004453
> HCV_100d_1  d100  8606449    0.9516424
> HCV_100d_2  d100   446991    1.0030340
> 
> Lana Schaffer
> Biostatistics, Informatics
> DNA Array Core Facility
> 858-784-2263
> 
> 
> 	[[alternative HTML version deleted]]
> 
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

------------------------------
Mark Robinson, PhD (Melb)
Epigenetics Laboratory, Garvan
Bioinformatics Division, WEHI
e: mrobinson at wehi.edu.au
e: m.robinson at garvan.org.au
p: +61 (0)3 9345 2628
f: +61 (0)3 9347 0852
------------------------------


______________________________________________________________________
The information in this email is confidential and intend...{{dropped:6}}



More information about the Bioconductor mailing list