[BioC] Experimental design for RNA-Seq

Sun May 30 13:03:08 CEST 2010

Hi,

We have just completed sequencing the RNA-Seq libraries from a porcine challenge experiment: two treatments (bacteria) and five time points after challenge (T0, T6, T12, T24 and T48 hours post-infection), for a total of 48 samples (5-6 samples per treatment x time combination).
A single RNA-Seq library was generated from each sample (so no true technical replication), and the 48 libraries were sequenced as 12-plexes on four flowcells using the Illumina index system (4 lanes of 12-plexed samples per flowcell, so all 48 samples were sequenced on each flowcell).
The samples in each 12-plex were chosen so that the treatment x time combinations are balanced within each plex.
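
The balancing described above can be sketched roughly as follows. This is only an illustration of the idea, not the procedure actually used in the experiment: the sample names, the treatment labels "A" and "B", and the group sizes are all made up, and `allocate_balanced` is a hypothetical helper.

```python
from collections import defaultdict

def allocate_balanced(samples, n_plexes):
    """Deal samples into plexes so each treatment x time group is spread evenly.

    samples: list of (sample_id, treatment, time) tuples.
    Returns a list of n_plexes lists of sample ids.
    """
    groups = defaultdict(list)
    for sample_id, treatment, time in samples:
        groups[(treatment, time)].append(sample_id)

    plexes = [[] for _ in range(n_plexes)]
    next_plex = 0  # carried over between groups so plex sizes stay even
    for key in sorted(groups):
        for sample_id in groups[key]:
            plexes[next_plex].append(sample_id)
            next_plex = (next_plex + 1) % n_plexes
    return plexes

# Hypothetical layout (NOT the real design): 2 treatments x 4 times x 6 animals.
samples = [(f"{trt}_T{t}_{rep}", trt, t)
           for trt in ("A", "B")
           for t in (6, 12, 24, 48)
           for rep in range(1, 7)]
plexes = allocate_balanced(samples, 4)
for i, plex in enumerate(plexes, start=1):
    print(f"plex {i}: {len(plex)} samples")
```

Dealing the samples out group by group means no plex gets more than one sample above its fair share from any treatment x time group, so a lane effect cannot line up with one group.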
When we started the experiment, Illumina did not recommend going below 12-plex. Illumina has since changed its recommendation, so that 2-, 3-, 6- and 12-plex indexing are all possible. The experiment could therefore have been run as 3-plex instead (each sample would then have been sequenced once rather than four times in four runs), but I still like the idea of sequencing all samples in each run.

After mapping, the counts for each library were combined across the three runs, giving more than 4 million sequences per sample.
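
As a minimal sketch of what "combining" could mean here, assuming the per-run counts for one library are simply summed gene by gene (the gene ids, the numbers and the `combine_runs` helper are hypothetical):

```python
from collections import Counter

def combine_runs(per_run_counts):
    """Sum gene-level read counts for one library across sequencing runs.

    per_run_counts: list of dicts mapping gene id -> read count, one per run.
    Genes missing from a run are treated as having zero reads in that run.
    """
    total = Counter()
    for run in per_run_counts:
        total.update(run)  # Counter.update adds counts instead of replacing them
    return dict(total)

# Made-up counts for a single library sequenced in three runs.
runs = [
    {"geneA": 10, "geneB": 0},
    {"geneA": 7, "geneB": 3},
    {"geneA": 12},
]
combined = combine_runs(runs)
print(combined)
```

Summing is only reasonable if the runs are repeated sequencing of the same library, which is the situation described in this post.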

Starting the analysis, I have found that the available packages (DESeq, DEGseq and edgeR) present examples for simple experiments (e.g. control vs challenge), but I wonder how to analyse a time-course experiment with two treatments.
Initially, I am going to compare each time-point to the control (within and across treatment) but it would be nice to take the interactions into account as well.
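
To make "taking the interactions into account" concrete, here is a sketch of the design matrix for a linear model with a treatment-by-time interaction (in R formula notation, `~ treatment * time`). It illustrates only the model structure and says nothing about whether any of the packages named above accepts such a design. The treatment labels "A" and "B" and both helper functions are hypothetical.

```python
def column_names(treatments, times):
    """Column labels: intercept, treatment, time, treatment:time."""
    trt = [f"trt{t}" for t in treatments[1:]]
    tim = [f"T{t}" for t in times[1:]]
    inter = [f"{a}:{b}" for a in trt for b in tim]
    return ["Intercept"] + trt + tim + inter

def design_row(treatment, time, treatments, times):
    """One row of the design matrix, using treatment (dummy) coding.

    The first entries of `treatments` and `times` are the reference levels.
    """
    trt = [1 if treatment == t else 0 for t in treatments[1:]]
    tim = [1 if time == t else 0 for t in times[1:]]
    # An interaction column is 1 only when both of its main effects are 1.
    inter = [a * b for a in trt for b in tim]
    return [1] + trt + tim + inter

treatments = ("A", "B")       # assumed labels for the two bacteria
times = (0, 6, 12, 24, 48)    # hours post-infection, T0 as reference

print(column_names(treatments, times))
print(design_row("B", 12, treatments, times))
```

In this coding, the coefficient for `trtB:T12` measures how the change from T0 to T12 differs between the two treatments, which is the kind of interaction effect asked about above.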

Best regards,
Jakob

-----Original Message-----
From: bioconductor-bounces at stat.math.ethz.ch [mailto:bioconductor-bounces at stat.math.ethz.ch] On Behalf Of michael watson (IAH-C)
Sent: 28 May 2010 18:04
To: 'Steve Lianoglou'; Naomi Altman
Cc: bioconductor
Subject: Re: [BioC] Experimental design for RNA-Seq

Great stuff, thanks Steve and Naomi.

I guess I was thinking of technical replicates simply as sequencing the same library on multiple occasions;  though creating two libraries out of one sample adds an extra layer of complexity.

What is the evidence (if any) that lane and/or library preparation can have an effect?
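
One way such a lane effect could be checked in one's own data (a sketch, not evidence either way): if the same library is sequenced in several lanes and the only variation is Poisson sampling, a gene's counts should split across lanes in proportion to lane depth. A Pearson chi-square statistic measures departure from that. The function name and the numbers below are made up for illustration.

```python
def lane_chi_square(gene_counts, lane_totals):
    """Pearson chi-square for one gene's counts across lanes of one library.

    Expected count per lane = gene's total count * lane's share of all reads.
    Returns (statistic, degrees_of_freedom).
    """
    if len(gene_counts) != len(lane_totals):
        raise ValueError("need one count and one lane total per lane")
    df = len(gene_counts) - 1
    gene_total = sum(gene_counts)
    if gene_total == 0:
        return 0.0, df  # no reads at all: nothing to test
    depth_total = sum(lane_totals)
    stat = 0.0
    for observed, depth in zip(gene_counts, lane_totals):
        expected = gene_total * depth / depth_total
        stat += (observed - expected) ** 2 / expected
    return stat, df

# Hypothetical gene seen in four lanes of equal depth.
stat, df = lane_chi_square([90, 110, 100, 100], [1_000_000] * 4)
print(stat, df)
```

The statistic is compared with a chi-square distribution on `df` degrees of freedom. If many genes give large values, the lanes differ by more than sampling noise alone. The approximation is poor when expected counts are very small.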

To adjust for lane effects, I guess one could multiplex each sample so that they're run on all lanes, and combine the counts at the end?

Hmmm
Mick

-----Original Message-----
From: Steve Lianoglou [mailto:mailinglist.honeypot at gmail.com] 
Sent: 28 May 2010 16:01
To: Naomi Altman
Cc: michael watson (IAH-C); bioconductor
Subject: Re: [BioC] Experimental design for RNA-Seq

Hi,

I just wanted to ask/make one point.

On Fri, May 28, 2010 at 9:17 AM, Naomi Altman <naomi at stat.psu.edu> wrote:
> At least from the stat theory point of view, the best design is equal
> numbers of biological samples (the more the better) for each condition and
> no technical reps.

Can you clarify a bit as to what you are referring to as a "technical
replicate" in this sense?

You could consider two lanes that are sequenced from the same library
as technical replicates, no? Or, by "technical replicate" do you mean
creating two libraries out of one sample?

If we're talking about the former, then I think there is lots of value
to be gained from, and perhaps even a need(?) for, running more than
one lane per library preparation -- and maybe the question would rather
be "how many lanes to run per library"?

What does the court think?

-steve

-- 
Steve Lianoglou
Graduate Student: Computational Systems Biology
 | Memorial Sloan-Kettering Cancer Center
 | Weill Medical College of Cornell University
Contact Info: http://cbio.mskcc.org/~lianos/contact
