[BioC] Understanding limma, fdr and topTable
naomi at stat.psu.edu
Tue Jul 8 19:46:48 CEST 2008
I agree with Aaron.
At 11:17 AM 7/8/2008, aaron.j.mackey at gsk.com wrote:
> > I would add that removing those genes that are unchanged in any sample
> > will also help reduce the multiplicity problem. Regardless of the
> > expression level, those genes that never change expression are
> > uninteresting by default, so e.g., if beta-actin is highly expressed at
> > the same level in all samples we don't really care to test for
> > differential expression for that gene since it apparently is not
> > differentially expressed.
>This doesn't make sense. How can I choose to filter out "unchanged"
>probesets without fitting a model of some sort, and making a probabilistic
>decision for each probeset about whether it is "unchanged" or not? Every
>probeset (save those below the detection limit) will exhibit variance
>(though the variance may be below the precision of the instrument to
>measure), right? You're not suggesting that there are some probesets with
>It seems to me that this approach leads to a false/erroneous reduction in
>the multiplicity problem, as you've just moved the hypothesis testing into
>a separate "phase" of the analysis. And it also would mess up pooled
>variance estimates such as those used in eBayes-based methods (e.g.
>So, while I might be willing to filter out known "dead" probesets (that I
>never see above detection threshold over many hundreds of assays), I'm in
>the camp that the statistics are corrupt if you filter without regard to
>its effect on multiplicity corrections.
>As an aside, it should be possible to fit some of the models using
>truncated/censored distributions (wherein the statistical model gets to
>know that there were X number of probesets with values < threshold, but
>doesn't pretend that those values are real). That's an idea for the model
>developers to ponder ...
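
To make the multiplicity point concrete, here is a minimal sketch of the Benjamini-Hochberg adjustment applied to the same raw p-values with and without a filter. The p-values, the 100/20 probeset counts and the name bh_adjust are all made up for illustration; plain Python is used only to keep the example self-contained (in R the same adjustment is available as p.adjust(p, method="BH")).

```python
def bh_adjust(pvals):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, so each adjusted value is the
    # minimum of p * m / rank over its own rank and all higher ranks.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw p-values: 5 small ones plus 95 from "unchanged" probesets.
small_p = [0.0005, 0.001, 0.002, 0.004, 0.008]
null_p = [0.05 + 0.01 * k for k in range(95)]

# All 100 probesets tested.
adj_all = bh_adjust(small_p + null_p)

# Assumed filter: only 15 of the "unchanged" probesets are kept, so 20 tests.
adj_filtered = bh_adjust(small_p + null_p[:15])

for raw, a, f in zip(small_p, adj_all, adj_filtered):
    print(f"raw={raw:.4f}  adjusted(100 tests)={a:.4f}  adjusted(20 tests)={f:.4f}")
```

The raw p-values are identical in both runs; only the number of tests changes, and the adjusted values shrink accordingly. That shrinkage is honest only if the filter uses no information about the hypothesis being tested, which is exactly what is in dispute above.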
Naomi S. Altman 814-865-3791 (voice)
Dept. of Statistics 814-863-7114 (fax)
Penn State University 814-865-1348 (Statistics)
University Park, PA 16802-2111