[BioC] outlier removal from gene chip

Weiwei Shi helprhelp at gmail.com
Tue Sep 19 19:06:00 CEST 2006


some added info:
V1 is gene id, but each row represents a probe. so there could be
multiple rows with the same V1 since they (those probes) correspond to
the same gene.

On 9/19/06, Weiwei Shi <helprhelp at gmail.com> wrote:
> dear listers:
>
> I have a question on whether bioconductor has some tool-kit to detect
> outliers and remove them.
>
> my original dataset looks like this:
>             V1       V51       V53        V55       V57
> 1   -493249600  1.459459 -3.069444  -1.300000  1.935484
> 2  -1613096495 -1.139269 -5.525281 -16.592593 -1.831978
> 3   1626196571 -3.500000 -1.011662   2.223881  3.921053
> 4  -1397009217 -3.571429  1.685714  -1.180297 -6.807692
> 5   1428659728 -1.405405 -1.469004  -4.779754 -1.033708
> 6    459853658 -2.158879 -7.510823  -1.085581 -9.382979
> 7    530182506 -1.431677 -1.336343  -3.126437  4.878788
> 8   1173842263  1.215385  1.856410  -2.059794 -6.020833
> 9        28847  2.407895 -2.048889  -1.730337 -1.178947
> 10 -1961875610  2.864159 -2.301234  -4.733264 -1.172058
>
> V1: internal probe id
> the rests are different samples. the cells are fold-change of disease/normal.
>
> summary of the sample columns( V51, ... V57) gives the following:
>       V51                V53                 V55                V57
>  Min.   :-482.000   Min.   : -55.7342   Min.   :-122.074   Min.   :-14086.750
>  1st Qu.:  -2.159   1st Qu.:  -1.7312   1st Qu.:  -2.125   1st Qu.:    -1.831
>  Median :  -1.199   Median :  -1.0416   Median :  -1.200   Median :    -1.080
>  Mean   :  -0.918   Mean   :   0.1662   Mean   :  -1.027   Mean   :    -1.874
>  3rd Qu.:   1.441   3rd Qu.:   1.5721   3rd Qu.:   1.419   3rd Qu.:     1.521
>  Max.   : 198.434   Max.   :1478.1639   Max.   :  95.768   Max.   :   683.519
>
>
> My question is, is there any package which can detect those outliers
> (like -14086.750)and remove them and get an "average" for each gene
> (instead of each probe)?
>
> Thank you.
>
> Weiwei
>
> --
> Weiwei Shi, Ph.D
> Research Scientist
> GeneGO, Inc.
>
> "Did you always know?"
> "No, I did not. But I believed..."
> ---Matrix III
>


-- 
Weiwei Shi, Ph.D
Research Scientist
GeneGO, Inc.

"Did you always know?"
"No, I did not. But I believed..."
---Matrix III



More information about the Bioconductor mailing list