[BioC] hgu133plus2 GO issues

Jake jjmichael at comcast.net
Tue Apr 18 17:59:58 CEST 2006


Hi list,

Could someone please help me understand the differences between the
(hgu133plus2)GO, GO2PROBE, GO2ALLPROBES?  I've found discepancies that I
can't quite explain:

 > mget("GO:0042611", hgu133plus2GO2PROBE)
Error: value for 'GO:0042611' not found

> mget("GO:0042611", hgu133plus2GO2ALLPROBES)
$"GO:0042611"
          <NA>            IEA            IEA            IEA
<NA>
   "209309_at"  "217014_s_at"    "210325_at"  "218831_s_at"
"1553402_a_at"
          <NA>           <NA>           <NA>           <NA>
<NA>
 "206086_x_at"  "206087_x_at"  "210864_x_at"  "211326_x_at"
"211327_x_at"
          <NA>           <NA>           <NA>           <NA>
<NA>
 "211328_x_at"  "211329_x_at"  "211330_s_at"  "211331_x_at"
"211332_x_at"
          <NA>           <NA>           <NA>            IEA
<NA>
 "211863_x_at"  "211866_x_at"  "214647_s_at"    "235754_at"
"213932_x_at"
           IEA           <NA>           <NA>            IEA
<NA>
 "215313_x_at"  "208729_x_at"  "209140_x_at"  "211911_x_at"
"208812_x_at"
          <NA>           <NA>            IEA           <NA>
<NA>
 "211799_x_at"  "214459_x_at"  "216526_x_at"    "200904_at"
"200905_x_at"
           IEA           <NA>           <NA>            IEA
<NA>
 "217456_x_at"  "204806_x_at"  "221875_x_at"    "221978_at"
"210514_x_at"
          <NA>           <NA>           <NA>            IEA
IEA
 "211528_x_at"  "211529_x_at"  "211530_x_at"  "217436_x_at"
"231748_at"
          <NA>            IEA            IEA            IEA
   "221291_at"    "238542_at"    "221323_at" "1552777_a_at"

and finally...

### "208729_x_at" is one of the probes returned with the above command
> grep("GO:0042611",unlist(mget("208729_x_at", hgu133plus2GO)))
numeric(0)



"208729_x_at" is on the hgu133plus2 chip, but GO and GO2ALLPROBES don't
map it to the same GO ID.

Is there something wrong here or am I just missing something?  If
different, which is the most "reliable" mapping?  I'm concerned because
I went through to validate GO IDs I had gotten from the GOHyperG
function (a total of 314), and 117 of those I could not map back to my
significant probe list using the hgu133plus2GO annotation.  I noticed by
looking at the GOHyperG function that it uses information from
GO2ALLPROBES.

Any help/enlightenment is much appreciated.

PS - using R 2.2.1 with hgu133plus2 1.10.0

--Jake



More information about the Bioconductor mailing list