[BioC] Converting gene names into Illumina IDs

Aliaksei Holik salvador at bio.bsu.by
Thu Sep 27 18:05:44 CEST 2012


Dear fellow Bioconductors,

I'm faced with a problem I can't get my head round and hope somebody 
would be able to point me in the right direction.

I am trying to plot a heatmap using expression values from my array for 
an external set of genes. I have used illuminaMousev2ALIAS2PROBE object 
to extract IlluminaIDs corresponding to the gene symbols in the set. 
Consistent with possibility of more than 1 probe per gene I got 1052 
IlluminaIDs for 510 gene names. However, if I try to remove duplicates I 
only get 259 IlluminaIDs, which makes no sense to me. I have checked and 
I do indeed get a lot of duplicated probe IDs. I wonder where I go 
wrong. Here's the code I used:


# Generate a list of gene symbols with corresponding Illumina IDs
xx <- as.list(illuminaMousev2ALIAS2PROBE)
# Subset all Illumina IDs for the genes present in SCSGenes vector
scs.probes.and.genes <- xx[SCSGenes]

# Generate a vector of probes while removing gene names
scs.probes <- as.character(unlist(scs.probes.and.genes))   #1058 probes
scs.probes <- na.omit(scs.probes)	 #1052 probes

# Remove duplicates
scs.probes <- scs.probes[!duplicated(scs.probes)] #259 probes
# end of code

Any help is much appreciated.

All the best,

Aliaksei.



More information about the Bioconductor mailing list