[BioC] Mapping onto KO numbers (KEGG Orthology)

James W. MacDonald jmacdon at med.umich.edu
Thu Apr 8 15:05:44 CEST 2010


Hi Alla,

Alla Bulashevska wrote:
> Dear Bioconductor users, 
> i have RefSeq gene descriptions and also UniProt
> identifiers.
> How can i map this onto KO numbers (KEGG Orthology), e.g.
> K03120 ?

I would use the org.Xx.eg.db package for your organism. Here is an 
example using the human package.

First convert UniProt IDs to Entrez Gene IDs

egs <- mget(<UniProt IDs>, revmap(org.Hs.eg.dbUNIPROT))

you should check the resulting list for any missing or duplicate values 
and deal with those in whatever manner you find useful. You then need to 
convert to a character vector using e.g., unlist()

Now convert Entrez Gene IDs to KEGG IDs

kegg <- mget(<Entrez Gene ID vector>, org.Hs.egPATH)

again, you may need to deal with duplicates.

Best,

Jim


> 
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

-- 
James W. MacDonald, M.S.
Biostatistician
Douglas Lab
University of Michigan
Department of Human Genetics
5912 Buhl
1241 E. Catherine St.
Ann Arbor MI 48109-5618
734-615-7826
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues 



More information about the Bioconductor mailing list