[BioC] Pathview with minor species

Luo Weijun luo_weijun at yahoo.com
Mon Feb 3 03:58:53 CET 2014


I forgot to mention that in pathview vignette there is a dedicated section covering on species related issues, i.e. Section 7.5 “Working with species”:
http://bioconductor.org/packages/release/bioc/vignettes/pathview/inst/doc/pathview.pdf

I just noticed that locus tag is used as the default gene ID for KEGG pathways for B. subtilis (bsu). This is actually true for many bacteria and other minor species (including E coli K12). KEGG does not specify the default gene ID type explicitly, but you may find this out for your species in korg data (the kegg.geneid column).
> data(korg)
> ri=which(korg[,1]=="bsu")
> korg[ri,]
          kegg.code     scientific.name         common.name       entrez.gnodes
              "bsu" "Bacillus subtilis"                  ""                 "0"
        kegg.geneid         ncbi.geneid
         "BSU00020"            "939970"

So you can actually do it in a simple way. You don’t have to map your gene IDs, just specify gene.idtype="kegg" when calling pathview. My analysis with simulated bsu data works well.

bsu.dat.kegg <- sim.mol.data(mol.type="gene",id.type="kegg",species="bsu",nmol=3000)
head(bsu.dat.kegg)
   BSU13160    BSU11080    BSU23500    BSU02510    BSU19920    BSU20590 
-1.15259948  0.46416071  0.72893247  0.41061745 -1.46114720 -0.01890809 
pv.out <- pathview(gene.data = bsu.dat.kegg, gene.idtype="kegg",
pathway.id = "00010", species = "bsu", out.suffix = "bsu.kegg")

A similar example was given on E coli K12 in page 20 of the vignette. I would suggest to go over section 7.5 “Working with species” if you need to work on minor species.


--------------------------------------------


 Subject: Re: Pathview with minor species
 To: bioconductor at r-project.org
 Date: Wednesday, January 29, 2014, 3:25 PM

 Someone asked a similar question on
 using pathview or gage with honey bee. Here is the
 solution:
 http://seqanswers.com/forums/showthread.php?t=35472#6

 Like honey bee, B. subtilis (bsu) is not a major species
 with Bioconductor gene annotation package as those listed in
 data(bods) under pathview package. similarly you need to map
 Locus_tags to Entrez gene ID manually. Again, you can
 download the gene_info data file (for all bacteria) from
 NCBI ftp site:
 ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/GENE_INFO/Archaea_Bacteria/Bacteria.gene_info.gz
 Column 2-6 should be (Entrez) GeneID, Symbol, LocusTag,
 Synonyms, dbXrefs. Then follow the route described in the
 honey bee case.

 HTHs.

 --------------------------------------------
 On Wed, 1/29/14, David wrote:

  Subject: Pathview with minor species

  Date: Wednesday, January 29, 2014, 11:09 AM
  
  Dear Mr Luo,
  
  In our course we got B. subtilis (bsu) microarray data
  and I want to visualize some expression changes with
  Pathview. But I have a problem. We got Locus_tags like
  BSU00010 and no gene IDs and I don't know whether Pathview
  can use it or not. I really don't understand the part
 about
  it in the vignette. So I thought of converting them to
  UniProt IDs. But first I wanted to do a test run. So I
  searched for the UniProt IDs of 3 genes of the glycolysis
  from B. subtilis and put them into a data.frame with
  fictional data (dummy).
  
  library(pathview)
  dummy<-data.frame(name=c("P18159","P80860","P13243"),ratio=c(1,0,-1),row.names="name")
  dummy<-as.matrix(dummy)
  
  pv.out <- pathview(gene.data = dummy[, "ratio"],
  pathway.id = "00010",species = "bsu", out.suffix =
 "dummy",
  kegg.native=T, gene.idtype="UNIPROT")
  
  
  When I tried it this way, an error occured: No proper gene
[[elided Yahoo spam]]
  Afterwards I tested it with 3 genes from human and hsa as
  species and than it worked fine. Nothing I tried to solve
  the problem, worked. I have the latest version of
  Bioconductor and Pathview installed. Does Pathview not
 work
  with Bacillus subtilis subsp. subtilis 168 (=bsu) even
  though it's a KEGG organism? Or what am I doing wrong?
  
  I look forward to hearing from you.
  
  Sincerely yours,
  David 

  



More information about the Bioconductor mailing list