[BioC] error in plotKEGGGraph

Pierre Neuvial pierre at stat.berkeley.edu
Tue Sep 21 23:54:47 CEST 2010


Hi David,

Has there been any follow-up on this ? I bumped into the same error as Robert.

You asked for the file he used.  If you run the code sent by Robert,
you will get

> keggID <- "05213"
> tmpXML <- "testKGML1.xml"
> retrieveKGML(pathwayid=keggID, organism='mmu', destfile=tmpXML, method="wget")
--2010-09-21 12:23:04--
ftp://ftp.genome.jp/pub/kegg/xml/kgml/non-metabolic/organisms/mmu/mmu05213.xml
          => `/var/folders/R7/R7TMUh6WGeSJE0LtxqwHWk+++TI/-Tmp-//RtmpmGNCWB/file1eaf6b0f'
Resolving ftp.genome.jp... 133.103.100.184
Connecting to ftp.genome.jp|133.103.100.184|:21... connected.
Logging in as anonymous ... Logged in!
==> SYST ... done.    ==> PWD ... done.
==> TYPE I ... done.  ==> CWD
/pub/kegg/xml/kgml/non-metabolic/organisms/mmu ... done.
==> SIZE mmu05213.xml ... 16729
==> PASV ... done.    ==> RETR mmu05213.xml ... done.
Length: 16729 (16K)

100%[======================================>] 16,729      38.4K/s   in 0.4s

2010-09-21 12:23:07 (38.4 KB/s) -
`/var/folders/R7/R7TMUh6WGeSJE0LtxqwHWk+++TI/-Tmp-//RtmpmGNCWB/file1eaf6b0f'
saved [16729]

--2010-09-21 12:23:07--
ftp://ftp.genome.jp/pub/kegg/xml/kgml/non-metabolic/organisms/mmu/mmu05213.xml
          => `testKGML1.xml'
Resolving ftp.genome.jp... 133.103.100.184
Connecting to ftp.genome.jp|133.103.100.184|:21... connected.
Logging in as anonymous ... Logged in!
==> SYST ... done.    ==> PWD ... done.
==> TYPE I ... done.  ==> CWD
/pub/kegg/xml/kgml/non-metabolic/organisms/mmu ... done.
==> SIZE mmu05213.xml ... 16729
==> PASV ... done.    ==> RETR mmu05213.xml ... done.
Length: 16729 (16K)

100%[======================================>] 16,729      44.2K/s   in 0.4s

2010-09-21 12:23:09 (44.2 KB/s) - `testKGML1.xml' saved [16729]

That should be enough for you to reproduce the error.  If not, the file is here:

ftp://ftp.genome.jp/pub/kegg/xml/kgml/non-metabolic/organisms/mmu/mmu05213.xml

I've investigated this issue a bit more. The problem comes from the
fact that this KEGG pathway (as well as a fair proportion of KEGG
pathways) has at least one edge whose label is not understood by
KEGGgraph, in this case (quoted from the above KGML file):

   <relation entry1="20" entry2="25" type="PCrel">
       <subtype name="missing interaction" value="-/-"/>
   </relation>

The types of interaction known by KEGGgraph are retrieved from a data
object from the package:

> data(KEGGEdgeSubtype, package="KEGGgraph")
> as.character(KEGGEdgeSubtype[["name"]])
 [1] "compound"            "hidden compound"     "activation"
 [4] "inhibition"          "expression"          "repression"
 [7] "indirect"            "indirect effect"     "state change"
[10] "binding/association" "dissociation"        "phosphorylation"
[13] "dephosphorylation"   "glycosylation"       "ubiquitination"
[16] "methylation"

Because "missing interaction" is not known, the above error occurs.
An easy fix for this problem is to add a line with name=="missing
interaction" to this data.frame, and re-build the package.

I've also encountered another instance of the same problem:

> keggID <- "05213"
> tmpXML <- "testKGML2.xml"
> retrieveKGML(pathwayid=keggID, organism='mmu', destfile=tmpXML, method="wget")
--2010-09-21 13:56:37--
ftp://ftp.genome.jp/pub/kegg/xml/kgml/non-metabolic/organisms/mmu/mmu05213.xml
          => `/var/folders/R7/R7TMUh6WGeSJE0LtxqwHWk+++TI/-Tmp-//RtmpmGNCWB/file13f877e9'
Resolving ftp.genome.jp... 133.103.100.184
Connecting to ftp.genome.jp|133.103.100.184|:21... connected.
Logging in as anonymous ... Logged in!
==> SYST ... done.    ==> PWD ... done.
==> TYPE I ... done.  ==> CWD
/pub/kegg/xml/kgml/non-metabolic/organisms/mmu ... done.
==> SIZE mmu05213.xml ... 16729
==> PASV ... done.    ==> RETR mmu05213.xml ... done.
Length: 16729 (16K)

100%[======================================>] 16,729      41.4K/s   in 0.4s

2010-09-21 13:56:39 (41.4 KB/s) -
`/var/folders/R7/R7TMUh6WGeSJE0LtxqwHWk+++TI/-Tmp-//RtmpmGNCWB/file13f877e9'
saved [16729]

--2010-09-21 13:56:39--
ftp://ftp.genome.jp/pub/kegg/xml/kgml/non-metabolic/organisms/mmu/mmu05213.xml
          => `testKGML2.xml'
Resolving ftp.genome.jp... 133.103.100.184
Connecting to ftp.genome.jp|133.103.100.184|:21... connected.
Logging in as anonymous ... Logged in!
==> SYST ... done.    ==> PWD ... done.
==> TYPE I ... done.  ==> CWD
/pub/kegg/xml/kgml/non-metabolic/organisms/mmu ... done.
==> SIZE mmu05213.xml ... 16729
==> PASV ... done.    ==> RETR mmu05213.xml ... done.
Length: 16729 (16K)

100%[======================================>] 16,729      58.8K/s   in 0.3s

2010-09-21 13:56:42 (58.8 KB/s) - `testKGML2.xml' saved [16729]

> tmpGraph2 <- parseKGML2Graph(tmpXML, genesOnly=FALSE)
> plotKEGGgraph(tmpGraph2)
Error in .subtypeDisplay(object) :
 Given subtype 'ubiquination' is not found!

This actually comes from a typo in KEGG entries: ubiquitination has
been misspelled as "ubiquination" in
ftp://ftp.genome.jp/pub/kegg/xml/kgml/non-metabolic/organisms/mmu/mmu05213.xml
(and a bunch of other KEGG files too).

Again, this can be fixed by adding a line with name=="ubiquination" to
KEGGEdgeSubtype.
I've done this locally: edited the file
KEGGgraph/data/KEGGEdgeSubtype.csv, recompiled the KEGGgraph package,
and it works with no error.

David, could you do this kind of fix ?

More generally, it would of course be much better if the
.subtypeDisplay function did not throw an *error* but a *warning* when
encountering unknown interaction names.

Hope this helps,

Pierre

R version 2.11.1 Patched (2010-09-16 r52946)
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)

locale:
[1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] grid      stats     graphics  grDevices datasets  utils     methods
[8] base

other attached packages:
[1] KEGGgraph_1.5.0 Rgraphviz_1.26.0  graph_1.26.0      XML_2.8-1
[5] R.utils_1.5.2     R.oo_1.7.3        R.methodsS3_1.2.0 fortunes_1.3-7

loaded via a namespace (and not attached):
[1] tools_2.11.1

On Wed, Jul 21, 2010 at 2:31 PM, Jitao David Zhang
<davidvonpku at gmail.com> wrote:
> Hi dear Robert,
>
>   Thanks for reporting! Would you please send the file you used? I will try
> to figure out what went wrong.
>
>  Best wishes,
> David
>
> 2010/7/7 Robert M. Flight <rflight79 at gmail.com>
>
>> Hi All,
>>
>> I'm playing around with KEGGGraph to examine some pathways, and I am
>> running into some errors.
>>
>> When I try to plot some pathways, I get the error: "Error in
>> .subtypeDisplay(object) :  Given subtype 'missing interaction' is not
>> found!"
>>
>> Here is an example with one pathway that gives the error and another
>> that does not:
>>
>> library(KEGGgraph)
>>
>> # this one gives the error
>> keggID <- "05213"
>> tmpXML <- "testKGML1.xml"
>> retrieveKGML(pathwayid=keggID, organism='mmu', destfile=tmpXML,
>> method="wget")
>> tmpGraph1 <- parseKGML2Graph(tmpXML, genesOnly=FALSE)
>> plotKEGGgraph(tmpGraph1)
>>
>> # this one does not give an error, and spits out a very complicated graph
>> keggID <- "04810"
>> tmpXML <- "testKGML2.xml"
>> retrieveKGML(pathwayid=keggID, organism='mmu', destfile=tmpXML,
>> method="wget")
>> tmpGraph2 <- parseKGML2Graph(tmpXML, genesOnly=FALSE)
>> plotKEGGgraph(tmpGraph2)
>>
>> Any help on why this error is cropping up would be appreciated.
>>
>> R version 2.11.1 (2010-05-31)
>> i386-pc-mingw32
>>
>> locale:
>> [1] LC_COLLATE=English_United States.1252
>> [2] LC_CTYPE=English_United States.1252
>> [3] LC_MONETARY=English_United States.1252
>> [4] LC_NUMERIC=C
>> [5] LC_TIME=English_United States.1252
>>
>> attached base packages:
>> [1] grid      stats     graphics  grDevices utils     datasets  methods
>> [8] base
>>
>> other attached packages:
>> [1] KEGGgraph_1.4.0  Rgraphviz_1.26.0 graph_1.26.0     XML_3.1-0
>>
>> loaded via a namespace (and not attached):
>> [1] tools_2.11.1
>>
>> Thanks in advance,
>>
>> -Robert
>>
>> Robert M. Flight, Ph.D.
>> Bioinformatics and Biomedical Computing Laboratory
>> University of Louisville
>> Louisville, KY
>>
>> PH 502-852-0467
>> EM robert.flight at louisville.edu
>> EM rflight79 at gmail.com
>>
>> Williams and Holland's Law:
>>       If enough data is collected, anything may be proven by
>> statistical methods.
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives:
>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>>
>
>
>
> --
> Jitao David Zhang
> Pre-doc staff of Computational Biology and Biostatistics
> Division of Molecular Genome Analysis
> DKFZ, Heidelberg D-69120, Germany
>
> http://www.NextBioMotif.com/
>
>        [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>



More information about the Bioconductor mailing list