[BioC] slow insertions in to graphNEL object (24 hours for 16k nodes)

Paul Shannon pshannon at systemsbiology.org
Tue Sep 11 20:28:18 CEST 2007


It took nearly 24 hours (!) to create a 16k node graph using two  
different techniques:

    g = fromGXL (file ('someFile.gxl'))

and

   g = new ('graphNEL', edgemode='undirected')
   edgeDataDefaults (g, attr='edgeType') = 'edge'
   edgeDataDefaults (g, attr='source') = 'unknown'

   ...

   for (r in 1:max) {
     ...
     g = addNode (a, g)
     g = addNode (b, g)
     g = addEdge (a, b, g)
     edgeData (g, a, b, 'source') = source
     edgeData (g, a, b, 'edgeType') = method
     }

The 16k nodes and their edges are from a suitably parsed version of  
all of the reactions
reported by KEGG.

Is this user error, user misconception, ... or maybe an inefficiency  
that future versions
of the graph package could improve upon?

sessionInfo ()
R version 2.5.1 (2007-06-27)
i386-apple-darwin8.9.1

locale:
C

attached base packages:
[1] "stats"     "graphics"  "grDevices" "utils"     "datasets"   
"methods"   "base"

other attached packages:
    graph    RUnit
"1.14.0" "0.4.15"



More information about the Bioconductor mailing list