[R] converting XML document to table or dataframe

David Winsemius dwinsemius at comcast.net
Tue Jan 29 12:42:59 CET 2013


On Jan 28, 2013, at 9:00 PM, Anika Masters wrote:

> I am a relatively new user to R, and I am trying to learn more about
> converting data in an XML document into "2-dimensional format" such  
> as a
> table or array.  I might eventually wish to export this data into a
> relational database such as SQL, and/or to work with this data  
> within the R
> package.
>
> My sample XML document is located at "
> http://www.sec.gov/Archives/edgar/data/743988/000124636013000561/form.xml 
> "
>
> I have successfully import the XML document and then converted the XML
> document to a list.
>
> I am "stuck" trying to convert the document into a "2-dimenional"  
> table or
> dataframe.
>
> What is a "good" way to convert the XML document to a 2-dimensional  
> table
> or data.frame?  Ideally, I'd like a table with 1 row for each XML  
> document,
> and unique fieldnames.  If fieldnames repeat, I'd like the names to be
> numbered sequentially
>
> e.g.
> $nonDerivativeTable$nonDerivativeTransaction$transactionAmounts 
> $transactionPricePerShare$value_1
> $nonDerivativeTable$nonDerivativeTransaction$transactionAmounts 
> $transactionPricePerShare$value_2
> $nonDerivativeTable$nonDerivativeTransaction$transactionAmounts 
> $transactionPricePerShare$value_3
>


It might help if you picked a different example. At the moment the  
tree you are working with has no names of that sublist:

 > names(mylist$nonDerivativeTable)
NULL

(Most of your other calls throw errors.)


-- 
David.
> etc
>
>
>
>
> myxml = xmlParse("
> http://www.sec.gov/Archives/edgar/data/743988/000124636013000561/form.xml 
> ")
> mylist <- xmlToList(mydoc)
> mydf <- xmlToDataFrame(mydoc)
> mydf2 <- data.frame(mylist)
> mytable <- as.table(mylist)
> mydf2 <- data.frame(mydoc)
> mytable <- as.table(mydoc)
>
> 	[[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.

David Winsemius, MD
Alameda, CA, USA



More information about the R-help mailing list