[R] reading multiple XML files into an R table

Kjetil Halvorsen kjetilbrinchmannhalvorsen at gmail.com
Wed Oct 27 16:15:57 CEST 2010


For the loop part:

Put all your XML files in a directory with no other files, then do

setwd("directory/with/xml/files")
files <- list.files()
for (file in files) {
    # process each file here
}
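
A slightly fuller sketch of the same idea, in case it helps. It loops over
the files that actually exist (so gaps in the numbering are harmless), sorts
them numerically, and skips any file that fails to parse. The names
read_umm_dir(), parse_one and parse_umm() are made up for illustration, the
XML package is assumed to be installed for parse_umm(), and the element names
are taken from the example message below. It also assumes every file yields
the same set of fields, since rbind() needs matching columns.

```r
# Hypothetical helper: collect one row per file into a data frame.
# parse_one is any function taking a file path and returning a named list.
read_umm_dir <- function(dir, parse_one) {
  # Only files named <digits>.xml; missing numbers are simply not listed.
  files <- list.files(dir, pattern = "^[0-9]+\\.xml$", full.names = TRUE)
  ids <- as.integer(sub("\\.xml$", "", basename(files)))
  # Sort numerically so that 10.xml comes after 9.xml, not after 1.xml.
  ord <- order(ids)
  files <- files[ord]
  ids <- ids[ord]

  rows <- vector("list", length(files))
  for (i in seq_along(files)) {
    # A file that cannot be parsed gives NULL and is left out of the table.
    row <- tryCatch(parse_one(files[i]), error = function(e) NULL)
    if (!is.null(row)) {
      rows[[i]] <- data.frame(id = ids[i], as.list(row),
                              stringsAsFactors = FALSE)
    }
  }
  do.call(rbind, rows)  # NULL entries are ignored by rbind()
}

# Hypothetical parser for one message, assuming the XML package.
# Only a few of the fields are pulled out; add more as needed.
parse_umm <- function(path) {
  doc <- XML::xmlParse(path)
  field <- function(name) {
    v <- XML::xpathSApply(doc, paste0("//", name), XML::xmlValue)
    if (length(v) == 0) NA_character_ else as.character(v[[1]])
  }
  list(station          = field("station"),
       event_type       = field("event_type"),
       effect_during    = field("effect_during"),
       effect_installed = field("effect_installed"))
}
```

Something like umm <- read_umm_dir("directory/with/xml/files", parse_umm)
should then give one row per readable file, with the file number in the id
column. All values come back as character, so convert the effect_* columns
with as.numeric() afterwards.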

kjetil


2010/10/27 Ista Zahn <izahn at psych.rochester.edu>:
> Hi Jørgen,
> You will be better served by learning how to find the answers to these
> kinds of questions on your own. You can either use a general search
> engine such as Google:
> http://lmgtfy.com/?q=read+multiple+files+in+R
> http://lmgtfy.com/?q=read+xml+data+in+R
> or use an R-specific search engine. At the R prompt, try
> RSiteSearch("read multiple files", restrict=NULL)
> RSiteSearch("read xml", restrict=NULL)
>
> If you run into difficulties many people on this list (including
> myself) will be happy to help.
> Best,
> Ista
>
> 2010/10/27 Jørgen Blystad Houge <jb.houge at gmail.com>:
>> Good morning fellow R users!
>>
>> I need to read multiple .XML files, now gathered in one folder, and collect
>> them in a table in R. The files have only numeric names and are numbered
>> almost consecutively (e.g. 1.xml, 2.xml, 3.xml ... up to about 40000.xml),
>> but a few numbers/files are missing, so the code must be able to handle
>> missing files.
>>
>> Can someone suggest a FOR-loop in R that could be able to read all these
>> files and categorize them correctly? I've pasted an example underneath. (For
>> those interested, this is an "Urgent Market Message" on Nord Pool Spot. I
>> would like to systematize them to observe the power market reserve margin as
>> a historical time series, that is, to estimate how much electricity is
>> actually available to the market.)
>>
>>  <?xml version="1.0" encoding="ISO-8859-1" ?>
>>  <participant_umm>
>>   <effect_after>100</effect_after>
>>   <station>Vinje</station>
>>   <affected_units>G1, G2, G3</affected_units>
>>  <umm predecessor_id="0" parent_id="5244" new_followup="New" u_id="5244">
>>   <event_start>19.07.04 hour 11:15</event_start>
>>   <registered>19.07.04 hour 11:15</registered>
>>   <event_type>Production failure</event_type>
>>   <status>Approved and does not have a Followup</status>
>>   <decission>19.07.04 hour 11:15</decission>
>>   <event_stop />
>>   <predefined_remark />
>>   <approved>19.07.04 hour 11:17</approved>
>>   <affected_areas>NO1</affected_areas>
>>   <remarks>Failure G2, unavailable until further notice.</remarks>
>>   <company>Statkraft SF</company>
>>  </umm>
>>   <effect_before>100</effect_before>
>>   <affected_fuels>Hydro</affected_fuels>
>>   <effect_during>0</effect_during>
>>   <prodcons>Production</prodcons>
>>   <effect_installed>300</effect_installed>
>>  </participant_umm>
>>
>> Thanks a lot!
>>
>> Best,
>> Jørgen Blystad Houge
>> MSc student Norwegian University of Science and Technology
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>>
>
>
>
> --
> Ista Zahn
> Graduate student
> University of Rochester
> Department of Clinical and Social Psychology
> http://yourpsyche.org
