[R] Webscraping Data RCurl

Sven D sduve at hotmail.com
Sun May 27 15:26:32 CEST 2012


Hello,

I am trying to gather data from the following page:

http://toast.gasunie.de/Pages/search.aspx?lang=en&soid=GUD

I am trying to get hourly data. I did the following:

url = "http://toast.gasunie.de/Pages/search.aspx?lang=en&soid=GUD"
getFormParams(url)

fair enough, this did not return any sensible name=value parameters I could
use. I then looked into the HTML code to find name=value parameters that
would help me. I found the following name=value pairs, and set:

today = format(Sys.time(), "%d.%m.%Y")

ctl00_MainContent_webDatePickerFrom_DrpPnl1_DP_CAL_ID_1=today,
ctl00_MainContent_webDatePickerTo_DrpPnl2_DP_CAL_ID_2=today
GasQuality="H", 
PointType="EN", 
PointList="H070", 
Publishers="HourValues", 
ListHourValues="HTLF", 
Submit="Weiter"

So I went on and tried getForm() and postForm() with the above parameters,
but it did not show any success. So I THINK the problem is that the
webapplication is returning the table on a different URL as the one above,
please have a quick try with the webapplication. How can I come by this
problem?

Best, and thanks


Sven






--
View this message in context: http://r.789695.n4.nabble.com/Webscraping-Data-RCurl-tp4631500.html
Sent from the R help mailing list archive at Nabble.com.



More information about the R-help mailing list