[R] html into R

Rui Barradas ru|pb@rr@d@@ @end|ng |rom @@po@pt
Fri Aug 26 12:53:43 CEST 2022


You can try the following. It worked with me.
Read from the link and post-process the html data extracting the element 
"table" and then the table itself.

This table has 3 rows before the actual table so the lapply below will 
get the table and its header.


link <- 

page <- read_html(link)
page |>
   html_elements("table") |>
   html_table(header = TRUE) |>
   lapply(\(x) {
     hdr <- unlist(x[3, ])
     y <- x[-(1:3), ]
     names(y) <- hdr

Hope this helps,

Rui Barradas

Às 09:43 de 26/08/2022, Nick Wray escreveu:
> hello - I need to download flow data for Scottish river catchments.  The
> data is available from the Scottish Environmental protection Agency body
> and that doesn't present a problem.  For example the API beneath will
> access the 96 flow recordings on the River Tweed on Jan 1st 2020 at one
> station:
> https://timeseries.sepa.org.uk/KiWIS/KiWIS?service=kisters&type=queryServices&datasource=0&request=getTimeseriesValues&ts_path=1/14972/Q/15m.Cmd&from=2020-01-01&to=2020-01-07&returnfields=Timestamp,Value,Quality%20Code
> But this data comes as HTML.  I can copy and paste it into a text doc which
> can then be read into R but that's slow and time-consuming.  I have tried
> using the package "rvest" to import the HTML into R but I have got nowhere.
> Can anyone give me any pointers as to how to do this?
> Thanks Nick Wray
> 	[[alternative HTML version deleted]]
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.

More information about the R-help mailing list