[R] How to access https page

Hui Du hui.du at savvyrookies.com
Mon Mar 9 23:39:33 CET 2015


Hi All,

I am trying to parse some information from website, say, a linkedin page.
The linkedin url was

url = "http://www.linkedin.com/in/huidu"

I had no problem to use readLines and XML package to collect the
information I need. However, that url became "
https://www.linkedin.com/in/huidu" now.

url = "https://www.linkedin.com/in/huidu"

It failed readLines function.

> readLines(url)
Error in file(con, "r") : cannot open the connection
In addition: Warning message:
In file(con, "r") : unsupported URL scheme


Do you know any way to read-in web information if the url is https? Thanks
a lot.

Hui

	[[alternative HTML version deleted]]



More information about the R-help mailing list