[R] How to access https page

Prof Brian Ripley ripley at stats.ox.ac.uk
Tue Mar 10 07:28:38 CET 2015


On 09/03/2015 22:39, Hui Du wrote:
> Hi All,
>
> I am trying to parse some information from a website, say, a LinkedIn page.
> The LinkedIn URL was
>
> url = "http://www.linkedin.com/in/huidu"
>
> I had no problem using readLines and the XML package to collect the
> information I need. However, that URL has now become
> "https://www.linkedin.com/in/huidu".
>
> url = "https://www.linkedin.com/in/huidu"
>
> The readLines function fails on it:
>
>> readLines(url)
> Error in file(con, "r") : cannot open the connection
> In addition: Warning message:
> In file(con, "r") : unsupported URL scheme
>
>
> Do you know of any way to read in web information if the URL is https?
> Thanks a lot.

Try R-devel, soon to become R 3.2.0.  That supports https:// URLs on 
platforms where libcurl is installed (which should be possible almost 
everywhere).
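
A minimal sketch of what that might look like, assuming R >= 3.2.0 built 
with libcurl support.  The helper name read_https is hypothetical (it is 
not part of R); the sketch only wraps url() and readLines():

```r
# Hypothetical helper (not part of R): read an https:// page via libcurl.
read_https <- function(u) {
  # capabilities("libcurl") reports whether this build of R has libcurl
  # support; unname() so that isTRUE() also works on older versions of R.
  if (!isTRUE(unname(capabilities("libcurl"))))
    stop("this build of R has no libcurl support")
  con <- url(u, method = "libcurl")  # the 'method' argument is new in R 3.2.0
  on.exit(close(con))                # always release the connection
  readLines(con, warn = FALSE)
}

# Example call (needs network access):
# page <- read_https("https://www.linkedin.com/in/huidu")
```

The result is a character vector of lines, as readLines(url) gave for the 
http:// address, so it can be passed on to the XML package as before.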

You did not give the 'at a minimum' information required by the posting 
guide (your R version and platform).  If you are on Windows, this has 
long been possible with --internet2.

>
> Hui
>


-- 
Brian D. Ripley,                  ripley at stats.ox.ac.uk
Emeritus Professor of Applied Statistics, University of Oxford
1 South Parks Road, Oxford OX1 3TG, UK


