[R] Reading from a webpage

Bert Gunter
Mon Apr 27 19:04:41 CEST 2020


This list provides "Help"; it sounds like you are looking for a
tutorial with detailed instructions, which generally goes beyond what
will be provided here. See the posting guide linked at the bottom for
what you *can* expect.

https://cran.r-project.org/web/views/WebTechnologies.html
provides information that may well be relevant.

Also, try searching at rseek.org. Entering "scraping web pages" there
brought up what looked like useful information, for example.
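
As a starting point only, here is a minimal base R sketch of the five
counting tasks, run on a made-up sample string rather than the actual
page. The variable names and the sample text are assumptions for
illustration; getting the text out of the web page (for instance with one
of the packages listed in the task view above) is a separate step that is
not shown, and "reversed" in question 5 is interpreted in two possible
ways because the question does not say which is meant.

```r
# Stand-in for the page text (hypothetical sample, not the real page).
txt <- paste("The city is old.",
             "The city is large, and the city walls are extraordinarily old.",
             "The city grows.")

# Split on runs of non-letters and lower-case the result to get words.
words <- tolower(unlist(strsplit(txt, "[^[:alpha:]]+")))
words <- words[nzchar(words)]          # drop any empty pieces

# 1) Number of words.
n_words <- length(words)

# 2) Number of characters, counting spaces and punctuation too.
n_chars <- nchar(txt)

# 3) Distinct words with more than 9 letters.
long_words <- unique(words[nchar(words) > 9])

# 4) Words that occur at least four times.
freq <- table(words)
frequent <- names(freq)[freq >= 4]

# 5a) Interpretation one: the word order is reversed.
reversed_order <- paste(rev(words), collapse = " ")

# 5b) Interpretation two: the letters inside each word are reversed.
reversed_letters <- vapply(strsplit(words, ""),
                           function(ch) paste(rev(ch), collapse = ""),
                           character(1))
```

Applying rev() a second time undoes either kind of reversal. Note that
splitting on non-letters treats "don't" as two words and drops digits, so
the pattern may need adjusting for real text.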


Bert Gunter

"The trouble with having an open mind is that people keep coming along
and sticking things into it."
-- Opus (aka Berkeley Breathed in his "Bloom County" comic strip)

On Mon, Apr 27, 2020 at 8:56 AM Vahid Borji <vahid.borji65 using gmail.com> wrote:
>
> Hi there,
>
> I am using R. I want to save the text of the following webpage in a
> variable and then answer some questions about it.
>
> http://en.neyshabur.ac.ir/en/119-about-city-of-neyshabur/1232-city-of-neyshabur
>
> My questions are:
>
> 1) How many words are there in the text (of the webpage)?
>
> 2) How many characters are there in the text (of the webpage)?
>
> 3) I want to find the words (of the text) that have more than 9
> letters. How can I do it?
>
> 4) I want to find the words that are repeated at least four times in the
> text. How can I do it?
>
> 5) Assume the words of the text are typed in reverse (from end to start);
> how can I correct this?
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.


