[R] scan html: sep = "<td>"

Uwe Ligges ligges at statistik.uni-dortmund.de
Mon Apr 4 16:57:48 CEST 2005


Christoph Lehmann wrote:

> Hi
> I try to import html text and I need to split the fields at each <td> or 
> </td> entry
> 
> How can I succeed? sep = '<td>' doens't yield the right result

If it fits pairwise together, use
   sep=c("<td>", "</td>")

if not, you can read the whole lot with readLines and strsplit for both 
pattern after that, for example.

Uwe Ligges



> thanks for hints
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! 
> http://www.R-project.org/posting-guide.html




More information about the R-help mailing list