[R] Please help -- Cbind data frames within LIST() in R

PIKAL Petr petr.pikal at precheza.cz
Tue Mar 15 09:40:29 CET 2016


Hi

Keep your reply on rhelp list (I cc'd it).

Your code is not reproducible:
**only you has such directory
path_data <- 'C:/Users/pc/Documents/Projects/fb'

**where is function searchPages
Searching 10 Pages for  LinkedIn
Error: could not find function "searchPages"

So this is my last effort.

One of my suggestions was rbind.fill from package plyr, which may be not what you wanted. But in te same reply there was also function cbindPad.

Having list lll
dput(lll)
list(structure(list(mikro = structure(c(1L, 1L, 1L, 1L, 1L, 1L,
1L, 1L, 1L), .Label = "D", class = "factor"), hod = structure(1:9, .Label = c("12:30:00",
"12:31:00", "12:33:00", "12:34:00", "12:36:00", "12:38:00", "12:39:00",
"12:40:00", "12:45:00"), class = "factor"), cas = c(0L, 1L, 3L,
4L, 6L, 8L, 9L, 10L, 11L)), .Names = c("mikro", "hod", "cas"), class = "data.frame", row.names = c(NA,
-9L)), structure(list(Časm = structure(c(2L, 11L, 9L, 8L, 6L,
4L), .Label = c("1:00", "11:00", "13:00", "15:00", "17:00", "19:00",
"21:00", "23:00", "3:00", "5:00", "7:00", "9:00"), class = "factor"),
    Kontrolor = structure(c(6L, 6L, 6L, 6L, 6L, 6L), .Label = c("CLLAB",
    "KOŠUTKOVÁ", "KOUTNÁ PETRA", "KOVÁŘOVÁ", "KRUTILOVÁ", "LABRTG",
    "NAVRÁTILOVÁ"), class = "factor"), X.R.výp = c(0.5, 0.8,
    0.2, 0.5, 0.5, 0.2)), .Names = c("Časm", "Kontrolor", "X.R.výp"
), class = "data.frame", row.names = c(NA, 6L)))

and using function cbindPad

cbindPad <- function(...){
args <- list(...)
n <- sapply(args,nrow)
mx <- max(n)
pad <- function(x, mx){
    if (nrow(x) < mx){
        nms <- colnames(x)
        padTemp <- matrix(NA, mx - nrow(x), ncol(x))
        colnames(padTemp) <- nms
        if (ncol(x)==0) {
          return(padTemp)
        } else {
        return(rbind(x,padTemp))
          }
    }
    else{
        return(x)
    }
}
rs <- lapply(args,pad,mx)
return(do.call(cbind,rs))
}

I get this.

> dput(cbindPad(lll[[1]], lll[[2]]))
structure(list(mikro = structure(c(1L, 1L, 1L, 1L, 1L, 1L, 1L,
1L, 1L), .Label = "D", class = "factor"), hod = structure(1:9, .Label = c("12:30:00",
"12:31:00", "12:33:00", "12:34:00", "12:36:00", "12:38:00", "12:39:00",
"12:40:00", "12:45:00"), class = "factor"), cas = c(0L, 1L, 3L,
4L, 6L, 8L, 9L, 10L, 11L), Časm = structure(c(2L, 11L, 9L, 8L,
6L, 4L, NA, NA, NA), .Label = c("1:00", "11:00", "13:00", "15:00",
"17:00", "19:00", "21:00", "23:00", "3:00", "5:00", "7:00", "9:00"
), class = "factor"), Kontrolor = structure(c(6L, 6L, 6L, 6L,
6L, 6L, NA, NA, NA), .Label = c("CLLAB", "KOŠUTKOVÁ", "KOUTNÁ PETRA",
"KOVÁŘOVÁ", "KRUTILOVÁ", "LABRTG", "NAVRÁTILOVÁ"), class = "factor"),
    X.R.výp = c(0.5, 0.8, 0.2, 0.5, 0.5, 0.2, NA, NA, NA)), .Names = c("mikro",
"hod", "cas", "Časm", "Kontrolor", "X.R.výp"), row.names = c(NA,
9L), class = "data.frame")
>

If this is not what you want, instead of flooding list with your irreproducible code provide input and output data in proper form. (?dput)

Cheers
Petr
 -----Original Message-----
> From: Sunny Singha [mailto:sunnysingha.analytics at gmail.com]
> Sent: Monday, March 14, 2016 11:17 AM
> To: PIKAL Petr
> Subject: Re: [R] Please help -- Cbind data frames within LIST() in R
>
> Petr,
> There is no common column in the data frames ($posts, $comments,
> $likes) within the list. I'm reposting the part of the code in plain
> text format:
>
> library(Rfacebook)
> library(plyr)
> library(data.table)
>
> page_keywords <- c('LinkedIn','youtube', 'facebook') p <- list() for(i
> in 1:length(page_keywords)){
>   cat('Searching 10 Pages for ',page_keywords[i],'\n')
>   p[[i]] <- searchPages(page_keywords[i], token=fb_oauth, 10)
>   cat('Found : ',nrow(p[[i]]),page_keywords[i],'\n\n')
> }
>
> # Appending pages found in the data frame 'pages_df pages_df <-
> rbindlist(p) cat('Category of pages found::: \n',
>     sort(paste0(unique(pages_df$category), '\n')))
>
>
> # Function to scrap facebook pages' posts, comments and likes
> page_data_extract <- function(pg_id){
>   path_data <- 'C:/Users/pc/Documents/Projects/fb'
>   cat('Extracting list of posts from page', pg_id, '\n')
>   page <- getPage(pg_id, token=fb_oauth, since='2016/03/01')
>
>   for(i in 1:nrow(page)){
>     cat('\nScrapping post \n',page$id[i])
>
>     posts <- getPost(page$id[i], n=100, token =fb_oauth)
>     time <- format(Sys.time(), '%b_%d_%Y')
>     for(j in 1:length(posts)){
>       myfile <- file.path(path_data, paste0(names(posts[j]), time, '_',
> i, '.csv'))
>       write.csv(posts[[j]], file=myfile, row.names = F)
>     }
>   }
> }
>
> # Call page_data_extract() function for each pages found to gather
> posts, comments & Likes for(i in 1:nrow(pages_df)){
>     e_catch <- try(page_data_extract(pages_df$id[i]))
>     if(isTRUE(all.equal(class(e_catch), 'try-error'))){
> #      writeLines('No public posts were found for == ')
>       cat('No public posts were found for ==
> ',pages_df$name[i],'(',pages_df$id[i],')\n\n')
>     }
> }
>
> Regards,
> Sunny
>
> On Mon, Mar 14, 2016 at 3:23 PM, PIKAL Petr <petr.pikal at precheza.cz>
> wrote:
> > Hi
> >
> >
> >
> > From: Sunny Singha [mailto:sunnysingha.analytics at gmail.com]
> > Sent: Monday, March 14, 2016 10:37 AM
> > To: r-help; PIKAL Petr
> > Subject: Re: [R] Please help -- Cbind data frames within LIST() in R
> >
> >
> >
> > Thanks Petr,
> >
> > I'm going through the link that you have provided. Merge won't be
> > useful in my case. Let me give the complete picture of what I'm
> trying to achieve.
> >
> > Why do you think so? If you have several data frames of unequal
> length
> > with some common column I am almost sure that you want to align all
> > values according to this common column.
> >
> >
> > I'm extracting data from fb pages. I have the data frame 'pages_df'
> > which has all the details of the fb pages. 'Page_id' is the important
> > column in it to get all the posts from that page.
> >
> > I have created below function to which page_id is passed and data
> > frame is created separately for 'posts.csv', 'comments.csv',
> 'likescsv'.
> >
> > Ideally this is not what I want as the final output. I want to modify
> > this function so that it outputs single flat file where columns of
> >
> > each posts, comments and likes are cbinded.
> >
> >
> > # Function to scrap facebook pages' posts, comments and likes
> > page_data_extract <- function(pg_id){
> >   cat('Extracting list of posts from page', pg_id, '\n')
> >   page <- getPage(pg_id, token=fb_oauth, since='2016/03/01') # This
> > function extracts posts metadata from page_id argument specified.
> >
> >   # Loop to information of the post.
> >
> >   for(i in 1:nrow(page)){
> >
> >     path_dat <- <custom_path>
> >
> >     cat('\nScrapping post \n',page$id[i])
> >
> >     posts <- getPost(page$id[i], n=100, token =fb_oauth)
> >     time <- format(Sys.time(), '%b_%d_%Y')
> >     for(j in 1:length(posts)){
> >       myfile <- file.path(path_data, paste0(names(posts[j]), time,
> > '_', i,
> > '.csv'))
> >       write.csv(posts[[j]], file=myfile, row.names = F)
> >     }
> >   }
> > }
> >
> > I get this error with your code
> >
> > +     path_dat <- <custom_path>
> >
> > Error: unexpected '<' in:
> >
> > "  for(i in 1:nrow(page)){
> >
> >     path_dat <- <"
> >
> >>     cat('\nScrapping post \n',page$id[i])
> >
> > Error in page$id : object of type 'closure' is not subsettable
> >
> >>
> >
> > So please follow Posting guide for this help list if you really want
> > some usefull answers:
> >
> > No HTML, only plain text mails.
> >
> > Reproducible code
> >
> > Some toy data e.g. by posting result of
> >
> > dput(head(pages_df, 20))
> >
> > Cheers
> >
> > Petr
> >
> >
> > Here I call the above function:
> >  # Call page_data_extract() function for each pages found to gather
> > posts, comments & Likes for(i in 1:nrow(pages_df)){
> >     e_catch <- try(page_data_extract(pages_df$id[i]))
> >     if(isTRUE(all.equal(class(e_catch), 'try-error'))){
> >       cat('No public posts were found for ==
> > ',pages_df$name[i],'(',pages_df$id[i],')\n\n')
> >     }
> > }
> >
> > Regards,
> >
> > Sunny
> >
> >
> >
> > On Mon, Mar 14, 2016 at 1:19 PM, PIKAL Petr <petr.pikal at precheza.cz>
> wrote:
> >
> > Hi
> >
> > There are several options e.g.
> >
> > http://stackoverflow.com/questions/6988184/combining-two-data-frames-
> o
> > f-different-lengths
> > http://stackoverflow.com/questions/7196450/create-a-data-frame-of-
> uneq
> > ual-lengths
> >
> > but maybe you want to do actually merging
> >
> > see
> >
> > ?merge
> >
> > Cheers
> > Petr
> >
> >> -----Original Message-----
> >> From: R-help [mailto:r-help-bounces at r-project.org] On Behalf Of
> Sunny
> >> Singha
> >> Sent: Monday, March 14, 2016 8:16 AM
> >> To: r-help
> >> Subject: [R] Please help -- Cbind data frames within LIST() in R
> >>
> >> Hi,
> >> Please help and guide. I want to cbind dataframes within the list
> >> object:
> >> List object 'post_data' contains below sampled data frames:
> >>
> >> post_data
> >> $posts
> >>       from_id user_name post_msg
> >>
> >> $comm
> >>        comm_id from_id message
> >>
> >> $Likes
> >>        username like_id
> >>
> >>
> >> *Note: The columns in each data frame are of unequal length. hence,
> >> missing values will be filled with 'NA' after combining* Regards,
> >> Sunny
> >>

________________________________
Tento e-mail a jakékoliv k němu připojené dokumenty jsou důvěrné a jsou určeny pouze jeho adresátům.
Jestliže jste obdržel(a) tento e-mail omylem, informujte laskavě neprodleně jeho odesílatele. Obsah tohoto emailu i s přílohami a jeho kopie vymažte ze svého systému.
Nejste-li zamýšleným adresátem tohoto emailu, nejste oprávněni tento email jakkoliv užívat, rozšiřovat, kopírovat či zveřejňovat.
Odesílatel e-mailu neodpovídá za eventuální škodu způsobenou modifikacemi či zpožděním přenosu e-mailu.

V případě, že je tento e-mail součástí obchodního jednání:
- vyhrazuje si odesílatel právo ukončit kdykoliv jednání o uzavření smlouvy, a to z jakéhokoliv důvodu i bez uvedení důvodu.
- a obsahuje-li nabídku, je adresát oprávněn nabídku bezodkladně přijmout; Odesílatel tohoto e-mailu (nabídky) vylučuje přijetí nabídky ze strany příjemce s dodatkem či odchylkou.
- trvá odesílatel na tom, že příslušná smlouva je uzavřena teprve výslovným dosažením shody na všech jejích náležitostech.
- odesílatel tohoto emailu informuje, že není oprávněn uzavírat za společnost žádné smlouvy s výjimkou případů, kdy k tomu byl písemně zmocněn nebo písemně pověřen a takové pověření nebo plná moc byly adresátovi tohoto emailu případně osobě, kterou adresát zastupuje, předloženy nebo jejich existence je adresátovi či osobě jím zastoupené známá.

This e-mail and any documents attached to it may be confidential and are intended only for its intended recipients.
If you received this e-mail by mistake, please immediately inform its sender. Delete the contents of this e-mail with all attachments and its copies from your system.
If you are not the intended recipient of this e-mail, you are not authorized to use, disseminate, copy or disclose this e-mail in any manner.
The sender of this e-mail shall not be liable for any possible damage caused by modifications of the e-mail or by delay with transfer of the email.

In case that this e-mail forms part of business dealings:
- the sender reserves the right to end negotiations about entering into a contract in any time, for any reason, and without stating any reasoning.
- if the e-mail contains an offer, the recipient is entitled to immediately accept such offer; The sender of this e-mail (offer) excludes any acceptance of the offer on the part of the recipient containing any amendment or variation.
- the sender insists on that the respective contract is concluded only upon an express mutual agreement on all its aspects.
- the sender of this e-mail informs that he/she is not authorized to enter into any contracts on behalf of the company except for cases in which he/she is expressly authorized to do so in writing, and such authorization or power of attorney is submitted to the recipient or the person represented by the recipient, or the existence of such authorization is known to the recipient of the person represented by the recipient.


More information about the R-help mailing list