[R] Output for pasting multiple vectors

Spencer Brackett @pbr@ckett20 @end|ng |rom @@|ntjo@ephh@@com
Thu Jun 27 16:48:28 CEST 2019


On Thu, Jun 27, 2019 at 10:48 AM Spencer Brackett <
spbrackett20 using saintjosephhs.com> wrote:

> Hello,
>
> Was that helpful in clarifying?
>
> Best,
>
> Spencer
>
> On Tue, Jun 25, 2019 at 5:14 PM Spencer Brackett <
> spbrackett20 using saintjosephhs.com> wrote:
>
>> Thank you! The following is the result of the corrected code using
>> >cnames ....
>>
>> > cnames
>>  [1] "sample.NA.NA.NANA"          "TCGA.27.1832.01.NA.NA.NANA"
>> "TCGA.27.1831.01.NA.NA.NANA"
>>  [4] "TCGA.28.5216.01.NA.NA.NANA" "TCGA.16.0846.01.NA.NA.NANA"
>> "TCGA.28.5218.01.NA.NA.NANA"
>>  [7] "TCGA.06.0178.01.NA.NA.NANA" "TCGA.06.0238.01.NA.NA.NANA"
>> "TCGA.06.0125.01.NA.NA.NANA"
>> [10] "TCGA.06.0219.01.NA.NA.NANA"
>>
>> And then the first 2 lines of the output for the corrected paste() bit of
>> the original coding...
>>
>> [[1]]
>> [1] "sample.NA.NA.NA"
>>
>> [[2]]
>> [1] "TCGA.27.1832.01.NA.NA.NANA"
>>
>> I would have to reconvene with my instructor who wrote the original code
>> to determine what his desired output for the data would be, but from what I
>> can tell, this bit of coding is formatting the given dataset(s) that we are
>> working with, prepping them for what will be a Linear regression model of
>> the protein expression datatype contained with the larger dataset. Sorry, I
>> know that this explanation doesn't really narrow things down.
>>
>> For reference, here are the proceeding few lines of code following the
>> paste() function portion; specifically after the "})" ...
>>
>> exp = read.table(file =
>> "/rsrch1/bcb/kchen_group/v_mohanty/data/TCGA/RNAseq/GBM.txt", sep = "\t",
>> header = T, row.names = 1)
>> exp = as.matrix(exp)
>> c = intersect(colnames(exp),colnames(meth))
>> exp = exp[,c]
>> meth = meth[,c]
>> m = apply(meth, 1, function(i){
>>   log2(i/(1-i))
>> })
>>
>> Best,
>>
>> Spencer Brackett
>>
>> On Tue, Jun 25, 2019 at 4:54 PM Rui Barradas <ruipbarradas using sapo.pt>
>> wrote:
>>
>>> Hello,
>>>
>>> The following works.
>>> First, get the colnames you have posted in a format that R can process.
>>> in your code you would skip this.
>>>
>>> cnames <- scan(what = character(), text = '
>>> "sample.NA.NA.NANA"
>>> "TCGA.27.1832.01.NA.NA.NANA"
>>> "TCGA.27.1831.01.NA.NA.NANA"
>>> "TCGA.28.5216.01.NA.NA.NANA"
>>> "TCGA.16.0846.01.NA.NA.NANA"
>>> "TCGA.28.5218.01.NA.NA.NANA"
>>> "TCGA.06.0178.01.NA.NA.NANA"
>>> "TCGA.06.0238.01.NA.NA.NANA"
>>> "TCGA.06.0125.01.NA.NA.NANA"
>>> "TCGA.06.0219.01.NA.NA.NANA"
>>> ')
>>> cnames
>>>
>>>
>>> Now your code corrected. I do not understand the output you are
>>> expecting. Can you give an example of the expected output of, say, the
>>> first 2 or 3 strings?
>>>
>>> lapply(cnames, function(i){
>>>    c1 <- strsplit(i, split = '\\.')[[1]]
>>>    c1[4] <- paste(strsplit(c1[4], split = "",fixed = T)[[1]][1:2],
>>> collapse =  "")
>>>    paste(c1, collapse = ".")
>>> })
>>>
>>>
>>> Hope this helps,
>>>
>>> Rui Barradas
>>>
>>> Às 20:24 de 25/06/19, Spencer Brackett escreveu:
>>> > Mr. Barradas,
>>> >
>>> > Oh haha my mistake. The following is the output you requested....
>>> >
>>> > [1] "sample.NA.NA.NANA"
>>> >   [2] "TCGA.27.1832.01.NA.NA.NANA"
>>> >   [3] "TCGA.27.1831.01.NA.NA.NANA"
>>> >   [4] "TCGA.28.5216.01.NA.NA.NANA"
>>> >   [5] "TCGA.16.0846.01.NA.NA.NANA"
>>> >   [6] "TCGA.28.5218.01.NA.NA.NANA"
>>> >   [7] "TCGA.06.0178.01.NA.NA.NANA"
>>> >   [8] "TCGA.06.0238.01.NA.NA.NANA"
>>> >   [9] "TCGA.06.0125.01.NA.NA.NANA"
>>> > [10] "TCGA.06.0219.01.NA.NA.NANA"
>>> >
>>> > Best,
>>> >
>>> > Spencer
>>> >
>>> > On Tue, Jun 25, 2019 at 2:50 PM Rui Barradas <ruipbarradas using sapo.pt
>>> > <mailto:ruipbarradas using sapo.pt>> wrote:
>>> >
>>> >     Hello,
>>> >
>>> >     No, you have not understood, post the colnames, not the row.names.
>>> >
>>> >     Run
>>> >
>>> >     colnames(meth)[1:10]
>>> >
>>> >     and post the output of that code line.
>>> >
>>> >     Hope this helps,
>>> >
>>> >     Rui Barradas
>>> >
>>> >     Às 19:31 de 25/06/19, Spencer Brackett escreveu:
>>> >      > The requested reprex....
>>> >      >
>>> >      > Rendering reprex...
>>> >      > Error in parse(text = x, keep.source = TRUE) :
>>> >      >    <text>:26:2: unexpected ')'
>>> >      > 25:
>>> >      > 26: 2)
>>> >      >       ^
>>> >      >
>>> >      > And the subset of the data....
>>> >      >
>>> >      >   row.names = c(NA, 6L), class = "data.frame")
>>> >      >
>>> >      > Best,
>>> >      >
>>> >      > Spencer
>>> >      >
>>> >      >
>>> >      > On Tue, Jun 25, 2019 at 2:25 PM Rui Barradas
>>> >     <ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>
>>> >      > <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>>>
>>> wrote:
>>> >      >
>>> >      >     Hello,
>>> >      >
>>> >      >     No I wasn't expecting a very large file.
>>> >      >     Try to make a reproducible example. If the problem seems to
>>> >     be with the
>>> >      >     sapply/strsplit post a subset of the data like just some of
>>> the
>>> >      >     colnames:
>>> >      >
>>> >      >     colnames(meth)[1:10]
>>> >      >
>>> >      >
>>> >      >     This would allow us to run the code you are having trouble
>>> with.
>>> >      >
>>> >      >
>>> >      >     Hope this helps,
>>> >      >
>>> >      >     Rui Barradas
>>> >      >
>>> >      >     Às 17:36 de 25/06/19, Spencer Brackett escreveu:
>>> >      >      > The result of dput(head(meth)) is a very large data
>>> >     table/listing of
>>> >      >      > data by category of what I presume to be the data from
>>> the two
>>> >      >     TCGA .txt
>>> >      >      > files I referenced previously. Is this the output you
>>> were
>>> >     expecting?
>>> >      >      >
>>> >      >      > On Tue, Jun 25, 2019 at 12:19 PM Rui Barradas
>>> >      >     <ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>
>>> >     <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>>
>>> >      >      > <mailto:ruipbarradas using sapo.pt <mailto:
>>> ruipbarradas using sapo.pt>
>>> >     <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>>>>
>>> wrote:
>>> >      >      >
>>> >      >      >     Hello,
>>> >      >      >
>>> >      >      >     1) That error comes from not closing }) after the
>>> paste()
>>> >      >     instruction.
>>> >      >      >     The complete statement would be
>>> >      >      >
>>> >      >      >
>>> >      >      >     colnames(meth) = sapply(colnames(meth), function(i){
>>> >      >      >         c1 = strsplit(i, split ='\\', fixed = T)[[1]]
>>> >      >      >         c1[4] = paste(strsplit(c1[4],split = "",fixed =
>>> >      >      >     T)[[1]][1:2],collapse
>>> >      >      >     =  "")
>>> >      >      >         paste(c1,collapse = ".")
>>> >      >      >     })
>>> >      >      >
>>> >      >      >
>>> >      >      >     2) Can you post the output of the following?
>>> >      >      >
>>> >      >      >     dput(head(meth))
>>> >      >      >
>>> >      >      >
>>> >      >      >     Hope this helps,
>>> >      >      >
>>> >      >      >     Rui Barradas
>>> >      >      >
>>> >      >      >     Às 16:58 de 25/06/19, Spencer Brackett escreveu:
>>> >      >      >      > The following is what I have implemented thus
>>> far...
>>> >      >      >      >
>>> >      >      >      > The file object for the two files listed in lines
>>> >     1-2 was
>>> >      >     set in my
>>> >      >      >      > working directory, and are under the folder
>>> >     "Vakul's GBM
>>> >      >     Code"...
>>> >      >      >     hence
>>> >      >      >      > the source of the odd 'prefix' for .txt files
>>> shown
>>> >      >      >      >
>>> >      >      >      >  >library(data.table)>.anno =
>>> >     as.data.frame(fread(file =
>>> >      >      >     "~Vakul's GBM
>>> >      >      >      > Code/mapper.txt", sep ="\t", header = T))
>>> >      >      >      >  >meth = read.table(file = "~Vakul's GBM
>>> Code/GBM.txt",
>>> >      >     sep  ="\t",
>>> >      >      >      > header = T, row.names = 1)
>>> >      >      >      >  >meth = as.matrix(meth)
>>> >      >      >      >
>>> >      >      >      > ##  the loop just formats the methylation column
>>> >     names to
>>> >      >     match
>>> >      >      >     format ##
>>> >      >      >      > colnames(meth) = sapply(colnames(meth),
>>> function(i){
>>> >      >      >      >    c1 = strsplit(i,split ='\\', fixed = T)[[1]]
>>> >      >      >      >    c1[4] = paste(strsplit(c1[4],split = "",fixed =
>>> >      >      >     T)[[1]][1:2],collapse
>>> >      >      >      > =  "")
>>> >      >      >      >    paste(c1,collapse = ".")
>>> >      >      >      >
>>> >      >      >      > Rendering reprex...
>>> >      >      >      > Error in parse(text = x, keep.source = TRUE) :
>>> >      >      >      >    <text>:28:0: unexpected end of input
>>> >      >      >      >
>>> >      >      >      > Best,
>>> >      >      >      >
>>> >      >      >      > Spencer
>>> >      >      >      >
>>> >      >      >      >
>>> >      >      >      >
>>> >      >      >      > On Tue, Jun 25, 2019 at 11:45 AM Spencer Brackett
>>> >      >      >      > <spbrackett20 using saintjosephhs.com
>>> >     <mailto:spbrackett20 using saintjosephhs.com>
>>> >      >     <mailto:spbrackett20 using saintjosephhs.com
>>> >     <mailto:spbrackett20 using saintjosephhs.com>>
>>> >      >      >     <mailto:spbrackett20 using saintjosephhs.com
>>> >     <mailto:spbrackett20 using saintjosephhs.com>
>>> >      >     <mailto:spbrackett20 using saintjosephhs.com
>>> >     <mailto:spbrackett20 using saintjosephhs.com>>>
>>> >      >      >     <mailto:spbrackett20 using saintjosephhs.com
>>> >     <mailto:spbrackett20 using saintjosephhs.com>
>>> >      >     <mailto:spbrackett20 using saintjosephhs.com
>>> >     <mailto:spbrackett20 using saintjosephhs.com>>
>>> >      >      >     <mailto:spbrackett20 using saintjosephhs.com
>>> >     <mailto:spbrackett20 using saintjosephhs.com>
>>> >      >     <mailto:spbrackett20 using saintjosephhs.com
>>> >     <mailto:spbrackett20 using saintjosephhs.com>>>>>
>>> >      >      >      > wrote:
>>> >      >      >      >
>>> >      >      >      >     Mr. Barradas,
>>> >      >      >      >
>>> >      >      >      >     I got the same "output" as before, which is
>>> the +
>>> >      >     indicating that
>>> >      >      >      >     the expression is incomplete (according to
>>> some
>>> >     R users
>>> >      >      >     response in
>>> >      >      >      >     this chain).
>>> >      >      >      >
>>> >      >      >      >     Should the argument perhaps be c1 =
>>> strsplit(i,
>>> >     split
>>> >      >     = '\\',
>>> >      >      >     fixed
>>> >      >      >      >     = T)[[1]] .... thereby eliminating the "." ?
>>> >      >      >      >     The reprex and error message as the result of
>>> >     this is
>>> >      >     the same as
>>> >      >      >      >     the previous one I sent.
>>> >      >      >      >
>>> >      >      >      >     I will send a more detailed description of
>>> what
>>> >     code I
>>> >      >     have
>>> >      >      >     done so
>>> >      >      >      >     far for context.
>>> >      >      >      >
>>> >      >      >      >     Best,
>>> >      >      >      >
>>> >      >      >      >     Spencer
>>> >      >      >      >
>>> >      >      >      >     On Tue, Jun 25, 2019 at 11:30 AM Rui Barradas
>>> >      >      >     <ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>
>>> >     <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>>
>>> >      >     <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>
>>> >     <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>>>
>>> >      >      >      >     <mailto:ruipbarradas using sapo.pt
>>> >     <mailto:ruipbarradas using sapo.pt>
>>> >      >     <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>>
>>> >     <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt>
>>> >      >     <mailto:ruipbarradas using sapo.pt <mailto:ruipbarradas using sapo.pt
>>> >>>>>
>>> >      >      >     wrote:
>>> >      >      >      >
>>> >      >      >      >         Hello,
>>> >      >      >      >
>>> >      >      >      >         Maybe with
>>> >      >      >      >
>>> >      >      >      >         c1 = strsplit(i, split = '\\.', fixed =
>>> T)[[1]]
>>> >      >      >      >
>>> >      >      >      >         instead of split = '.'
>>> >      >      >      >
>>> >      >      >      >         The dot is a metacharacter that matches
>>> any
>>> >      >     character so
>>> >      >      >     it has
>>> >      >      >      >         to be
>>> >      >      >      >         escaped.
>>> >      >      >      >
>>> >      >      >      >         Hope this helps,
>>> >      >      >      >
>>> >      >      >      >         Rui Barradas
>>> >      >      >      >
>>> >      >      >      >         Às 16:11 de 25/06/19, Spencer Brackett
>>> >     escreveu:
>>> >      >      >      >          > c1 = strsplit(i,split ='.', fixed =
>>> T)[[1]]
>>> >      >      >      >
>>> >      >      >
>>> >      >
>>> >
>>>
>>

	[[alternative HTML version deleted]]



More information about the R-help mailing list