[R] how do I remove entries in data frame from a vector

Rui Barradas ru|pb@rr@d@@ @end|ng |rom @@po@pt
Wed Oct 21 23:47:23 CEST 2020


Hello,

To remove the file extension it's much easier to use base R


filename <- tools::file_path_sans_ext(basename(celFiles))


Hope this helps,

Rui Barradas

Às 22:41 de 21/10/20, Rui Barradas escreveu:
> Hello,
> 
> This is probably because basename keeps the file extension, try instead
> 
> 
> filename <- sub("(^[^\\.]*)\\..+$", "\\1", basename(celFiles))
> celFiles[!filename %in% as.character(remove$V1)]
> 
> 
> Hope this helps,
> 
> Rui Barradas
> 
> Às 22:15 de 21/10/20, Ana Marija escreveu:
>> Hello,
>>
>> I have a data frame with one column:
>>
>>> remove
>>
>>                                  V1
>>
>> 1 ABAFT_g_4RWG569_BI_SNP_A10_35096
>> 2 ABAFT_g_4RWG569_BI_SNP_B12_35130
>> 3 ABAFT_g_4RWG569_BI_SNP_E09_35088
>> 4 ABAFT_g_4RWG569_BI_SNP_E12_35136
>> 5 ABAFT_g_4RWG569_BI_SNP_F11_35122
>> 6 ABAFT_g_4RWG569_BI_SNP_F12_35138
>> 7 ABAFT_g_4RWG569_BI_SNP_G07_35060
>> 8 ABAFT_g_4RWG569_BI_SNP_G12_35140
>>
>> I want to remove these 8 entries from remove data frame from this
>> vector that looks like this:
>>
>>> head(celFiles)
>>
>> [1] 
>> "/GOKIND/75327/PhenoGenotypeFiles/RootStudyConsentSet_phs000018.GAIN_GoKinD.v2.p1.c1.DS-T1DCR-IRB/GenotypeFiles/ABAFT_g_4RWG569_BI_SNP_A01_34952.CEL" 
>>
>> [2] 
>> "/GOKIND/75327/PhenoGenotypeFiles/RootStudyConsentSet_phs000018.GAIN_GoKinD.v2.p1.c1.DS-T1DCR-IRB/GenotypeFiles/ABAFT_g_4RWG569_BI_SNP_A02_34968.CEL" 
>>
>>
>> [3] 
>> "/GOKIND/75327/PhenoGenotypeFiles/RootStudyConsentSet_phs000018.GAIN_GoKinD.v2.p1.c1.DS-T1DCR-IRB/GenotypeFiles/ABAFT_g_4RWG569_BI_SNP_A03_34984.CEL" 
>>
>>
>> [4] 
>> "GOKIND/75327/PhenoGenotypeFiles/RootStudyConsentSet_phs000018.GAIN_GoKinD.v2.p1.c1.DS-T1DCR-IRB/GenotypeFiles/ABAFT_g_4RWG569_BI_SNP_A04_35000.CEL" 
>>
>>
>> [5] 
>> "/GOKIND/75327/PhenoGenotypeFiles/RootStudyConsentSet_phs000018.GAIN_GoKinD.v2.p1.c1.DS-T1DCR-IRB/GenotypeFiles/ABAFT_g_4RWG569_BI_SNP_A05_35016.CEL" 
>>
>>
>> [6] 
>> "/GOKIND/75327/PhenoGenotypeFiles/RootStudyConsentSet_phs000018.GAIN_GoKinD.v2.p1.c1.DS-T1DCR-IRB/GenotypeFiles/ABAFT_g_4RWG569_BI_SNP_A06_35032.CEL" 
>>
>> ...
>>
>> I tried doing this:
>>
>> b= celFiles[!basename(celFiles) %in% as.character(remove$V1)]
>>
>> but none of the 8th entries in "remove" data frame have been removed.
>>
>> Please advise,
>> Ana
>>
>> ______________________________________________
>> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide 
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
> 
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide 
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list