[R] counting number of "G" in "TCGGGGGACAATCGGTAACCCGTCT"

Wolfgang Huber huber at ebi.ac.uk
Tue Jul 15 17:59:17 CEST 2008


Hi,

And the Bioconductor package "Biostrings" is the place to go for any
serious work with sequences.
-- 
Best wishes
 Wolfgang

------------------------------------------------------------------
Wolfgang Huber  EBI/EMBL  Cambridge UK  http://www.ebi.ac.uk/huber


15/07/2008 16:43 Henrik Bengtsson scripsit
> Seems like you can do:
> 
> library("matchprobes")   # on Bioconductor
> countbases("TCGGGGGACAATCGGTAACCCGTCT")[,"G"]
> 
> The catch is that it only counts A, C, G, and T:s and no other symbols.
> 
> /Henrik
> 
> On Tue, Jul 15, 2008 at 8:27 AM, Daren Tan <daren76 a hotmail.com> wrote:
>> Any better solution than this ?
>>
>> sum(strsplit("TCGGGGGACAATCGGTAACCCGTCT", "")[[1]] == "G")
>> _________________________________________________________________



More information about the R-help mailing list