[BioC] rtrackleyer/GenomicRanges: How can I group GRanges by metadata attributes

chris warth [guest] guest at bioconductor.org
Fri Nov 1 06:51:51 CET 2013


After reading in a GTF file with rtrackler::import(), I have a GRanges object.  How can I select the ranges whose gene_id (or transcript_id) match a particular value?

> library(rtracklayer)
> gtf <- import(system.file("tests", "gtf.gff", package="rtracklayer"), asRangedData=F)

I can see the metadata with 'mcols(gtf)', and I can even see the gene_id with 'mcols(gtf)$group', 

> mcols(gtf)$group
[1] gene_id "ENSMUSG00000033501.1"; transcript_id "ENSMUST00000040592.1"; exon_id "ENSMUSE00000310143.1";
[omitted]
3 Levels: gene_id "ENSMUSG00000033501.1"; transcript_id "ENSMUST00000040592.1"; exon_id "ENSMUSE00000310143.1"; ...

but I don't see any way to utilize the name=value strings in the group column.

Do I just have to parse the values group column myself?

Thanks in advance for any help.


 -- output of sessionInfo(): 

R version 3.0.2 (2013-09-25)
Platform: x86_64-apple-darwin10.8.0 (64-bit)

locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] parallel  stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] rtracklayer_1.20.4   GenomicRanges_1.12.5 IRanges_1.18.4       BiocGenerics_0.6.0   BiocInstaller_1.10.4

loaded via a namespace (and not attached):
[1] Biostrings_2.28.0 bitops_1.0-6      BSgenome_1.28.0   RCurl_1.95-4.1    Rsamtools_1.12.4  stats4_3.0.2     
[7] tools_3.0.2       XML_3.95-0.2      zlibbioc_1.6.0   


--
Sent via the guest posting facility at bioconductor.org.



More information about the Bioconductor mailing list