[BioC] ChIPpeakAnno, getAnnotation question

Neha [guest] guest at bioconductor.org
Fri Sep 20 14:56:12 CEST 2013


Hi Julie


I am using ChIPpeakAnno for my data. The annotation data I would like to use Zebrafish but Zv9, though annotated data package has TSS.zebrafish.Zv8.
I downloaded the "GTF"  file for Zv9 but I am unable to convert it into Ranged data using GFF2RangedData function since it accepts GFF file. Therefore, I am trying to make custom annotation using the following code.


"

# create a biomart object and select a mart to use
ensembl <- useDataset("drerio_gene_ensembl", mart=useMart("ensembl"))

# what kind of attributes can we retrieve?
listAttributes(ensembl)

# set the properties to extract
props <- c("ensembl_gene_id", "external_gene_id", "transcript_biotype", "chromosome_name", "start_position", "end_position", "strand")

# get the information
genes <- getBM(attributes=props, mart=ensembl)

# create a RangedData object with my custom annotations
myCustomAnno <- RangedData(IRanges(start=genes[,"start_position"], end=genes[,"end_position"], names=genes[,"ensembl_gene_id"]), space=genes[,"chromosome_name"], strand=genes[,"strand"])
"


But I am getting following error:

"Error in `rownames<-`(`*tmp*`, value = c("ENSDARG00000085528", "ENSDARG00000087771",  : 
  duplicate rownames not allowed"

I am unable to proceed ahead, also I very new in using this package. Could you suggest a solution.



Cheers,
Neha

 -- output of sessionInfo(): 

R version 2.14.2 (2012-02-29)
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)

locale:
[1] en_GB.UTF-8/en_GB.UTF-8/en_GB.UTF-8/C/en_GB.UTF-8/en_GB.UTF-8

attached base packages:
[1] grid      stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] ChIPpeakAnno_2.2.0                  gplots_2.10.1                       KernSmooth_2.23-7                   caTools_1.12                       
 [5] bitops_1.0-4.1                      gdata_2.8.2                         gtools_2.6.2                        limma_3.10.3                       
 [9] org.Hs.eg.db_2.6.4                  GO.db_2.6.1                         RSQLite_0.11.1                      DBI_0.2-5                          
[13] AnnotationDbi_1.16.19               BSgenome.Ecoli.NCBI.20080805_1.3.17 BSgenome_1.22.0                     GenomicRanges_1.6.7                
[17] Biostrings_2.22.0                   IRanges_1.12.6                      multtest_2.10.0                     Biobase_2.14.0                     
[21] biomaRt_2.10.0                     

loaded via a namespace (and not attached):
[1] MASS_7.3-18      RCurl_1.91-1     splines_2.14.2   survival_2.36-14 tools_2.14.2     XML_3.9-4       

--
Sent via the guest posting facility at bioconductor.org.



More information about the Bioconductor mailing list