[BioC] Howto annotate blast subject.id with AnnotationDbi

Marc Carlson mcarlson at fhcrc.org
Tue May 28 19:37:22 CEST 2013


Hi Arnaud,

It should be fine as long as you remember that the shortened name is 
that of the locus ID, and that you are therefore annotating at the gene 
level (as is the case for all the org packages).

You can see the TAIR website for more clarification about what the 
different IDs mean.  Here is a page for the locus ID "AT1G01010":

http://www.arabidopsis.org/servlets/TairObject?type=locus&id=137158

And here is a page for the gene model "AT1G01010.1":

http://www.arabidopsis.org/servlets/TairObject?type=gene&id=138508


   Marc





On 05/27/2013 01:12 AM, Arnaud Mounier wrote:
> Hi,
>
> I try to annotate a blastp result (not so much, just 270 query) with 
> the org.At.tair.db from AnnotationDBi package throw bioconductor.
>
> First I read blast output with RFLPtools :
>
> > df.blast.report <- read.blast(file = f.blast.report)
> > head(df.blast.report)
>          query.id  subject.id identity alignment.length mismatches 
> gap.opens q.start q.end s.start s.end evalue bit.score
> 1 medtr8g018420.1 AT1G55020.1    59.77              860        314    
> 9       9   856      20   859      0      1058
> 2 medtr8g018420.1 AT3G22400.1    56.16              869        344   
> 10       9   856      34   886      0      1004
> 3 medtr8g018420.1 AT1G72520.1    45.19              821        433   
> 10      45   856     114   926      0       729
>
> The subject ID have version number (.1 or .2) and the original 
> ATH_GO_GOSLIM.txt from tair site two. But this version number is not 
> present in the org.At.tait.dbTAIR :
> > head(keys(org.At.tair.db, keytype="TAIR"))
> [1] "AT1G01010" "AT1G01020" "AT1G01030" "AT1G01040" "AT1G01050" 
> "AT1G01060"
>
>
> * Is this relevant or can I annotate without taking care of the 
> version number ?
> Does Org.At.tair.db keep the version number elsewhere ?
> Because the source file for this package 
> (ftp://ftp.arabidopsis.org/Ontologies/Gene_Ontology/ATH_GO_GOSLIM.txt) 
> store it initialy.
>
> * As the query must be selected in function of her subjects 
> annotations and GO.db, I want to merge all info (blast report, 
> org.At.tair.db + GO.db) in one db with a bioconductor package 
> (annotationForge perhaps).
> So, is there a package or a GNU script to manage this association 
> easily or do i wrote my own R scripts ?
>
> Any links are welcomes !
> Thank's in advance,
> Ar.
>



More information about the Bioconductor mailing list