[BioC] problem with org.Hs.egCHRLOCEND

James W. MacDonald jmacdon at med.umich.edu
Fri Jun 12 14:51:41 CEST 2009


Hi James,

James F. Reid wrote:
> Hi James and Kay,
> 
> I can see org.Hs.egCHRLOCEND in both the current release version 2.2.11 
> and the devel release 2.3.0.
> In which version has this been removed or does not appear?

Good call - I do remember that this was supposed to be added, but 
neglected to look to see if it had happened.

> 
>  From ucsc I get that the following RefSeq genes coordinates:
> CRIPAK at chr4:1375340-1379782 - (NM_175918) cysteine-rich PAK1 inhibitor
> CAND2 at chr3:12813171-12851301 - (NM_012298) TBP-interacting protein
> STK25 at chr2:242083105-242096707 - (NM_006374) serine/threonine kinase 25
> 
> the starting positions obtained via CHRLOC all have an offset of -1 but 
> not the ending positions (CHRLOCEND), do you know why that is?

http://genome.ucsc.edu/FAQ/FAQtracks#tracks1

Best,

Jim


> 
>  > library("org.Hs.eg.db")
>  > org.Hs.egCHRLOCEND
> CHRLOCEND map for Human (object of class "AnnDbMap")
>  > glist <- c("CRIPAK", "CAND2", "STK25")
>  > eglist <- unlist(mget(glist, org.Hs.egSYMBOL2EG))
>  > eglist
>   CRIPAK    CAND2    STK25
> "285464"  "23066"  "10494"
>  > mget(eglist, org.Hs.egCHRLOC)
> $`285464`
>       4
> 1375339
> 
> $`23066`
>        3
> 12813170
> 
> $`10494`
>          2
> -242083104
> 
>  > mget(eglist, org.Hs.egCHRLOCEND)
> $`285464`
>       4
> 1379782
> 
> $`23066`
>        3
> 12851301
> 
> $`10494`
>          2
> -242096707
> 
>  > sessionInfo()
> R version 2.9.0 (2009-04-17)
> x86_64-redhat-linux-gnu
> 
> locale:
> LC_CTYPE=en_US.UTF-8;LC_NUMERIC=C;LC_TIME=en_US.UTF-8;LC_COLLATE=en_US.UTF-8;LC_MONETARY=C;LC_MESSAGES=en_US.UTF-8;LC_PAPER=en_US.UTF-8;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_US.UTF-8;LC_IDENTIFICATION=C 
> 
> 
> attached base packages:
> [1] stats     graphics  grDevices utils     datasets  methods   base
> 
> other attached packages:
> [1] org.Hs.eg.db_2.2.11 RSQLite_0.7-1       DBI_0.2-4
> [4] AnnotationDbi_1.6.0 Biobase_2.4.1
> 
> Best,
> J.
> 
> 
> James W. MacDonald wrote:
>> Hi Kay,
>>
>>
>> Kay Jaja wrote:
>>> Does any one know where i can find org.Hs.egCHRLOCEND
>>> to find the end base pair position for a gene. I have downloaded 
>>> org.Hs.eg.db package and org.Hs.egCHRLOCEND is not there.
>>
>> That is correct - the gene ends are not part of that package. You can 
>> use biomaRt for this.
>>
>>  > library(biomaRt)
>>  > mart <- useMart("ensembl","hsapiens_gene_ensembl")
>> Checking attributes ... ok
>> Checking filters ... ok
>>  > gns <- c("CRIPAK","CAND2","STK25")
>>  > getBM(c("hgnc_symbol","chromosome_name", 
>> "start_position","end_position"), "hgnc_symbol", gns, mart)
>>   hgnc_symbol chromosome_name start_position end_position
>> 1      CRIPAK               4        1378152      1379776
>> 2       CAND2               3       12813171     12851301
>> 3       STK25               2      242083105    242096776
>>
>> Alternatively, you could use getGene(), which has a simpler interface 
>> but returns things you might not want.
>>
>> Best,
>>
>> Jim
>>
>>
>>> I have tried the following example  Library(org.Hs.eg.db)
>>>  
>>>> glist
>>> [1] "CRIPAK" "CAND2"  "STK25"
>>>  
>>>> mget(eglist, org.Hs.egCHR)
>>> $`285464`
>>> [1] "4"
>>>  
>>> $`23066`
>>> [1] "3"
>>>  
>>> $`10494`
>>> [1] "2"
>>>  
>>> Find the start position:
>>>  > mget(eglist, org.Hs.egCHRLOC)
>>> $`285464`
>>>       4
>>> 1375339
>>>  
>>> $`23066`
>>>        3
>>> 12813170
>>>  
>>> $`10494`
>>>          2
>>> -242083104
>>>  
>>> And the end positions I get an error   
>>>> mget(eglist, org.Hs.egCHRLOCEND)
>>> Error: object "org.Hs.egCHRLOCEND" not found
>>> Error in mget(eglist, org.Hs.egCHRLOCEND) :   error in evaluating the 
>>> argument 'envir' in selecting a method for function 'mget'
>>>  
>>>  
>>> Thanks for your help
>>>
>>>
>>>           [[alternative HTML version deleted]]
>>>
>>>
>>>
>>> ------------------------------------------------------------------------
>>>
>>> _______________________________________________
>>> Bioconductor mailing list
>>> Bioconductor at stat.math.ethz.ch
>>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>>> Search the archives: 
>>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>>
> 
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: 
> http://news.gmane.org/gmane.science.biology.informatics.conductor

-- 
James W. MacDonald, M.S.
Biostatistician
Douglas Lab
University of Michigan
Department of Human Genetics
5912 Buhl
1241 E. Catherine St.
Ann Arbor MI 48109-5618
734-615-7826



More information about the Bioconductor mailing list