[BioC] Help using biomaRt

john seers (IFR) john.seers at bbsrc.ac.uk
Mon Jan 26 09:42:58 CET 2009


Hi Steve

Thanks for the answers. I will go back and try again and perhaps make
some progress.

On your second point I will check for illegal characters; but the
transcript names were returned from Biomart/Ensembl so they should be
OK.  


Regards

John Seers 



 
---


mailto:John.Seers at bbsrc.ac.uk


John Seers
Bioinformatics & Statistics
Institute of Food Research
Norwich Research Park
Colney
Norwich
NR4 7UA
 
Location: IFR1 N102
PC Machine ID: N198


tel +44 (0)1603 251497
fax +44 (0)1603 507723
e-mail john.seers at bbsrc.ac.uk                         
e-disclaimer at http://www.ifr.ac.uk/edisclaimer/ 
 
Web sites:

www.ifr.ac.uk   
www.foodandhealthnetwork.com

-----Original Message-----
From: Steve Lianoglou [mailto:mailinglist.honeypot at gmail.com] 
Sent: 23 January 2009 16:46
To: john seers (IFR)
Cc: bioconductor at stat.math.ethz.ch
Subject: Re: [BioC] Help using biomaRt

Hi,

> Question 1
> ===========
>
> I am using the Ensembl database. Looking at the www.ensembl.org  
> website
> it looks like you do not have to specify a filter. Do you have to
> specify a filter using biomaRt/R? If not how is it done?

I'm not sure I get what you're getting at, and probably others will  
jump in to answer your question with ease (I'm also getting more  
acquainted w/ biomaRt), but could you give an example of a query  
you're trying to do that wouldn't use a filter?

Perhaps one I can think of off hand would be to get all the gene id's  
for your organism. Intuitively you might think that there are no  
filters you can use in this case, but you do: you would use the  
chromosomes you want to get the info for (in this case, it would be  
all of them). So, you would just do something like:

annos <- getBM(..., attributes='ensembl_gene_id',  
filters='chromosome_name', values=CHROMOSOMES)

That's a butchered-for-brevity piece of code. There is an example in  
the recent Toedling and Huber chip-chip tutorial from plos compbio  
where they call getBM as many times as there are chromosomes and  
filter/collate afterwards.

Anyway ... an example of the query you have in mind might help.

> Question 2
> ==============
>
> While trying things I get the following error message, presumably  
> caused
> by the attribute not being in that table.
>
>
> V1
> 1 Query ERROR: caught BioMart::Exception::Usage: Attributes from
> multiple attribute pages are not allowed
> Error in getBM(attnames, filters = filtnames, values = transcripts,  
> mart
> = mart) :
>  Number of columns in the query result doesn't equal number of
> attributes in query.  This is probably an internal error, please  
> report.


I was fighting with a similar error recently, I was using gene symbols/ 
names to look up annotation info for them. It turned out that some of  
my names had an apostrophe in the ('), and another gene name somehow  
had a "<b>" tacked on to it. I received a gene list from an  
experimentalist and just didn't think to scrub it, so it took a while  
to figure out that that was the problem.

Perhaps you have some illegal characters in your transcript names, too.

HTH,
-steve

--
Steve Lianoglou
Graduate Student: Physiology, Biophysics and Systems Biology
Weill Medical College of Cornell University

http://cbio.mskcc.org/~lianos



More information about the Bioconductor mailing list