[BioC] how to get number of samples from GSE with GEOquery
Dick Beyer
dbeyer at u.washington.edu
Thu Dec 17 00:35:37 CET 2009
Please ignore my last request. The solution was pretty straighforward, after a peek at the entity-relationship diagram.
sql <- paste("SELECT DISTINCT gse.gse, gse.title, gse_gsm.gsm, gsm.title",
"FROM", " gsm JOIN gse_gsm ON gsm.gsm=gse_gsm.gsm",
" JOIN gse ON gse_gsm.gse=gse.gse", " JOIN gse_gpl ON gse_gpl.gse=gse.gse",
" JOIN gpl ON gse_gpl.gpl=gpl.gpl", "WHERE",
#" gsm.molecule_ch1 like '%total RNA%' AND",
" gse.title LIKE '%colorectal cancer%' AND",
" gpl.organism LIKE '%Homo sapiens%'",
sep = " ")
Thanks,
Dick
*******************************************************************************
Richard P. Beyer, Ph.D. University of Washington
Tel.:(206) 616 7378 Env. & Occ. Health Sci. , Box 354695
Fax: (206) 685 4696 4225 Roosevelt Way NE, # 100
Seattle, WA 98105-6099
http://depts.washington.edu/ceeh/ServiceCores/FC5/FC5.html
http://staff.washington.edu/~dbeyer
*******************************************************************************
On Wed, 16 Dec 2009, Dick Beyer wrote:
> Would someone please tell me if there is a way to get the number of samples,
> GSMs associated with a GSE, from doing a GEOquery that returns GSEs?
> Specifically, I have this query:
>
> sql <- paste("SELECT DISTINCT gse.gse, gse.title, gse.overall_design,
> gse.repeats, gse.repeats_sample_list, gse.variable, gse.variable_description,
> gse.supplementary_file, gse.summary",
> "FROM", " gsm JOIN gse_gsm ON gsm.gsm=gse_gsm.gsm",
> " JOIN gse ON gse_gsm.gse=gse.gse", " JOIN gse_gpl ON gse_gpl.gse=gse.gse",
> " JOIN gpl ON gse_gpl.gpl=gpl.gpl", "WHERE",
> #" gsm.molecule_ch1 like '%total RNA%' AND",
> " gse.title LIKE '%colorectal cancer%' AND",
> " gpl.organism LIKE '%Homo sapiens%'",
> sep = " ")
>
> So I am getting all the fields I can (I think) for the GSE entries, but I don't
> see how to get the number of samples for each GSE. I see from the GEOquery
> documentation that I could ue getGEO(), but I don't want to get all the
> samples, I just want to know their names and how many.
>
> If I use the ncbi website (http://www.ncbi.nlm.nih.gov/sites/entrez) and do a
> GEO DataSets query for a particular GSE, then the number of samples and their
> names shows up in the browser.
>
> Any help or ideas will be greatly appreciated.
>
> Thanks very much,
> Dick
> *******************************************************************************
> Richard P. Beyer, Ph.D. University of Washington
> Tel.:(206) 616 7378 Env. & Occ. Health Sci. , Box 354695
> Fax: (206) 685 4696 4225 Roosevelt Way NE, # 100
> Seattle, WA 98105-6099
> http://depts.washington.edu/ceeh/ServiceCores/FC5/FC5.html
> http://staff.washington.edu/~dbeyer
> *******************************************************************************
>
>
>
>
More information about the Bioconductor
mailing list