[BioC] ArrayQuest at MUSC

Gary Argraves garyArgraves at sbcglobal.net
Tue Jun 14 22:22:50 CEST 2005


Greetings to bioconductor users,

The purpose of this letter is to introduce you to ArrayQuest. ArrayQuest™ is
a web-based program for the analysis of DNA microarray data. ArrayQuest™ is
designed to apply various types of analysis scripts (i.e., Bioconductor
statistical and graphical methods) to microarray data stored in the MUSC DNA
Microarray Database and the Gene Expression Omnibus (GEO).

We have developed a software suite that includes four modules that work in
conjunction to mediate the archiving and analysis of DNA microarray data.
The four modules of this suite are as follows:

   1. µArrayDB™ module: µArrayDB™ is a program for archiving DNA microarray
data derived from experimentation performed through our DNA Microarray
Facility. The data for µArrayDB™ is stored on our Internet front-end server,
a Dell PoweEdge 440. µArrayDB™ and ArrayQuest™ (see next) both utilize MySQL
server to store meta data such as project information (scanned and analysis
project information), cRNA target information and process controlling
information.

   2. ArrayQuest™ analysis module: this module allows a client to select
microarray information from the µArrayDB™ database for analysis. The
ArrayQuest™ analysis paradigm can be thought of as an analysis process
controller in which a user specifies an analysis process that includes a
selectable analysis method (i.e., a particular R script such as RMA) and
related input for that method (i.e., Affymetrix .CEL files). ArrayQuest™ has
been designed to automatically distribute loads across a computing cluster.
Before assigning a clients job, ArrayQuest™ will query a load control
program that will indicate which CPU in the cluster is best suited to
analyzing the job at the time. ArrayQuest presently uses Linux operating
systems RedHat and Fedora Core2. ArrayQuest™ is operated through web
browsing portals such as (i.e., Internet Explorer, Safari, Mozilla). The R
package that is used to run many ArrayQuest methods can be compiled to
utilize the increased performance that a 64 bit architecture offers.

   3. Analysis job-distribution module: ArrayQuest™ communicates with a
job-distribution module to send an analysis job to a CPU within the backend
analysis cluster, that is ready to accomplish the job. A key application
used in the analysis system is a statistics package entitled R. Most
analysis jobs that are launched by clients will take a considerable amount
of time for the results to be returned. ArrayQuest™ will notify the client
by email as well through flagging status within the process database, the
current state of any process being executed.

   4. Upload module: This module will allow off site clients to import data
into µArrayDB™ or to a private database. This will allow the µArrayDB™
database to grow. When the database grows then our client base will grow and
in turn our analysis requirements will grow. For this reason we need the CPU
Analysis Custers. µArrayDB™ is designed to protect user data and not allow
it to be released into the public domain without authorization, as data
security is a major issue in the field. The µArrayDB™ module has been
describe in a published article (Argraves et al., 2003).


URL:
http://proteogenomics.musc.edu/quickSite/arrayQuest.php?page=home&act=manage

Please direct your interest in this system to:

W. Scott Argraves, Ph.D., Professor
Medical University of South Carolina
Dept of Cell Biology and Anatomy
173 Ashley Avenue
Charleston, SC 29425
Email: argraves at musc.edu

or to

garyArgraves at sbcglobal.net

Sincerely,
Gary Argraves



More information about the Bioconductor mailing list