[BioC] snarfing GEO datasets...

Vincent J. Carey, Jr. stvjc@channing.harvard.edu
Tue, 4 Jun 2002 13:47:00 -0400 (EDT)


in package annotate there is GEOAnno, a first pass
at this application for GEO annotation of platforms

arguments are

platform (a string that identifies the platform in the
GEO catalog)

massageStruct (a list with elements keepIndsExpr,
ncols, header, pad, stripCols -- self explanatory?
the idea is that they give you a ragged table, you
can find the regimented subtable (with common column
number ncols) in rows given by keepIndsExpr), state
the header titles you want, and do something with
"," as last token and omit columns you don't need)

inURL (where do you go to look at GEO datasets)

the massage struct is the diciest part, but examples
exist for GPL80 (HUGeneFL) and GPL91 (U95A) -- all
in the annotate package under GEOAnno 
-- 
---
Vince Carey, PhD
Ass't Prof Med (Biostatistics)
Harvard Medical School
Channing Laboratory - ph 6175252265 fa 6177311541
181 Longwood Ave Boston MA 02115 USA

stvjc@channing.harvard.edu

-----BEGIN PGP PUBLIC KEY BLOCK-----
Version: PGP 6.5.8

mQCNAzqIeGUAAAEEAMJXU941vIornTS52rl6z7eo+A7wwB0km/idLnkxzIhc1uLi
Qtn19OyOfG6IDSucLrtmpvwagemAnQ9jL6TVDrmlrKnqsh+FFtvUuZ37eV85L70E
BsS8RZCmMYHJKfrpCwegbTVZrEkd1ByquLIN/yUwxU4IcVuHxbNQk69riQ8tAAUR
tBVWaW5jZW50IEouIENhcmV5LCBKci6JAJUDBRA6iHhls1CTr2uJDy0BAdsLA/wM
cCzEDsP9MqodKZfDI1s/gXW6BcCuQ6n7MdEplLgmWvyqfbvRYx4upYZ3pNp8L0zU
MrlR6eCTs/eDtMO/ZbGvkqqiQO6wS2fZb1T5L/DhhtT4mEAHt0E8dNBVCj+lKr3W
vYS5GqO9gY4CiT3JXFH9N19pSbUQFiNDqpmG6EbWng==
=DQNF
-----END PGP PUBLIC KEY BLOCK-----