[BioC] Using as.geneSet in GeneticsBase - problems with gene.columns

Johannes Gulmann Madsen johannes at dsr.life.ku.dk
Thu Mar 19 15:03:27 CET 2009


Hello

Im a new user of Bioconductor and therefore also a new user of the package
GeneticsBase. I have a dataframe looking like this (only a tiny bit of it):

> b[1:10,]
                  SNP.Name Sample.ID Allele1...Top Allele2...Top GC.Score
1      0_A2M_DS066406.1_15 602358305             G             G   0.8607
2       0_A2M_DS068238.1_4 602358305             G             G   0.8311
3     0_AARSL_DS061819.1_2 602358305             G             G   0.6840
4    0_ABCA1_DS062937.1_32 602358305             G             G   0.7499
5  0_ABCA1_DS073864.1_41_2 602358305             A             A   0.7900
6    0_ABCA1_DS073864.1_41 602358305             G             G   0.7383
7    0_ABCA1_DS078984.1_39 602358305             A             A   0.8360
8    0_ABCA1_DS082793.1_19 602358305             G             G   0.8440
9    0_ABCC10_DS066718.1_3 602358305             C             C   0.8859
10   0_ABCC6_DS063353.1_20 602358305             A             C   0.6117

Its of class 'data.frame' and i want it converted to a 'geneset' using
as.geneSet.

So i tried:

b2 <- as.geneSet(b, gene.columns=c("Allele1...Top", "Allele2...Top",
format=c("adjacent")), because the "Allele1...Top" and "Allele2...Top" are
(combined, thats why i want to use format=c("adjacent"), is that wrong?) my
genotypes, but it won't work. Does any of you have any experience using
'as.geneSet', i can't find any examples on the net that looks like mine? My
idea of the dataset is that i want to genotype each 'Sample.ID' like this (a
simple example):

Sample.ID 1:
SNP1: AA 0
      AC 1
      CC 2

Sample.ID 2:
SNP1: AA 0
      AC 1
      CC 2

So I want to make sure that the same SNP on different Sample.ID's is genotyped
the same way (like illustrated above).

Hope you can help me.

Regards Johannes



More information about the Bioconductor mailing list