[BioC] What to do with multiple probes?

Sean Davis sdavis2 at mail.nih.gov
Wed Nov 30 12:23:45 CET 2005


On 11/30/05 5:59 AM, "krasikov at science.uva.nl" <krasikov at science.uva.nl>
wrote:

> Hi
> Thanks Robert and Sean for your comments on my problem.
> 
> 
> Robert Gentleman wrote:
>> Hi,
>>  Sean has already answered some of your questions, but I will provide a
>> few of my thoughts on this.
>> 
>>  1) there is little discussion because it is a reasonably difficult
>> topic and there is not clear cut answers, besides "it depends" and it
>> does depend on a lot of different things.
>> 
>>  For example, you might exclude the information on some probe sets if
>> they are far from the poly-A tail and dT-priming was used, if random
>> priming was used, then all should be equally good (but I am not aware of
>> a comprehensive comparison).
>> 
>>  In some cases, depending on the data, processing etc, you can develp
>> tools for comparing duplicate probe sets and combining the information
>> to get better estimates for whether genes are expressed and at what
>> levels (you could compare the probes and see if they are unique in the
>> genome, for example). In these situations, using R is a good thing,
>> since you can pretty much do any reasonable analysis, but you need to
>> know some statistics and some programming to do it, and there is no
>> clear recipe to follow.
> 
> I have complete bacterial genome custom Agilent microarrays.
> In my case custom means that we design all probes ourself (not Agilent),
> with assigning quality-scores to the probes, and have tried to avoid
> "poor" probes.
> 
> The quality metrics for the probes should be equally good. It is random
> priming experiment. This is an bacterial platform, so no tissue
> specificity is possible, more over I may expect that there are not to
> much biological variation between biological replicates.
> 
> Taking into the account above  discussion about multiple probes, I guess
> I can only make decisions individually gene-by-gene.
> For that I can imagine the design in my point 2.
> My question is: what kind of script should be to rearrange MAlist and
> save it in text format?

Just to be clear--your multiple probes per gene are different and not
identical for each gene?  If that is the case, then I think that you will
need to write the script yourself, as I don't think one is available
"off-the-shelf".  I would look at commands like "reshape", "split", and
"aggregate" for good general purpose building blocks for jobs like this.

Sean



More information about the Bioconductor mailing list