[R] create groups from data with duplicates, such that each group has a duplicate represented once

Kevin Wamae KW@m@e @end|ng |rom kemr|-we||come@org
Thu Jan 17 01:29:18 CET 2019


Hi, I have a sequencing run with ~3000 samples (attached dataset). The samples were initially tagged and amplified by PCR in duplicate. The tags used range from MID01 to MID26.

MID01-MID13 were used for pair 1 while MID14-MID26 were used for pair 2. The tags are re-used to allow samples to be pooled.

The pooling process will involve mixing samples with MID01-26 into the first group, the next group samples with MID01-26 into the second group and so on.

I'm hoping to get an R script that can create these groups such that for each group, any of the Tags appears only once. An example is shown below.

ID

TagA

TagB

group

180

MID03

MID10

group1

181

MID04

MID06

group1

182

MID05

MID07

group1

183

MID03

MID09

group2

184

MID04

MID10

group2

185

MID05

MID06

group2

186

MID01

MID06

group3

187

MID02

MID07

group3

188

MID03

MID08

group3



______________________________________________________________________

This e-mail contains information which is confidential. It is intended only for the use of the named recipient. If you have received this e-mail in error, please let us know by replying to the sender, and immediately delete it from your system.  Please note, that in these circumstances, the use, disclosure, distribution or copying of this information is strictly prohibited. KEMRI-Wellcome Trust Programme cannot accept any responsibility for the  accuracy or completeness of this message as it has been transmitted over a public network. Although the Programme has taken reasonable precautions to ensure no viruses are present in emails, it cannot accept responsibility for any loss or damage arising from the use of the email or attachments. Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of KEMRI-Wellcome Trust Programme.
______________________________________________________________________


More information about the R-help mailing list