[R] split data into training and testing sets

Sundar Dorai-Raj sundar.dorai-raj at pdf.com
Fri Nov 11 19:18:06 CET 2005



Dhiren DSouza wrote:
> How can I split a dataset randomly into a training and testing set?  I would 
> like to have the ability to specify the size of the training set and use the 
> remaining data as the testing set.
> 
> For example 90% training data and 10% testing data split.  Is there a 
> function that will accomplish this?
> 
> Thank you,
> 
> -Dhiren
> 
> Rutgers University
> Graduate Student
> 

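See ?sample. Assuming x is a data frame (or matrix) with one observation per row:

# draw 90% of the row indices at random, without replacement
sub <- sample(nrow(x), floor(nrow(x) * 0.9))
training <- x[sub, ]   # the sampled rows
testing <- x[-sub, ]   # all remaining rows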
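
If you want to reuse this with different split sizes, the same idea can be wrapped in a small function. This is only a sketch: split_train_test, train_frac and seed are names I made up for illustration (they are not part of base R), and the built-in iris data stands in for your own x. It uses a logical index instead of x[-sub, ], since negative indexing with an empty index vector would return no rows at all for a very small data set. One side effect is that the training rows come back in their original order rather than in sampled order.

```r
# Hypothetical helper (not part of base R): randomly split the rows of `x`.
split_train_test <- function(x, train_frac = 0.9, seed = NULL) {
  stopifnot(train_frac > 0, train_frac < 1)
  if (!is.null(seed)) set.seed(seed)  # optional, makes the split reproducible
  n <- nrow(x)
  idx <- sample(n, floor(n * train_frac))  # row indices for the training set
  in_train <- seq_len(n) %in% idx          # TRUE for rows that were sampled
  list(training = x[in_train, , drop = FALSE],
       testing  = x[!in_train, , drop = FALSE])
}

# Example with the built-in iris data (150 rows), 90% / 10% split
parts <- split_train_test(iris, train_frac = 0.9, seed = 1)
nrow(parts$training)  # 135
nrow(parts$testing)   # 15
```

Passing a seed is optional. Without it, each call gives a different random split.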

HTH,

--sundar



