[R] Cleaning
    Ashta 
    sewashm at gmail.com
       
    Thu Nov 12 00:51:48 CET 2015
    
    
  
Hi all,
I have a data frame with a very large number of rows and columns.
When I looked at the data, I found several garbage values that need to be
cleaned. As a sample, here is the frequency distribution
of one variable:
  Var1 Freq
1    :    3
2    ]    6
3  MSN 1040
4  YYZ  300
5   \\    4
6    +    3
7   ?>   15
and so on.
I want to keep only those rows that contain a valid value of the variable,
in this case MSN and YYZ. I tried the following:
test <- dat[dat$Var1 == "YYZ" | dat$Var1 == "MSN", ]
but I am not getting the desired result.
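
To make the problem easier to reproduce, here is a minimal sketch on toy data. The values in it and the names `valid`, `keep`, `test_eq` and `test_in` are made up for illustration and are not from my real data. It assumes, without my having confirmed it, that the real column may contain missing values or codes padded with spaces, either of which would make the `==` comparison behave unexpectedly. The second approach, using `trimws()` and `%in%`, is only one possible alternative.

```r
# Toy data standing in for the real data frame (hypothetical values)
dat <- data.frame(
  Var1 = c("MSN", "YYZ ", ":", NA, " MSN", "]", "YYZ"),
  x    = 1:7,
  stringsAsFactors = FALSE
)

# Original attempt: a missing Var1 gives NA in the logical index, which
# produces an all-NA row, and padded codes such as "YYZ " are not matched
test_eq <- dat[dat$Var1 == "YYZ" | dat$Var1 == "MSN", ]

# Possible alternative: strip surrounding whitespace, then test membership
# in the set of valid codes; %in% returns FALSE (not NA) for missing values
valid   <- c("MSN", "YYZ")
keep    <- trimws(as.character(dat$Var1)) %in% valid
test_in <- dat[keep, ]
```

Comparing `table(dat$Var1, useNA = "ifany")` with `nrow(test_in)` might show whether any rows are being lost or added.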
 I have
Any help or ideas?