[R] How to change the number of bins?

David Winsemius <dwinsemius using comcast.net>
Mon Mar 11 02:39:27 CET 2019


On 3/10/19 5:29 PM, wong bowie wrote:
> You are right. This variable actually represents the number of days
> that have passed since a client was last contacted; 999 means the
> client has never been contacted.
>
> But I am not supposed to change the value, am I?


I certainly would. SAS allows one to declare that a value such as 999 
means missing, but in R the value has to be changed to NA:

# Recode the 999 sentinel as NA in the data passed to create_infotables()
is.na(Test$pdays) <- Test$pdays == 999
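
A minimal sketch of why the bins collapse and what the recode buys you, using base R only. The vector below is made-up toy data standing in for the pdays column (200 values in 1-27 plus 6000 copies of 999); the decile-based `cut()` is just one illustrative way to bin the observed values and is not claimed to be what create_infotables() does internally.

```r
# Toy stand-in for the pdays column (hypothetical values, not the real data).
set.seed(1)
pdays <- c(sample(1:27, 200, replace = TRUE), rep(999, 6000))

# With about 97% of the values equal to 999, nearly all decile cut points
# coincide at 999, so quantile-based binning has almost nothing to split on.
quantile(pdays, probs = seq(0, 1, 0.1))

# Recode the sentinel as missing.
is.na(pdays) <- pdays == 999

# Decile breaks computed from the observed (non-missing) values only;
# unique() guards against duplicated break points.
breaks <- unique(quantile(pdays, probs = seq(0, 1, 0.1), na.rm = TRUE))
bins <- cut(pdays, breaks = breaks, include.lowest = TRUE)

# Counts per bin, with the "never contacted" group shown as NA.
table(bins, useNA = "ifany")
```

Whether a given WOE routine reports the NA group as its own bin or drops it depends on the package, so it is worth checking its documentation before rerunning the analysis.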


-- 

David


>
> David Winsemius <dwinsemius using comcast.net> wrote on Sunday,
> March 10, 2019 at 10:48 PM:
>
>     Seems rather likely that 999 is not really a measured value but
>     rather is a missing value indicator.
>
>
>     -- 
>
>     David.
>
>     On 3/10/19 1:54 PM, wong bowie wrote:
>     > I wish to calculate the weight of evidence of a variable x, which is
>     > positively skewed: over 6000 of the observations are 999, but only
>     > about 200 range from 1 to 27. I used the code
>     >
>     > IV <- create_infotables(data=Test[,-1], y="class", bins=10)
>     >
>     > However, no matter what number I use for the bins parameter, I only
>     > get 2 bins, [1,27] and [999,999]. Is there any way I can look at the
>     > [1,27] range more closely, because those values represent a lot of
>     > information? The output from R is shown below:
>     >
>     > Table$pdays
>     >       pdays     N    Percent        WOE        IV
>     >   1    [1,27]  243 0.03807584  2.6743166 0.5267751
>     >   2 [999,999] 6139 0.96192416 -0.2230081 0.5707022
>     >
>     > Thank you very much!!