[R] an interesting qqnorm question

WeiWei Shi helprhelp at gmail.com
Sat Apr 23 00:39:36 CEST 2005


Hi, r-gurus:

I happened to have a question in my work:

I have a dataset, which has only one dimention, like
0.99037297527605
0.991179836732708
0.995635340631367
0.997186769599305
0.991632565640424
0.984047197106486
0.99225943762649
1.00555642128421
0.993725402926564
....

the data is saved in a file called f392.txt.

I used the following codes to play around :)

k<-read.table("f392.txt", header=F)    # read into k
kk<-k[[1]]
l<-qqnorm(kk)  
diff=c()
lenk<-length(kk)
i=1
while (i<=lenk){
diff[i]=l$y[i]-l$x[i]   # save the difference of therotical quantile
and sample quantile
                           # remember, my sample mean is around 1
while the therotical one, 0
i<-i+1
}
hist(diff, breaks=300)  # analyze the distr of such diff
qqnorm(diff)

my question is:
from l<-qqnorm(kk), I wanted to know, from which point (or cut), the
sample points start to become away from therotical ones. That's the
reason I played around the "diff" list, which gives me the difference.
To my surprise, the diff is perfectly normal. I tried to use some
kk<-c(1, 2, 5, -1 , ...) to test, I concluded it must be some
distribution my sample follows gives this finding.

So, any suggestion on the distribution of my sample?   I think there
might be some mathematical inference which can leads this observation,
but not quite sure.

btw, 
> fitdistr(kk, 't')
        m              s              df
  9.999965e-01   7.630770e-03   3.742244e+00
 (5.317674e-05) (5.373884e-05) (8.584725e-02)

btw2, can anyone suggest a way to find the "cut" or "threshold" from
my sample to discretize them into 3 groups: two tail-group and one
main group.--------- my focus.

Thanks,

Ed




More information about the R-help mailing list