[R] unexpected GAM result - at least for me!

Duncan Murdoch murdoch at stats.uwo.ca
Mon Mar 31 14:47:48 CEST 2008


On 3/31/2008 8:34 AM, Monica Pisica wrote:
> 
> Hi
> 
> 
> I am afraid i am not understanding something  very fundamental.... and does not matter how much i am looking into the book "Generalized Additive Models"  of S. Wood i still don't understand my result.
> 
> I am trying to model presence / absence (presence = 1, absence = 0) of a species using some lidar metrics (i have 4 of these). I am using different models and such .... and when i used gam i got this very weird (for me) result which i thought it is not possible - or i have no idea how to interpret it.
> 
>> can3.gam <- gam(can>0~s(be)+s(crr)+s(ch)+s(home), family = 'binomial')
>> summary(can3.gam)
> Family: binomial
> Link function: logit
> Formula:
> can> 0 ~ s(be) + s(crr) + s(ch) + s(home)
> Parametric coefficients:
>             Estimate Std. Error z value Pr(>|z|)
> (Intercept)    85.39     162.88   0.524      0.6
> Approximate significance of smooth terms:
>           edf Est.rank Chi.sq p-value
> s(be)   1.000        1  0.100   0.751
> s(crr)  3.929        8  0.380   1.000
> s(ch)   6.820        9  0.396   1.000
> s(home) 1.000        1  0.314   0.575
> R-sq.(adj) =      1   Deviance explained =  100%
> UBRE score = -0.81413  Scale est. = 1         n = 148
> 
> Is this a perfect fit with no statistical significance, an over-estimating or what???? It seems that the significance of the smooths terms is "null". Of course with such a model i predict perfectly presence / absence of species.
> 
> Again, i hope you don't mind i'm asking you this. Any explanation will be very much appreciated.

Look at the data.  You can get a perfect fit to a logistic regression 
model fairly easily, and it looks as though you've got one.  (In fact, 
the huge intercept suggests that all predictions will be 1.  Do you 
actually have any variation in the data?)

Duncan Murdoch



More information about the R-help mailing list