[R] Quality of fit statistics for NLS?

John C Nash nashjc at uottawa.ca
Fri Jan 27 18:55:26 CET 2012


Peter and Bert have already made some pertinent remarks. This comment is a bit tangential,
but in the same flavour. As they note, it is "goodness of fit relative to what?" that is
important.

As a matter of course when doing nonlinear least squares, I generally compute the quantity
   [1 - residual_sumsquares/(total sum of squares)].

In linear modelling this is usually called R-squared, but I don't want to create a
firestorm of complaints by suggesting it be called that here. I'm not doing anything here
other than a check for silly results. All I'm suggesting is that a comparison to the model
that is the mean of the variable being fitted is a minimal sanity check. Surely we should
be able to do better than the mean?  It's saved me from wasting time on several occasions,
sometimes because the model proposed was really wrong, sometimes because there was a
nuisance local minimum well away from a solution, and most often due to a silly typo in
setting things up. And it can usually be computed within a cat() statement.

Best, John Nash


On 01/27/2012 06:00 AM, r-help-request at r-project.org wrote:
> Message: 81
> Date: Fri, 27 Jan 2012 10:58:04 +0100
> From: peter dalgaard <pdalgd at gmail.com>
> To: Bert Gunter <gunter.berton at gene.com>
> Cc: Max Brondfield <max.brondfield at gmail.com>, r-help at r-project.org
> Subject: Re: [R] Quality of fit statistics for NLS?
> Message-ID: <BDC6D36D-F152-41E0-87DC-38A28CCF3B53 at gmail.com>
> Content-Type: text/plain; charset=windows-1252
> 
> 
> On Jan 26, 2012, at 22:51 , Bert Gunter wrote:
> 
>> > Inline below.
>> > 
>> > -- Bert
>> > 
>> > On Thu, Jan 26, 2012 at 12:16 PM, Max Brondfield
>> > <max.brondfield at gmail.com> wrote:
>>> >> Dear all,
>>> >> I am trying to analyze some non-linear data to which I have fit a curve of
>>> >> the following form:
>>> >> 
>>> >> dum <- nls(y~(A + (B*x)/(C+x)), start = list(A=370,B=100,C=23000))
>>> >> 
>>> >> I am wondering if there is any way to determine meaningful quality of fit
>>> >> statistics from the nls function?
>>> >> 
>>> >> A summary yields highly significant p-values, but it is my impression that
>>> >> these are questionable at best given the iterative nature of the fit:
>> > No. They are questionable primarily because there is no clear null
>> > model. They are based on profile likelihoods (as ?confint tells you),
>> > which may or may not be what you want for "goodness of fit."
....



More information about the R-help mailing list