[R] Data handling/optimum glm method.

abigailclifton at me.com abigailclifton at me.com
Thu Mar 29 13:12:08 CEST 2012


Hi there,

I am trying to fit a generalised linear model to some loan application and default data. The purpose of this is to eventually work out the probability an applicant will default.

However, R seems to crash or die when I run "glm" on anything greater than a 5-way saturated model for my data.

My first question: is the best way to fit a generalised linear model in R to fit the saturated model and extract the significant terms only, or to start at the null model and to work up to the optimum one?
 
I am importing a csv file with 3500 rows and 27 columns (3500x27 matrix).

My second question: is there anyway to increase the memory I have so R can cope with more analysis?

I can send my code if it would help to answer the question.

Kind regards,

AJC
Sent from my BlackBerry smartphone from Virgin Media



More information about the R-help mailing list