When acceptance is really rejection: Death by Green Pants
"The model is non-significant, therefore my theory is supported."
Huh?
Just when you thought it was safe to get back into statistics… It took you two years of graduate school, but now you have it down. Low p-value = good: relationship detected, publication, tenure, Abercrombie & Fitch models at your feet.
High p-value = no relationship, no publications, no money, dating the creepy guy next door.
Enter Hosmer (and Lemeshow) to screw things up.
There are a whole bunch of reasons you might want to do a logistic regression (no, I'm serious). If you want to predict a categorical dependent variable like death, dropout or watching Afghan Star. If you were going to do a propensity score match, you would start with logistic regression. If you plain can't think of anything else to do with your evenings.
The first thing would be to see whether your dependent variable has a relationship with your grouping variable, or whether you really are wasting your time. Okay, now that that's settled: you have found that people seen in hospitals with Intensive Care Units are more likely to die than those seen at other hospitals.
You also want to see if the variables on which they differ have anything to do with the outcome. For example, I ran an analysis where I coded patients' favorite colors of pants: blue, brown, white, black or green (seriously, who buys green pants?). People who went into intensive care were more likely to own green pants. To test whether this relationship is significant, I ran a logistic regression with death as the outcome variable and pants color as the predictor.
In SPSS, you go to ANALYZE > REGRESSION > BINARY LOGISTIC.
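If you would rather type than click, the same model can be sketched outside SPSS. Here is a minimal sketch in Python using statsmodels; the file name and column names (hospital.csv, death, pants_color) are made up for illustration:

```python
# Minimal sketch: logistic regression of death on pants color.
# "hospital.csv", "death" (0 = survived, 1 = died) and "pants_color"
# are hypothetical names, not anything from the original analysis.
import pandas as pd
import statsmodels.api as sm

df = pd.read_csv("hospital.csv")

# Dummy-code the five pants colors, dropping one as the reference category
X = pd.get_dummies(df["pants_color"], drop_first=True).astype(float)
X = sm.add_constant(X)  # intercept term

model = sm.Logit(df["death"], X).fit()
print(model.summary())  # coefficients, standard errors, Wald p-values
```

Dropping one dummy as the reference category is the same thing SPSS does when you declare a categorical covariate; every coefficient is then read as "compared to people in the reference pants color."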
So, the Hosmer and Lemeshow test is statistically significant, with a chi-square of 349.06, df = 4 and p < .001. Is that exciting? Do I immediately publish an article on "The American Apparel Effect" and how poor fashion taste is dangerous to your health?
Not so fast. You see, the Hosmer and Lemeshow statistic tests the goodness of fit of the model predictions to the observed data: the null hypothesis is that your model fits. If you reject the hypothesis that your model fits the data, that is bad!
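Under the hood, the test sorts cases into groups (traditionally ten) by predicted risk and compares observed to expected deaths in each group with a chi-square on g minus 2 degrees of freedom. SPSS computes it for you; here is a rough, hypothetical sketch of the computation in Python, in case you want to see the moving parts:

```python
# Rough sketch of the Hosmer-Lemeshow statistic (not a library call;
# neither statsmodels nor scipy ships this test directly).
# y = observed 0/1 outcomes, p_hat = model-predicted probabilities.
import numpy as np
import pandas as pd
from scipy.stats import chi2

def hosmer_lemeshow(y, p_hat, g=10):
    bins = pd.qcut(p_hat, q=g, duplicates="drop")  # group cases by predicted risk
    grouped = pd.DataFrame({"y": y, "p": p_hat, "bin": bins}).groupby("bin", observed=True)
    obs = grouped["y"].sum()    # observed deaths per group
    exp = grouped["p"].sum()    # expected deaths per group
    n = grouped["y"].count()
    # Chi-square over groups; denominator is n * pi * (1 - pi), pi = exp / n
    stat = (((obs - exp) ** 2) / (exp * (1 - exp / n))).sum()
    dof = len(n) - 2
    return stat, chi2.sf(stat, dof)  # HIGH p-value = model fits = good
```

The duplicates="drop" keeps qcut from choking when many cases share the exact same predicted probability, which happens constantly with purely categorical predictors like pants color.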
In my next logistic regression, I used age over 65 as a dichotomous variable. My second variable was the Dr. MechOth scale. Dr. MechOth (not her real name) was a friend of mine back when I was a young Assistant Professor who occasionally hung out in bars. Dr. MechOth rated all men on a 1 to 3 scale, where 1 = "Yes", 2 = "Maybe if I was drunk" and 3 = "I couldn't get drunk enough".
The results of the Hosmer and Lemeshow test shown below (chi-square = 4.52, df = 3, p > .20) show that the data fit the model somewhat, although it could be better.
Does this mean that in logistic regression high p-values are always a good thing? Nope, that would be too easy for you to remember. In fact, no sooner have we inverted our understanding of p-values than it is time to do it again. When interpreting the COEFFICIENTS, a low p-value is a good thing. So, which of Dr. MechOth's groups a man falls into, and being really, really old, are both related to the probability of death.
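To make those coefficients easier to read, you can exponentiate them into odds ratios, which is the same quantity SPSS prints in its Exp(B) column. Continuing the hypothetical sketch above (model is the fitted result from the first code block):

```python
# Continuing the hypothetical sketch: coefficients as odds ratios.
# A low p-value here (unlike the Hosmer-Lemeshow one) is the good kind.
import numpy as np
import pandas as pd

print(pd.DataFrame({
    "odds_ratio": np.exp(model.params),  # what SPSS labels Exp(B)
    "p_value": model.pvalues,            # Wald test per coefficient
}))
```

An odds ratio above 1 means that variable pushes the odds of death up (being really, really old), and one near 1 with a high p-value means it does nothing much (green pants, alas).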
Sadly, my original hypothesis about death by green pants is not supported, and all I have discovered is that if you are really, really old and no one would go home from a bar with you even if you were the last person on earth, you are more likely than hot, young people to keel over dead from natural causes or suicide, whichever comes first.
I do not think I will be winning the Nobel Prize for Medicine any time soon. I wonder if that guy next door likes Cup-A-Noodle soup.
Which is cause and which is effect is still a tough call, even after finding a great correlation.
Or even IF there is a cause and effect at all. If (to pull a number out of thin air) 97% of traffic accident victims had eaten a pickle the week before, would that tell you anything? It might tell you it was summertime, but that is about it.
PS Statistically speaking: half of all brain surgeons are below average in skill & competence!!!