What would you do if one person changed your results?

ByAnnMaria De Mars December 30, 2017

This is a hypothetical question, but it could easily happen. Let me give you a real example.

Using a mobile phone game, we administered a standard depression screening measure (CESD-C) to 18 children living on or near an American Indian reservation. All children had a family member who was an alcoholic or addicted to drugs. I decide to do a one-sample t-test of the hypothesis that the mean for this population = 15, which is the cutoff value for symptoms of depression . Here is the code but I didn’t code it (more about that later).
PROC TTEST DATA=cesd_score SIDES=2 H0=15 plots(showh0);var CESDTotal;

The results are shown below, with a mean of 21 and a range from 3 to 38.

You can see that the t-value of 2.34 is significant at p < .05, that is the mean for this sample is significantly different than the cutoff score of 15. You can see more results here. What if it hadn’t been, though? What if, instead of .0317 the probability was .0517?

What if dropping out this one person with a score of 3 changed the result? In fact, it did change the mean to 22, and the p-value to .0115 . You can see all of those results here.

So, let’s say that hypothetically dropping out this outlier WOULD change your results. Would you do it? Would you report it?

Think about it. In a couple of days, I will give you my answer and my justification.

As to not having coded it – I used the tasks in SAS Studio which I found to be pretty fun, but more on that in my next post.

Play Aztech: Meet the Maya – for your iPad in the app store, in Spanish and English. The second in our series of bilingual games teaching basic statistics and Latin American history. Only $1.99

P.S. There is a third possibility here, which is changing the test from a two-tailed test to one-tailed test. Surely, an argument can be made that we don’t expect children with a family member who is addicted to alcohol or drugs to be less depressed than the cut-off score? They would either be equal or more depressed. Personally, I don’t buy that argument. I could accept that the sample might be more depressed than the average but I’m not sure one could justify that the mean necessarily MUST be more than the cut-off for depressive symptoms.

computer games | Software

PHP Rambling

ByAnnMaria De Mars October 12, 2014October 12, 2014

I was reading a book on PHP just to get some ideas for a re-design I’m doing for a client, when I thought of this. Although I think of PHP as something you use to put stuff into a database and take it out – data entry of client records, reports of total sales –…

Software | Technology

SAS Enterprise Guide as a Programming Aid – Finding Functions

ByAnnMaria De Mars November 1, 2012

While I believe SAS Enterprise Guide was developed to make statistical analysis easier for non-programmers, it is also a useful tool for experienced programmers. Often, I find myself thinking, “I KNOW there is a function that does that ….” but I just can’t remember exactly what it is. Take today as an example. I have…

statistics

Multicollinearity statistics with SPSS

ByAnnMaria De Mars May 28, 2011April 5, 2017

“Can you explain multicollinearity statistics?” she asked. Why, yes, yes I can. First of all, as noted in the Journal of Polymorphous Perversity, “Multicollinearity is not a life-threatening condition except when a depressed graduate student employs multiple, redundant measures.” What is multicollinearity, then, and how do you know if you have it? Multicollinearity is a…

Software | statistics | Technology

MANOVA from beginning to end : Creating the scales

ByAnnMaria De Mars June 14, 2017June 15, 2017

Last time, we saw how to recode variables to score answers correct or incorrect, on a rating scale and weighted by importance. Today, we’re going to look at creating some scales from those variables because for reasons I’m sure I have written about at some point in the past, single items are usually not very…

Software | statistics

Statistics Guru Predicts Republican Sweep! With Proc GMAP

ByAnnMaria De Mars April 2, 2016April 2, 2016

Esteemed statistics guru, Dr. Nathaniel Golden has some sobering news for Democrats. His latest models predict a Republican blow out. As can be seen by the map below, the Republican front-runner has tapped into the mood of resentment in the country’s non-elites. When the dust has settled, only the two highest earning states in the…

Software | statistics | Technology

Logistic regression using SAS On-Demand with SAS Enterprise Guide – a movie and a rant

ByAnnMaria De Mars December 6, 2012

If you have a mad desire to do logistic regression with SAS On-Demand with SAS Enterprise Guide, here is a movie that shows how to do it. It is a .avi file so you may want to just download it and run it on your PC. Here is why the movie is not all that…

So, let’s say that hypothetically dropping out this outlier WOULD change your results. Would you do it? Would you report it?

Play Aztech: Meet the Maya – for your iPad in the app store, in Spanish and English. The second in our series of bilingual games teaching basic statistics and Latin American history. Only $1.99

Similar Posts

Leave a Reply