statistics

6 things about statistics everyone must know

ByAnnMaria De Mars July 12, 2011

What must everyone know and why must everyone know that?

This was one of the two central questions in Dr. J.T. Dillon’s course on Curriculum and Instruction, which I thought was going to be a colossal waste of time. After all, I was going to be a statistician, why would I need a course on curriculum, the most boring topic on earth? Like most of the courses I thought would be a waste of time and which the university in its infinite wisdom require I take, they were right and I was wrong. (Why would someone who was planning a career as a professor need to know something about teaching, gee, I can’t imagine!)

Usually when I am asked if I’d be available to teach a course, I say, “No”, partly because it doesn’t pay that well but mostly because I usually am NOT available unless you ask at least six months in advance. Well, they did. I haven’t taught a graduate statistics course in a few years, so I’m really looking forward to it. Most of these doctoral students will be writing a dissertation and then, for the rest of their careers, reading research and making decisions based on their evaluation of that research. None of them are mathematics or statistics majors and none are planning to do a lot of scientific research themselves. The question then, is what do they need to learn?

1. Descriptive statistics, distributions and data visualization – In analyzing their own data they need to get a feel for it. They need to understand what the average person is like, the variance among their population, and identify the outliers.

2. Correlation and regression – A basic understanding and application of statistics is the knowledge of relationships, how you measure relationships, how you interpret them.

3. Group mean differences – Yes, mathematically you can couch this as a regression problem and they should probably understand a little about that. Definitely need to know how to compare groups.

4. Hypothesis testing – Understand the difference between statistical significance and practical significance. This is usually an intuitive concept when teaching physicians because they are familiar with the idea of something being clinically significant. Other professions don’t always get this so easily. To understand statistical significance, you need to know something about probability. To understand probability, it helps to know something about combinations and permutations.

5. How to analyze categorical data – Sometimes the world realize does fit in neat little boxes. You have never had cancer, you have cancer now or you are in remission. You’re married or you’re not (and no, that weekend in Cabo doesn’t count, unless you were visited by the Holy Ghost) . For those times you need to understand chi-square and logistic regression. Maybe a little bit of odds ratios.

6. How to use some kind of software to compute results – It would be nice to have minions to do this for you. If you have the budget you can hire someone like me. Sadly, “graduate student” is one of the lower paid occupations, so it behooves the students to learn how to do at least some of the analysis themselves.

So, there’s my syllabus. I still don’t have a textbook. I’m debating on The Statistical Sleuth , supplemented by some other resources.

Anyone else have any ideas for topics, exercises or readings, please dive in.

I’m really looking forward to this semester. It’s going to be great!

Software | statistics

PPS sampling, PROC SURVEYSELECT and not getting naked in church

ByAnnMaria De Mars August 23, 2012August 24, 2012

As statisticians, we like to say that statistics is everywhere. Here is an example. Regular readers of this blog might know that my darling daughter number three is the world champion in mixed martial arts. There is a very wide gap in the general discourse at mixed martial arts events and, say, the Joint Statistical…

statistics

Survivor Functions, Hazard Functions and Pictures

ByAnnMaria De Mars October 21, 2011October 21, 2011

Unfamiliar jargon like Kaplan-Meier curves, PROC PHREG, right-censored and hazard functions can be daunting to the newcomer. Survival analysis is really quite straightforward; it is simply a set of statistical techniques used when the focus is “time to event”. The event can be death, divorce, arrest, substance abuse or literally anything else. You’ve been wanting…

statistics

Homer Simpson & Ketchup Packet Sex Meet World Statistics Day

ByAnnMaria De Mars October 20, 2010October 20, 2010

Statistics is definitely in. In the last month I’ve gotten three invitations to “tech events” from organizations which will remain nameless on the grounds that I may want to do business with them some time in the current century. All of them wanted to do something with statistics. And apps. And social media. And whatever…

Dr. De Mars General Life Ramblings | statistics

Replication, Correlation and Causation

ByAnnMaria De Mars March 31, 2015

There is not nearly enough replication in scientific research. It’s unfortunate that funding agencies and academic journals always want to see a new twist – a different technique, a different population. Personally, I’m very interested in reading studies that say: “I did the exact same study as Mary Lou Who and I found pretty much…

statistics

Categorical variables don’t make no never mind? Not!

ByAnnMaria De Mars December 31, 2010December 31, 2010

Back in 1976, Howard Wainer published an article in Psychological Bulletin entitled, “Estimating linear coefficients in linear models: It don’t make no never mind.” Since I read this sometime in graduate school and I took my last statistics course in 1989, spending the rest of the time writing my dissertation, I believe I should win…

computer games | Software | statistics | Technology

PROC FREQ for data analysis (sort of)

ByAnnMaria De Mars July 17, 2015July 17, 2015

Previously, I discussed PROC FREQ for checking the validity of your data. Now we are on to data analysis, but, as anyone who does analysis for more than about 23 minutes can tell you, cleaning your data and doing analysis is seldom a two-step process. In fact, it’s more like a loop of two steps,…

5 Comments

Rebecca says:

July 13, 2011 at 12:02 pm

The two things I had to teach myself/learn the hard way were Logistic Regression and how to look at Residuals. You probably won’t find a basic textbook that has those in it (the Gravetter and Wallnau book I teach out of doesn’t). However a lot of dissertation questions use logistic regression, and even just a basic awareness of residuals will point them in the right direction. I teach SPSS in my class, and there are a few decent books out there on that topic as well. (Green and Salkind or Mallery or Yocky are all one’s I’ve looked at and thought would be useful.) There are a couple of books out there that attempt to teach Statistics and SPSS at the same time, but they tend not to do either as well as I wanted.
AnnMaria says:

July 13, 2011 at 12:24 pm

Speaking of residuals

http://www.thejuliagroup.com/blog/?p=1523

(-:

Yes, I know people who use the Green and Salkind book. It’s pretty good.
Diane Lennox, SAS says:

July 14, 2011 at 11:22 am

I have to credit SAS’ Anne Milley (@annemilley)for telling me about “The Lady Sipping Tea” — more a why than how book. Supplemental reading? http://sww.sas.com/gobot/564
Meta Brown says:

July 14, 2011 at 7:11 pm

Hammer in the variance concept, it doesn’t sink in easily. It’s a good exercise to sketch different distributions and let students figure out which has greater/lesser variance.
A. C. says:

August 14, 2011 at 4:04 am

ANOVA, SPSS, and graphic knowledge never hurts! A Good basic book is Research Methods for Business Students, fifth edition by M. Saunders, P. Lewis, and A. Thornhill.

Pearson has a good book for aiding in graphic knowledge and SPSS called Statistical Persuasion.

Similar Posts

5 Comments

Leave a Reply