Plotting Agreement with Kappa Plots from PROC FREQ
In assessing whether our Fish Lake game really works to teach fractions, we collect a lot of data, including a pretest and a post-test. We also use many types of items, including a couple of essay questions. Being reasonable people, we are interested in the extent to which the ratings on these items agree.
To measure agreement between two raters, we use the kappa coefficient. Kappa ranges from -1 to 1, with 1 indicating perfect agreement, 0 indicating exactly the agreement that would be expected by chance, and negative values indicating less agreement than would be expected by chance. PROC FREQ produces two types of kappa coefficients. When there are only two categories, PROC FREQ produces only the simple kappa coefficient. When more than two categories are rated, a weighted kappa is also produced, which credits ratings in categories closer together as partial agreement and ratings at opposite extremes as no agreement.
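In case the formula is hazy, the simple kappa is just the observed agreement adjusted for the agreement you would expect by chance:

\[ \kappa = \frac{P_o - P_e}{1 - P_e} \]

where P_o is the proportion of answers on which the two raters agree exactly and P_e is the proportion of agreement expected by chance, computed from each rater's marginal distribution of scores. Weighted kappa uses the same idea, but gives partial credit when the two ratings are close rather than identical.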
The code is really simple:
ODS GRAPHICS ON;
PROC FREQ DATA=datasetname;
   TABLES variable1*variable2 / PLOTS=KAPPAPLOT;
   TEST AGREE;
RUN;
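If you want something you can run end to end, here is a minimal sketch with made-up data: two raters each score two essay items as 0 (incorrect), 1 (partially correct) or 2 (correct). The dataset, the item and rater variable names, and the scores themselves are all placeholders of mine, and I have added the AGREE option, which requests the kappa statistics, along with an item variable so there is more than one table for the kappa plot to summarize.

/* Hypothetical data: two raters score two essay items
   0 = incorrect, 1 = partially correct, 2 = correct */
DATA essay_scores;
   INPUT item rater1 rater2 @@;
   DATALINES;
1 0 0  1 0 0  1 0 0  1 0 0  1 0 0  1 0 0  1 0 1  1 1 0
1 1 1  1 1 1  1 1 1  1 1 2  1 2 1  1 2 2  1 2 2  1 2 2
2 0 0  2 0 0  2 0 0  2 0 0  2 0 0  2 0 0  2 0 0  2 0 1
2 0 1  2 1 1  2 1 1  2 1 2  2 2 1  2 2 2  2 2 2  2 2 2
;

ODS GRAPHICS ON;
PROC FREQ DATA=essay_scores;
   /* AGREE requests kappa and weighted kappa for each square table;
      with item as a stratification variable, the kappa plot displays
      the per-item kappa statistics with their confidence limits */
   TABLES item*rater1*rater2 / AGREE PLOTS=KAPPAPLOT;
   TEST AGREE;
RUN;
ODS GRAPHICS OFF;

The TEST AGREE statement adds the asymptotic tests of whether the simple and weighted kappa coefficients differ from zero.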
Including the ODS GRAPHICS ON statement and the PLOTS=KAPPAPLOT option on your TABLES statement will give you a plot showing both the agreement between raters and the distribution of ratings. Personally, I find the kappa plots, like the example below, to be pretty helpful.
This visual representation of the agreement shows that there was a large amount of exact agreement (dark blue shading) for incorrect answers, scored 0, with a small percentage showing partial agreement and very few showing no agreement. With three categories, only exact or partial agreement is possible for the middle category. Two other take-away points from this plot are that agreement is lower for correct and partially correct answers than for incorrect ones, and that the distribution is skewed, with a large proportion of answers scored incorrect. Because it is adjusted for chance agreement, kappa is affected by the distribution of ratings across categories. If each rater scores 90% of the answers correct, the two raters will both mark an answer correct 81% of the time by chance alone, so the observed agreement has to be extremely high before kappa differs significantly from chance. The kappa plot shows agreement and distribution simultaneously, which is why I like it.
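To make that concrete, here is the arithmetic for a two-category (correct/incorrect) version of that scenario, where each rater marks 90% of the answers correct:

\[ P_e = (0.9)(0.9) + (0.1)(0.1) = 0.82, \qquad \kappa = \frac{0.95 - 0.82}{1 - 0.82} \approx 0.72 \]

In other words, even raters who agree on 95% of the answers only reach a kappa of about .72, and observed agreement has to climb well above 82% before kappa moves very far from zero.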
———
Want to play the game? You can download it here, as well as our game for younger players, Spirit Lake.