statistics

The Multivariate Social Scientist: Book Review & Notes on Generalized Linear Models

By AnnMaria De Mars, October 11, 2014

I’ve been looking high and low for a supplemental text for a course on multivariate statistics, and I found this one:

The Multivariate Social Scientist, by Graeme Hutcheson & Nick Sofroniou

They are big proponents of generalized linear models. In fact, the subtitle is “Introductory statistics using generalized linear models”, so if you don’t like generalized linear models, you won’t like this book.

I liked this book a lot. Because this is a random blog, here is day one of my random notes.

A generalized linear model has three components:

The random component is the probability distribution assumed to underlie the response variable (y).
The systematic component is the fixed structure of the explanatory variables, usually linear (x1, x2 … xn).
The link function maps the systematic component on to the random component.
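
In symbols, one common way to write this down (the symbols μ and g are assumed here for illustration, not quoted from the book): if μ is the expected value of the response y and η is the linear combination of the explanatory variables, the link function g ties the two together.

$$
g(\mu) = \eta, \qquad \mu = g^{-1}(\eta), \qquad \mu = E(y)
$$

Going from η back to the scale of y means applying the inverse of the link.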

The systematic component takes the form

$$
\eta = \alpha + \beta_1 x_1 + \beta_2 x_2 + \dots + \beta_n x_n
$$

They use η to designate the linear predictor. When the link function is identity, as in multiple regression, η is the same thing as y-hat. I know you were dying to know that.

Obviously, since that IS a multiple regression equation (which could also be used for ANOVA), when you have linear regression, the link function is actually identity. With logistic regression, it is the logit function, which sets the log odds of the response equal to the systematic component.
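
Here is a minimal sketch in Python of the same linear predictor run through the two links just mentioned. The coefficients, the predictor values, and the function names are all made up for illustration; nothing here comes from the book.

```python
import math

def linear_predictor(alpha, betas, xs):
    """Systematic component: eta = alpha + sum of beta_j * x_j."""
    return alpha + sum(b * x for b, x in zip(betas, xs))

def identity_inverse(eta):
    """Identity link (linear regression): the prediction is eta itself."""
    return eta

def logit(p):
    """Logit link: the log of the odds p / (1 - p)."""
    return math.log(p / (1.0 - p))

def inverse_logit(eta):
    """Inverse of the logit link: turns eta into a probability."""
    return 1.0 / (1.0 + math.exp(-eta))

# Hypothetical intercept, slopes, and one case's predictor values.
alpha, betas, xs = -1.0, [0.5, 0.25], [2.0, 4.0]

eta = linear_predictor(alpha, betas, xs)  # -1 + 0.5*2 + 0.25*4

y_hat_linear = identity_inverse(eta)      # same scale as y in linear regression
p_hat_logistic = inverse_logit(eta)       # predicted probability in logistic regression

print(eta, y_hat_linear, round(p_hat_logistic, 4))
print(round(logit(p_hat_logistic), 6))    # the log odds recover eta
```

The same η gives a predicted value directly under the identity link, but has to go through the inverse logit before it can be read as a probability.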

The reason I think this is such a good book for students taking a multivariate statistics course is that it relates to what they should know. They certainly should be familiar with multiple regression and logistic regression, and understand that the log of the odds is used in the latter.

The book also discusses the log link used in loglinear analyses, which I don’t necessarily assume every student will have used. I don’t say that as a criticism, merely an observation.
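
For students who have not met the log link, a rough sketch in Python may help. It assumes a count outcome with a single 0/1 explanatory variable, and the coefficients and function name are invented for the example. With a log link, the expected count is exp(η), so a one-unit change in x multiplies the expected count rather than adding to it.

```python
import math

def expected_count(alpha, betas, xs):
    """Log link: log(mu) = eta, so mu = exp(eta)."""
    eta = alpha + sum(b * x for b, x in zip(betas, xs))
    return math.exp(eta)

# Hypothetical coefficients on the log scale.
alpha = math.log(10.0)    # baseline expected count of about 10
betas = [math.log(2.0)]   # x = 1 roughly doubles the expected count

mu_x0 = expected_count(alpha, betas, [0.0])
mu_x1 = expected_count(alpha, betas, [1.0])

# The ratio of expected counts is exp(beta), here about 2.
print(round(mu_x0, 6), round(mu_x1, 6), round(mu_x1 / mu_x0, 6))
```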

2 Comments

yop says:

October 11, 2014 at 5:22 am

eta is the linear predictor but not on the scale of the outcome variable. So y_hat is inv_link(eta), not eta.

AnnMaria says:

October 11, 2014 at 2:31 pm

It depends. If you are thinking in terms of multiple regression, which is where most students begin the course, then they are used to seeing that equation = y_hat because the link function is identity, and that equation + error = the actual y.

You’re right, though, that the whole point of generalized models is to generalize beyond that.

As you’ve probably guessed, sometimes I write this blog as sort of thinking out loud while working on lecture notes for an upcoming class. I’m assuming that many students will be used to seeing the same equation in a different context.

Your point helps clarify it, though. Thank you.
