Assignment 06

LMER: UNCONDITIONAL LONGITUDINAL MODELS

Published

January 13, 2023

This goal of this assignment is to give you experience working with mixed-effects regression models to analyze longitudinal data. In this assignment, you will use the data from the file nhl.csv to examine longitudinal variation in cost of attending an NHL game.

Instructions

Submit either your QMD and HTML file or, if you are not using Quarto, a PDF file of your responses to the following questions. Please adhere to the following guidelines for further formatting your assignment:

  • All graphics should be resized so that they do not take up more room than necessary and should have an appropriate caption.
  • Any typed mathematics (equations, matrices, vectors, etc.) should be appropriately typeset within the document using Markdown’s equation typesetting.
  • All syntax should be hidden (i.e., not displayed) unless specifically asked for.
  • Any messages or warnings produced (e.g., from loading packages) should also be hidden.

This assignment is worth 13 points.


Preparation

After importing the data set, create a new variable called c_year that centers the year values at 2002. To center a variable we subtract the value we want to center at. (e.g., \(\mathtt{c\_year}=\mathtt{year}-2002\)). This variable will represent the number of years since 2002.

For all analyses in this assignment, unless otherwise requested, use the c_year variable and not the year variable.


Data Exploration

  1. Create and report a plot to display the cost of attending an NHL game (FCI) over time for each team (team profiles). In this plot, all teams should be in the same panel. Also add the profile based on the mean cost of attending an NHL game FCI over time. Make the teams’ profiles slightly transparent so that the mean profile is easily visible.

  2. Based on the average growth profile, describe how the average cost of attending an NHL game has changed over time. Your description of the growth pattern should allude to the functional form for the model.


Fitting and Evaluating Models

Fit the following three models. For each of the growth models, include only a random-effect of intercept in the model. Use maximum likelihood estimation to fit all the models.

  • Model A: Unconditional random intercepts model
  • Model B: Unconditional linear growth model
  • Model C: Unconditional quadratic growth model
  1. Create and report a table that includes the estimated variance components for each of the three fitted models.

  2. Use the estimated variance components from Model A to determine the proportion of variation unaccounted for at the between- and within-team levels. Report these values.

  3. Use the estimated variance components to determine the variation accounted for at the between- and within-team levels based on Model B. Report these values.

  4. Which source of variation did Model B most account for? Explain why you would expect this based on the predictor included in Model B.


Adopting and Evaluating an Unconditional Growth Model

  1. Based on the AICc, which model (A, B, or C) has the most empirical evidence? Explain.

  2. Create and report a density plot of the residuals and a scatterplot of the residuals versus the fitted values for the model you identified in Question 8. You can use the augment() function from {broom.mixed} to obtain both the fitted values (.fitted) and residuals (.resid). We evaluate these residual plots using the same methods we do when we fit an lm() model, so add any confidence envelopes, smoothers, or other guides to these plots that will aid in your evaluation.

  3. Evaluate the assumptions of normality, homogeneity of variance, and that the average residual is equal to zero based on the plots you created in Question 9.


Reporting the Results from the Adopted Model

  1. Write the global fitted equation for the model you adopted in Question 7.

  2. Write the team-specific fitted equation for the Minnesota Wild based on the model you adopted in Question 7.

  3. Create a plot showing the predicted cost of attending an NHL game over time based on the global fitted model you reported in Question 10. Be sure to include a caption.

  4. Add a line to the plot you created in Question #12 (don’t re-create the plot) showing the predicted cost of attending a Minnesota Wild game. Update the caption by adding an additional sentence to differentiate the Minnesota Wild curve from the global curve.