The goal of this assignment is to give you experience fitting and interpreting regression models with interaction effects. Submit your responses to each of the questions below in a printed document. All graphics should be resized so that they do not take up more room than necessary and also should have an appropriate caption. This assignment is worth 12 points. (Each question is worth 1 point unless otherwise noted.)


For this assignment, you will be fitting several regression models to examine whether there are differences in the engagement level of reviewers on IMDb for Scooby-Doo episodes/movies based on which members of Mystery Inc caught the villain. To do so, you will use the data in the file scoobydoo.csv.


Preparation: Fitting Models

Create the following dummy variables for the analysis:

Then fit the following regression models. For all models, use IMDb engagement as the response variable.

You will use the output from these fitted models to answer the questions in the assignment.
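As a reminder of what dummy coding does, each dummy variable is a 0/1 indicator for one level of a categorical variable. The following is a minimal sketch in Python (standard library only), not the required workflow. Apart from caught_by, the column name, the category labels, the dummy names, and the rows are all hypothetical; the actual values in scoobydoo.csv may differ.

```python
# Hypothetical rows for illustration; the real values come from scoobydoo.csv.
rows = [
    {"caught_by": "Fred", "format": "TV"},
    {"caught_by": "Scooby", "format": "Movie"},
    {"caught_by": "Other", "format": "TV"},
]


def add_dummy(records, column, level, name):
    """Add a 0/1 indicator `name` that equals 1 when record[column] == level."""
    for rec in records:
        rec[name] = 1 if rec[column] == level else 0


# The levels and dummy names below are assumptions, not the assignment's list.
add_dummy(rows, "caught_by", "Fred", "fred")
add_dummy(rows, "format", "Movie", "movie")

print([(r["fred"], r["movie"]) for r in rows])
```

A row coded 0 on every dummy for a variable belongs to that variable's reference group.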


Description

  1. Create and report a table of pairwise correlations between engagement, each of the three dummy variables you created to represent caught_by, IMDb rating, number of catchphrases uttered, and the media format dummy variable.

  2. Based on the correlation table, can you infer whether or not there will be a sample interaction between IMDb rating and number of catchphrases uttered? Explain.
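The mechanics of a pairwise correlation table can be sketched as follows, using only the Python standard library. The variable names and all values are made up for illustration and are not taken from scoobydoo.csv; your table should include every variable listed in Question 1.

```python
import math

# Made-up values for illustration only; not taken from scoobydoo.csv.
data = {
    "engagement": [120.0, 340.0, 90.0, 410.0, 200.0],
    "imdb": [7.1, 8.0, 6.5, 8.3, 7.4],
    "catchphrases": [2.0, 5.0, 1.0, 4.0, 3.0],
}


def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


names = list(data)
# Nested dict: corr[a][b] is the correlation between variables a and b.
corr = {a: {b: pearson(data[a], data[b]) for b in names} for a in names}

# Print the matrix with a header row and one row per variable.
print(" " * 14 + "".join(f"{name:>14}" for name in names))
for a in names:
    print(f"{a:>14}" + "".join(f"{corr[a][b]:>14.2f}" for b in names))
```

Each cell summarizes the linear relationship between exactly two variables at a time.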


Table of Regression Results

  1. Examine the structure and formatting of the table in the “Presenting Results from Many Fitted Regression Models” section here. Mimic the format and structure of this table to create a table to present the numerical information from the four models you fitted in this assignment. Make sure the table you create also has an appropriate caption. If the table is too wide, change the page orientation in your word processing program to “Landscape”, rather than changing the size of the font. (2 pts.)


Model B

  1. Write the fitted regression equation for Model B.

  2. Based on the inferential results from fitting this model, is there an interaction effect between media format and IMDb rating? Explain.

  3. Interpret the coefficient associated with the interaction effect between media format and IMDb rating.

  4. Rewrite the fitted regression equation as two fitted equations: one for TV episodes and one for movies. Be sure to label each equation with the subgroup it describes.

  5. Create and include a plot of the fitted model showing the pertinent results from Model B. Be sure to appropriately differentiate between lines that you include in the plot (e.g., do not differentiate by color unless you plan to print in color). If you partial out any variables, be sure to note this in the caption.
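As a generic illustration of how an interaction model with a dummy variable splits into one equation per subgroup, consider the sketch below. The symbols are assumptions introduced here: X is a quantitative predictor, D is a 0/1 dummy, and b_0 through b_3 are fitted coefficients. Model B as specified may include additional predictors, which would carry through both subgroup equations unchanged.

```latex
\begin{align*}
\hat{Y} &= b_0 + b_1 X + b_2 D + b_3 (X \times D) \\
D = 0:\quad \hat{Y} &= b_0 + b_1 X \\
D = 1:\quad \hat{Y} &= (b_0 + b_2) + (b_1 + b_3) X
\end{align*}
```

Substituting D = 0 removes the last two terms. Substituting D = 1 folds b_2 into the intercept and b_3 into the slope for the group coded 1.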


Model C

  1. Write the fitted regression equation for Model C.

  2. Based on the inferential results from fitting this model, is there an interaction effect between number of catchphrases uttered and IMDb rating? Explain.

  3. Create and include a plot of the fitted model showing the pertinent results from Model C. Be sure to appropriately differentiate between lines that you include in the plot (e.g., do not differentiate by color unless you plan to print in color). If you partial out any variables, be sure to note this in the caption.

  4. Use the plot of Model C that you created in the previous question to help interpret the interaction effect between number of catchphrases uttered and IMDb rating.
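When both interacting predictors are quantitative, one common way to read the interaction is to compute the slope of one predictor at several fixed values of the other. This is a minimal sketch in Python under stated assumptions: the coefficients are hypothetical, not estimates from scoobydoo.csv, and the model is assumed to contain only the two predictors and their product.

```python
# Hypothetical coefficients for illustration; NOT estimates from scoobydoo.csv.
b0, b_rating, b_catch, b_inter = 10.0, 5.0, 2.0, -0.5


def predicted(rating, catch):
    """Fitted value from a model with two main effects and their product."""
    return b0 + b_rating * rating + b_catch * catch + b_inter * rating * catch


def rating_slope(catch):
    """Slope of the fitted line for rating at a fixed catchphrase count."""
    return b_rating + b_inter * catch


# One fitted line per chosen catchphrase count; the slope changes across lines.
for catch in (0, 2, 4):
    print(f"catchphrases = {catch}: slope for rating = {rating_slope(catch)}")
```

With these made-up numbers, the slope for rating is 5.0, 4.0, and 3.0 at 0, 2, and 4 catchphrases. That pattern appears as non-parallel lines when each catchphrase count is plotted as its own line.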