Regression Review
You will need to reference the slide deck on regression review to answer these questions. This deck is broken up into seven different sections. Answer the questions below (which will reference the different parts of the slide deck). Doing this will review the regression content you learned in EPSY 8251. Here are a few other resources that you can also refer to.
- Data Codebook for the pew.csv data
- R Script for the analysis presented in the slide deck
- Modeling textbook from EPSY 8251
Part I: Data and Analytic Strategy
Which of the covariates will you need to create dummy variables for?
Remind yourself how to create these dummy variables.
Part II: Exploration and Relationships
Slide 10 introduces output from the
skim()function (available in the{skimr}package). You can see the syntax for this in the script file. The variables being summarized in the output are being selected in the line:select(age:republican, -party). Explain what this syntax does.What summaries are being computed for each variable in the
skim()output?Describe the relationship depicted in the scatterplot of news exposure and news knowledge.
In the syntax to create the scatterplot (see the script file) the
theme()layer is customizing the plot. Describe howtext = element_text(size = 20)customizes this plot? What does it do? (Hint: Run the syntax with and without thetheme()layer.)Interpret the correlation between Republican and news knowledge.
Part III: Research Question 1
Remind yourself what \(\epsilon_i \overset{i.i.d.}{\sim}\mathcal{N}(0,\sigma^2_{\epsilon})\) indicates. Translate this from math to words.
The df for the F-statistic are 1 and 98. Explain why we get those values.
Interpret the intercept.
Interpret the slope.
Part IV: Research Question 2
Interpret the effect of news exposure from Model 2.
Evaluate the assumptions of Model 2.
Part VI: Research Question 3
The interpretation for the interaction effect in Model 3 was: “The effect of news exposure on political knowledge depends on education level, after controlling for the set of demographic and political covariates.” Provide an alternate interpretation of this effect. (Remember you can always interpret an interaction two ways…)
Based on the interaction plot, is the interaction between news exposure and education level ordinal or disordinal? Explain.
In the interaction plot, the effect of news exposure on political knowledge (seen in the slopes) is larger for Americans with 12 years of education than it is for Americans with 16 years of eduation. In general, the effect of news exposure on political knowledge diminishes at higher levels of education. Using the plot, describe the effect of education level on political knowledge.
Part VII: Tables for Publication
- Where in the R output do we find the Root Mean-Square Error (RMSE) estimates?
Part VIII: For Next Class…
- Answer the questions about your computer files and your organization of those files on the final slide.