Question: Using 2010 GSS data, regress education (EDUC, Y) on gender (FEMALE, X1), age (AGE, X2), and number of siblings (SIBS, X3). Recode SIBS (to SIBSNEW) so that everyone with 8 or more siblings has a value of 8 for the variable.
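The recode described above is a top-coding step: every value at or above the cap is replaced by the cap. A minimal sketch in plain Python follows; the function name `topcode` and the sample values are hypothetical, and it assumes missing-data codes have already been converted to `None` (otherwise they would be recoded to 8 as well).

```python
def topcode(values, cap=8):
    """Return a copy of `values` with every value at or above `cap` set to `cap`.

    Missing values (None) are passed through unchanged.
    """
    return [None if v is None else min(v, cap) for v in values]


# Hypothetical SIBS values, not taken from the GSS file.
sibs = [0, 3, 8, 12, None, 21]
sibsnew = topcode(sibs)
print(sibsnew)
```

Running frequencies on SIBSNEW afterwards should show no value above 8.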
a. Run frequencies of all variables you will use for the regression. Show that there are no “funny” values (such as -9 or 9 for 1-5 responses).
b. Prepare histograms of all variables involved, and show where they are in the printout. Are there any outliers? Are there any gaps in the data? Are any variables skewed? Is the dependent variable normally distributed? Very briefly comment.
c. Obtain the correlation matrix. Show where it is in the printout. Very briefly comment on it.
d. Fit the regression model to the data. State the estimated regression function. Briefly interpret b1 through b3.
e. Plot the residuals against Ŷ, X1, X2, and X3. Also prepare a normal probability plot. Interpret the plots and summarize your findings.
f. Test whether there is a regression relation at the .05 level. State the alternatives, decision rule, p-value, and conclusion.
g. Calculate or identify adjusted R2 (R squared). How does this compare to R2? From the formula, when is there a large difference between them? What’s the implication for researchers?
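For parts f and g, both the overall F statistic and adjusted R2 can be computed from R2, the sample size n, and the number of predictors p (here p = 3). The sketch below uses the standard textbook formulas; the function names and the example numbers are hypothetical and are not GSS results.

```python
def adjusted_r2(r2, n, p):
    """Adjusted R2 = 1 - (1 - R2) * (n - 1) / (n - p - 1)."""
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)


def f_statistic(r2, n, p):
    """Overall F = (R2 / p) / ((1 - R2) / (n - p - 1)), with p and n - p - 1 df."""
    return (r2 / p) / ((1 - r2) / (n - p - 1))


# Hypothetical values: the same R2 with a large and a very small sample.
r2, p = 0.10, 3
for n in (2000, 10):
    print(n, round(adjusted_r2(r2, n, p), 4), round(f_statistic(r2, n, p), 2))
```

The adjustment factor (n - 1) / (n - p - 1) is close to 1 when n is large relative to p, so the two measures nearly coincide; it grows when the sample is small or the model has many predictors, which is when adjusted R2 falls well below R2.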
Type of Deliverable: Word Document
