[1] A research analyst for an oil company wants to develop a model to predict miles per gallon based on
[1] A research analyst for an oil company wants to develop a model to predict miles per gallon based on highway speed. An experiment is designed in which a test car is driven at speeds ranging from 10 miles per hour to 75 miles per hour. The results are in the data set (SPEED.xls in the Doc Sharing).
- Set up a scatter diagram for speed and miles per gallon.
- Apply simple regression analysis, and then interpret the meaning of the slope b 1 in this problem.
- Interpret the meaning of the regression coefficient b 0 in this problem.
- Determine the coefficient of determination, r 2 , and interpret its meaning.
- How useful do you think this regression model is for predicting mileage?
[2] Refer to the data set given in [1]. Now assume a quadratic relationship between speed and mileage:
- State the quadratic regression equation.
- Predict the average mileage obtained when the car is driven at 55 miles per hour.
- Determine the coefficient of multiple determination, r 2 Y.12 .
- Determine the adjusted r 2 .
- Determine the adequacy of the fit of the model.
- At the 0.05 level of significance, determine whether the quadratic model is a better fit than the linear regression model.
[3] Each year ninth-grade students in Ohio must take a proficiency test. The data (SCHOOL.xls in the Doc Sharing) contains data from 47 school districts in Ohio from 1994 to 1995 school year. The variables in the data set are:
– School District: Name of school district
– Percentage Passing: Percentage of students passing the ninth-grade proficiency test
– Percentage Attendance: Daily average of the percentage of students attending class
– Salary: Average teachers salary (dollars)
– Spending: Instructional spending per pupil (dollars)
- Set up a scatter diagram using the percentage passing the proficiency test as the dependent variable and daily attendance as the independent variable. Discuss the scatter diagram.
- Assuming a linear relationship, find the regression coefficients, b 0 , b 1 , and its regression equation.
- Interpret the meaning of the slope b 1 in this problem.
- Find the standard error of the estimate.
- Determine the coefficient of determination, r 2 , and interpret its meaning.
- Compute the coefficient of correlation r and interpret its meaning.
- Perform a residual analysis on your results and determine the adequacy of the fit of the model.
- At the 0.05 level of significance, is there evidence of a linear relationship between the independent variable and the dependent variable?
- Set up a 95% confidence interval estimate of the population slope, \(\beta \) 1 and interpret its meaning.
- Repeat (a)--(i) using instructional spending as the independent variable.
- Which of the two models is best at predicting the percentage of students who will pass the ninth-grade proficiency test? Write a short summary of your finding.
[4] In Problem [3], simple linear regression models were constructed to investigate the relationships between passing rate of the Ohio ninth-grade proficiency exam and two different independent variables. Develop the most appropriate multiple regression model to predict a school's passing rate. Be sure to perform a thorough residual analysis. In addition, provide a detailed explanation of the results, including a comparison of the most appropriate multiple
Deliverable: Word Document
