[1] A real estate agency collects data concerning y= the sales price of a house (in thousands of dollars),


[1] A real estate agency collects data concerning y= the sales price of a house (in thousands of dollars), and x = the home size (in hundreds of square feet)

Home size (x) 23 11 20 17 15 21 24 13 19 25
Sales price (y) 180 98.1 173.1 136.5 141 165.9 193.5 127.8 163.5 172.5
  1. Find the value of the linear correlation coefficient r.
  2. Find the value of the coefficient of determination r 2 , and interpret the meaning for this problem.
  3. Is there a linear correlation between home size and sales price?  Test it at the 0.05 significance level.
  4. If there is a linear correlation, what is the regression equation?
  5. Interpret the meaning of the slope b 1 in this problem.
  6. Interpret the meaning of the y-intercept b 0 in this problem.  Will this make sense to the data set?
  7. Using (d), find the predicted sales price that has home size of 1200 ft 2 .
  8. Find the standard error of estimate S YX .
  9. Find the total variation SST.
  10. Using (b) and (i), find the explained variation SSR.

[2] U.S. banks have been merging to form mega banks that span many states.  The table given lists the number of U.S. bank mergers each year from 1980 (year 1) to 1993 (year 14) for which $50 million or more changed hands in the transaction.

Year (x) 1 2 3 4 5 6 7 8 9 10 11 12 13 14
# of Mergers (y) 4 17 19 45 25 37 44 35 27 31 21 38 45 49
  1. Assuming a linear relationship between x and y, find the regression coefficients b 0 and b 1 .
  2. Interpret the meaning of b 1 in this problem.
  3. Find the correlation coefficient r.
  4. At the 0.05 level of significance, is there a significant correlation between x and y?
  5. Compute the coefficient of determination r 2 and interpret its meaning for the data.
  6. Based on results from (c) to (e), how many mergers would be predicted in 1994 (year 15)?  Compare your answer to the actual number of 1994 mergers: 42.

[3] The following data represents the digital-mode talk time in hours and the battery capacity in milliampere-hours of cell phones.

Talk Time Battery Capacity Talk Time Battery Capacity
4.50 800 1.50 450
4.00 1500 2.25 900
3.00 1300 2.25 900
2.00 1550 3.25 600
2.75 900 2.25 700
1.75 875 2.25 800
1.75 750 2.50 800
2.25 1100 2.25 900
1.75 850 2.00 900
  1. Compute the correlation coefficient r.
  2. At the 0.05 level of significance, is there a significant correlation between the battery capacity and the digital-mode talk time?
  3. What conclusions can you reach about the relationship between the battery capacity and the digital-model talk time?
  4. You would expect the cell phones with higher battery capacity to have a higher talk time.  Is this borne out by the data?

[4] Each year ninth-grade students in Ohio must take a proficiency test.  The data (SCHOOL.xls in the CD) contains data from 47 school districts in Ohio from 1994 to 1995 school year.  The variables in the data set are:

– School District: Name of school district

– Percentage Passing: Percentage of students passing the ninth-grade proficiency test

– Percentage Attendance: Daily average of the percentage of students attending class

– Salary: Average teachers salary (dollars)

– Spending: Instructional spending per pupil (dollars)

  1. Set up a scatter diagram using the percentage passing the proficiency test as the dependent variable and daily attendance as the independent variable.  Discuss the scatter diagram.
  2. Assuming a linear relationship, find the regression coefficients, b 0 , b 1 , and its regression equation.
  3. Interpret the meaning of the slope b 1 in this problem.
  4. Find the standard error of the estimate.
  5. Determine the coefficient of determination, r 2 , and interpret its meaning.
  6. Compute the coefficient of correlation r and interpret its meaning.
  7. Perform a residual analysis on your results and determine the adequacy of the fit of the model.
  8. At the 0.05 level of significance, is there evidence of a linear relationship between the independent variable and the dependent variable?
  9. Set up a 95% confidence interval estimate of the population slope, b 1 and interpret its meaning.
  10. Repeat (a)–(i) using instructional spending as the independent variable.
  1. Set up a 95% confidence interval estimate of the population slope, b 1 and interpret its meaning.

k)       Which of the two models is best at predicting the percentage of students who will pass the ninth-grade proficiency test?  Write a short summary of your finding.

[5] In Problem [4], simple linear regression models were constructed to investigate the relationships between passing rate of the Ohio ninth-grade proficiency exam and two different independent variables.  Develop the most appropriate multiple regression model to predict a school's passing rate.  Be sure to perform a thorough residual analysis.  In addition, provide a detailed explanation of the results, including a comparison of the most appropriate multiple regression model to the best simple linear regression model.

[6] A collector of antique grandfather clocks believes that the price (in dollars) received for the clocks at an antique auction increases with the age of the clocks and with the number of bidders.  Thus the model is hypothesized is Y = b 0 + b 1 x 1 + b 2 x 2 + e, where y = auction price, x 1 = age of clock (years) and x 2 = number of bidders.

A sample of 32 auction prices of grandfather clocks, along with their ages and the number of bidders, is given in the data (AUCTION.xls).

  1. State the multiple regression equation.
  2. Interpret the meaning of the slopes b 1 and b 2 in the model.
  3. Interpret the meaning of the regression coefficient b 0 .
  4. Test H 0 : b 2 = 0 against H 1 : b 2 > 0.  Interpret your finding.
  5. Use a 95% confidence interval to estimate b 1 .  Interpret the p-value corresponding to the estimate b 1 .  Does the confidence interval support your interpretation in b)?
  6. Determine the coefficient of multiple determination r 2 Y.12 and interpret its meaning.
  7. Perform a residual analysis on your results and determine the adequacy of the fit of the model.
  8. Plot the residuals against the prices.  Is there evidence of a pattern in the residuals?  Explain.
  9. At a = 0.05, is there evidence of positive autocorrelation in the residuals?
  10. Suppose the collector, having observed many auctions, believes that the rate of increase of the auction price with age will be driven upward by a large number of bidders.  In other words, the collector believes that the age of clock and the number of bidders should interact.  Is there evidence to support his claim that the rate of change in the mean price of the clocks with age increases as the number of bidders increases?  Should the interaction term be included in the model?  If so, what is the multiple regression equation?
Price: $49.99
Solution: The downloadable solution consists of 27 pages, 2539 words and 14 charts.
Deliverable: Word Document


log in to your account

Don't have a membership account?
REGISTER

reset password

Back to
log in

sign up

Back to
log in