The data set marks.xls introduced in Assignment 2 contains data on performance of a group of students
Question 1
The data set marks.xls introduced in Assignment 2 contains data on performance of a group of students in a university course. We are now interested in investigating the
exam performance of these students.
- Assume the data set represents a random sample of all students enrolled in a particular course over a five year period. Estimate the mean \(\mu \) and standard deviation \(\sigma \) of exam results for the wider student population.
- Use the results from (a) to obtain a 90% confidence interval for the mean exam mark m the wider student population.
-
Justify the use of the confidence interval formula you used in part (b).
We are interested in whether Australian residents perform better on the exam than non-Australian residents. - Calculate separately; the sample mean exam marks for Australian residents and non-Australian residents.
- Calculate the corresponding standard deviations.
- Using the results from (d) and (e) test whether there is evidence that the mean exam mark for Australian residents is higher than the mean exam mark for non-Australian residents.
Question 2
Using the data set marks.xls we consider now the breakdown of students according to gender, country of residence and enrolling faculty.
- Produce a contingency table displaying frequencies of male students only broken down according to country of residence and enrolling faculty.
- Repeat part (a) for female students only.
-
Determine the proportion of
- female students in the group.
- male students from Australia enrolled through the business faculty.
- science faculty students who are female.
- Making appropriate assumptions (state them), determine a 95% confidence interval for the proportion of overseas resident student in the course population.
Question 3
- What is a lurking variable?
- How does the central limit theorem differ from the law of large numbers?
- Why do sample means usually differ from population means in simple random samples?
- Why do sample means usually differ from population means in voluntary response samples?
- What is the difference between a parameter and a statistic?
- Why is the Central Limit Theorem important in statistics?
- What is meant by the sampling distribution of a mean?
-
Telephone surveys can record very high non-response rates through people refusing to participate and people not being home. In about 30 words, identify- the major problem associated with the validity of the results of surveys of this type.
Question 4
A union official claimed that 40%of trucks carry too heavy a load. In random spot checks, the Main Roads Department found 17 of the 60 trucks tested were carrying too heavy a load. Does the sample result suggest that the union official's claim is exaggerated (too large)? Perform an appropriate hypothesis test. Calculate the p-value. Make an appropriate conclusion in the context of these data.
Question 5
Using hospital and agency records, six pairs of identical twins. one of whom was adopted at birth and the other of whom was in foster care for at least, two years. All the twins are now five -years old. All twins are tested with the Wechsler Intelligence Scale for Children (WISC) with the following results:Twin Pair Adopted Twin Foster-Care Twin 1 105 103 2 99 97 3 112 105 4 101 99 5 124 104 6 100 110 7 110 104 8 116 106
Is there evidence of a difference in the average test results of the adopted twin and the foster care twin?- Use a parametric test to answer this question.
- Use a nonparametric test to answer this question.
- Comment on any differences in the results of these tests.
Price: $27.24
Solution: The downloadable solution consists of 13 pages, 1424 words and 4 charts.
Deliverable: Word Document
Deliverable: Word Document
