Groups: Data project report could be handed in by individual students or by a group of no more than two
Groups:
Data project report could be handed in by individual students or by a group of no more than two students.
What does the Report Require?
A data set from the Current Population Survey March 2003 has been posted on Blackboard under the name CPS2003. The data set is in Excel. There are two random samples of full time workers, one of female workers and one of male workers. Each sample has 180 workers. In the report, each team will analyze the two samples using graphs and statistical measures. After the samples are analyzed, each team will compare the two samples based on comparisons of graphs and statistical measures. Hypothesis testing techniques will be used when comparing statistics from both samples. All graphs and results must be done in Excel but the final report should be done in Microsoft Word.
Each Report must include:
- A histogram of weekly earnings for men and for women. Based on the histograms, what type of distribution best describes weekly earnings for men and women.
- Estimate mean and standard deviation for hours of work per week, age, weekly earnings and education separately for each sample of men and women. Compare the statistics in both samples.
- Plot weekly earnings (y axis) and education (x axis). What is the relationship between weekly earnings and education for men? What is the relationship between weekly earnings and education for women? Do you see any difference in the relationships based on the graph?
- Calculate the correlation coefficient between weekly earnings and education for each sample, one for men and one for women. What is the difference in the correlation coefficient for men and women? Can you give any reasons for this difference?
- Plot weekly earnings (y axis) and age (x axis). What is the relationship between weekly earnings and age for men? What is the relationship between age and weekly earnings for women? Do you see any difference in the relationships based on the graph?
- Calculate the correlation coefficient between earnings and age for each sample, one for men and one for women. What is the difference in the correlation coefficient for men and women? Can you give any reasons for this difference?
- Sort the data set for male workers by education (First, you must block the whole data set to sort it including all variables and observations). Calculate weekly earnings of workers by education group (Less than High School, High School, Some College, College and above). Sort the data set for female workers by education. Calculate weekly earnings of workers by education group (Less than High School, High School, Some College, College and above). Estimate the difference between weekly earnings of males and females for workers with the same education level. (Calculate the difference in wages for male and female workers with less than high school, high school, some college, and college and above.) Comment on your findings.
- After conducting your analysis, write a conclusion on the differences in earnings between men and women. Are there statistically significant differences in earnings? Based on this data, why do you think these differences exist (or don’t exist)? What are possible explanations and how would you test this possible explanations in future research?
Deliverable: Word Document