Description
these are the questions I am needing help on
- The following information regarding a sample of seven students is provided. (20 points in total)
Student |
Identification number |
Grade Point Average |
Classification |
Gender |
Rank in Class |
Adam |
1234 |
2.89 |
Senior |
Male |
15 |
Brandon |
8978 |
2.01 |
Junior |
Male |
25 |
Jason |
6578 |
3.97 |
Freshman |
Male |
3 |
Marissa |
2345 |
3.98 |
Sophomore |
Female |
2 |
Michelle |
8901 |
2.67 |
Senior |
Female |
18 |
Wendy |
7789 |
4.00 |
Senior |
Female |
1 |
Webster |
6780 |
3.77 |
Freshman |
Male |
4 |
2.1 How many elements are there in the above dataset? And how many observations are there in this dataset? (5 points)
2.2 Which variables are categorical, and which are quantitative variables? (Exclude the first column Student as a variable.) (5 points)
2.3 What scale of measurement is used for each variable? (Exclude the first column Student as a variable.) (10 points)
- The sales record of a real estate company for the month of May shows the following ten house prices (rounded to the nearest $1,000). Values are in thousands of dollars. (30 points in total)
105 |
55 |
45 |
85 |
75 |
30 |
60 |
75 |
79 |
95 |
3.1 A frequency distribution for the dataset above is already given as the 1st and 2nd columns in the following table. Based on the frequency distribution, complete the missing/empty cells (highlighted in green) for the corresponding relative and percent frequency distributions for the house prices in the table. (12 points)
Sales Price |
Frequency |
Relative Frequency |
Percent Frequency |
20 – 39 |
1 |
% |
|
40 – 59 |
2 |
% |
|
60 – 79 |
4 |
% |
|
80 – 99 |
2 |
% |
|
100 – 119 |
1 |
% |
|
Total |
10 |
% |
3.2 Complete the missing/empty cells (highlighted in green) for the corresponding cumulative and cumulative percent frequency distributions in the following table. (10 points)
Sales Price |
Cumulative Frequency |
Cumulative Percent Frequency |
? 39 |
% |
|
? 59 |
% |
|
? 79 |
% |
|
? 99 |
% |
|
? 119 |
% |
3.3 According to the non-overlapping classes/bins in the table in Part 3.1, how many such classes are used in the table? And what is the class width for each class? (4 points)
3.4 According to the table in Part 3.2, what percentage of the houses sold at a price below $80,000? (2 points)
3.5 The histogram of the house price dataset is given on the right. According to the histogram, comment on the skewness of the distribution. Is the distribution skewed to the right, skewed to the left, or symmetric? (2 points)
- The following data represent the daily supply (y in thousands of units) and the unit price (x in dollars) for a product. (10 points in total)
Unit Price (x) |
Daily Supply (y) |
2 |
5 |
4 |
7 |
8 |
9 |
5 |
12 |
7 |
10 |
8 |
13 |
16 |
16 |
6 |
16 |
4.1 For the x variable Unit Price, its box plot can be shown as on the right. According to the box plot, what is the median value? What are the first quartile (Q1) and the third quartile (Q3) values? Is there any outlier? If so, what is/are the value(s) of the outlier(s)? (7 points)
4.2 The scatter plot for the two variables y and x is given on the right. According to the scatter plot, comment on the relationship between y and x. (3 points)