In this assignment, we will be continuing our work with the project questions you came up last time
This is where you will turn in the final product from Project Discussion 3
The final product that you will be turning in as a document in PDF format:
- In this assignment, we will be continuing our work with the project questions you came up last time. Your questions should be relatively
These questions are not set in stone but I don't want you to come up with all new questions unless I have given specific feedback in assignment 1 to modify or redo them. In this module we will be working with Categorical -> Categorical variable pairs (Explanatory -> Response). Many of you may have numeric explanatory or response variables. That is ok. What we are going to do is modify them slightly to create a binary (two level), categorical variable. We will split the numeric variable up by some numeric threshold that you feel is appropriate, maybe use the median or mean value. This is similar to how I created the "Under 25" variable in the example data. This allows us to learn something about our data using probabilities.
The final product that you will be turning into is as follows:
-
- Restate your three questions from assignment 2. Making an appropriate adjustments based on feedback.
|
Next, you are going to redo your tables (you can copy and paste the original one, making adjustments based on feedback) but this time you are going to turn frequencies (raw counts) and turn them into relative frequencies (the count divided by the total). This should result in a decimal value that you should round to three decimal places as shown in the example table below.
(S) |
0.159 |
0.233 |
= 0.392 |
Totals |
59/189 = 0.312 |
130/189 = 0.688 |
189/189 = 1.000 |
-
- The values in these tables are now probabilities. You are going to use these values to answer the questions/calculate the probabilities below:
- Probability of each level of the explanatory variable
- Probability of each level of the response variable
- The probability of each AND combination (where the explanatory and response variable meets). For example, using the first table I would have: P(S AND L), P(S AND not L), P(not S and L), P(not S and not L). In other words, multiply how many rows and columns you have (ignoring the totals). That is how many AND combinations you should have. Here I have 2 x 2
- The values in these tables are now probabilities. You are going to use these values to answer the questions/calculate the probabilities below:
= 4.
-
-
- Choose any two OR probability statements. Example: P(WB OR U)
- Choose any two conditional probability statements. Example: P(S | L)
- Make one hypothesis, using a combination of response variable probability and its matched conditional probability, about whether
-
Project Assignment (2) |
|||||||
Criteria |
Ratings |
Pts |
|||||
Quality of Content Was the product well thought out and appropriate for the project |
10 pts Full Marks |
8 pts Very Good |
6 pts Good |
4 pts Fair |
2 pts Poor |
0 pts No Marks |
10 pts |
On time |
5 pts Full Marks |
0 pts No Marks |
5 pts |
||||
Accuracy of Content Were the individual items correctly done |
5 pts Full Marks |
4 pts Very Good |
3 pts Good |
2 pts Fair |
1 pts Poor |
0 pts No Marks |
5 pts |
Completeness Are all parts of the content complete? |
5 pts Full Marks |
4 pts Very Good |
3 pts Good |
2 pts Fair |
1 pts Poor |
0 pts No Marks |
5 pts |
Total Points: 25 |
-
Rating:
5/
Solution: In this assignment, we will be continuing our work with the project questions you came up last time