CS628 Week 11 Assignment - Data Science

Question # 00848394 Posted By: wildcraft Updated on: 12/06/2023 11:32 PM Due on: 12/07/2023
Subject Computer Science Topic General Computer Science Tutorials:
Question
Dot Image

CS628 - Data Science

Project # 5: Visualization Assignment

Monroe College

For the following questions (1 to 3), work with the bank_marketing_training data set. Use Python.

Attached is the file. Start with the following code.

1. # import required package 2. import pandas as pd 3. 4. # read the csv bank_train data using the pandas package 5. bank_train = pd.read_csv("/Users/edeki/Desktop/Website Data Sets/bank_marketing_training")

1. Create a bar graph of the previous_outcome variable, with response overlay.

2. Create a normalized bar graph of previous_outcome variable with response overlay. Describe the relationship between previous_outcome and response.

3.Examine the non?normalized and normalized histograms of duration, with overlay of response.

Identify cutoff point(s) for duration, which separate low values of response from high values.

Define a new categorical variable, duration_binned, using the cutoff points you identified.

For the following questions (4 to 5), work with the adult_ch3_training data set

1. # read the csv adult_ch3_training data using the pandas package 2. adult_ch3_train = pd.read_csv("/Users/edeki/Desktop/Website Data Sets/adult_ch3_training")

4.Consider capital?loss. Identify the outliers in capital?loss using the Z?score method. How many outliers are there?

5.Construct a bar graph of Income for these outlier records.

Dot Image
Tutorials for this Question
  1. Tutorial # 00843856 Posted By: wildcraft Posted on: 12/06/2023 11:32 PM
    Puchased By: 2
    Tutorial Preview
    The solution of CS628 Week 11 Assignment - Data Science...
    Attachments
    CS628_Week_11_Assignment_-_Data_Science.ZIP (18.96 KB)

Great! We have found the solution of this question!

Whatsapp Lisa