Fill This Form To Receive Instant Help
Homework answers / question archive / You have to submit 2 files: Answer Report: In this, you need to submit all the answers to all the questions in a sequential manner
You have to submit 2 files:
Problem 1: Clustering
The dataset given is about the Health and economic conditions in different States of a
country. The Group States based on how similar their situation is, so as to provide these groups to the government so that appropriate measures can be taken to escalate their Health and Economic conditions.
Questions:
Data Dictionary for State_wise_Health_income:
person in a given area (city, region, country, etc.) in a specified year. It is calculated by dividing the area's total income by its total population.
Dataset for Problem 1: State_wise_Health_income.csv
Problem 2: CART-RF-ANN
Mortality Outcomes for Females Suffering Myocardial Infarction
The mifem data frame has 1295 rows and 10 columns. This is a Dataset of females having coronary heart disease (CHD). you have to predict with the given information whether the female is dead or alive so as to discover important factors that should be considered crucial in the treatment of the disease. Use CART, RF & ANN, and compare the models' performances in train and test sets.
2.5 Inference: Basis on these predictions, what are the insights and recommendations?
Dataset for Problem 2: mifem.csv
Data Dictionary for mifem.csv :
known
Criteria |
Ratings |
Pts |
This criterion is linked to a Learning Outcome1.1. Read the data and do exploratory data analysis. Describe the data briefly. (Check the null values, Data types, shape, EDA, etc, etc) |
This area will be used by the assessor to leave comments related to this criterion. |
5.0 pts |
This criterion is linked to a Learning Outcome1.2. Do you think scaling is necessary for clustering in this case? Justify |
This area will be used by the assessor to leave comments related to this criterion. |
5.0 pts |
This criterion is linked to a Learning Outcome1.3. Apply hierarchical clustering to scaled data. Identify the number of optimum clusters using Dendrogram and briefly describe them. |
This area will be used by the assessor to leave comments related to this criterion. |
7.5 pts |
This criterion is linked to a Learning Outcome1.4. Apply K-Means clustering on scaled data and determine optimum clusters. Apply elbow curve and find the silhouette score. |
This area will be used by the assessor to leave comments related to this criterion. |
7.5 pts |
This criterion is linked to a Learning Outcome1.5. Describe cluster profiles for the clusters defined. Recommend different priority based actions that need to be taken for different clusters on the bases of their vulnerability situations according to their Economic and Health Conditions. |
This area will be used by the assessor to leave comments related to this criterion. |
5.0 pts |
This criterion is linked to a Learning Outcome2.1. Data Ingestion: Read the dataset. Do the descriptive statistics and do null value condition check, write an inference on it. |
This area will be used by the assessor to leave comments related to this criterion. |
5.0 pts |
This criterion is linked to a Learning Outcome2.2. Encode the data (having string values) for Modelling. Data Split: Split the data into test and train, build classification model CART, Random Forest, Artificial Neural Network. |
This area will be used by the assessor to leave comments related to this criterion. |
7.5 pts |
This criterion is linked to a Learning Outcome2.3 Performance Metrics: Check the performance of Predictions on Train and Test sets using Accuracy, Confusion Matrix, Plot ROC curve, and get ROC_AUC score for each model. |
This area will be used by the assessor to leave comments related to this criterion. |
7.5 pts |
This criterion is linked to a Learning Outcome2.4 Final Model: Compare all the models and write an inference which model is best/optimized. |
This area will be used by the assessor to leave comments related to this criterion. |
5.0 pts |
This criterion is linked to a Learning Outcome2.5 Inference: Basis on these predictions, what are the insights and recommendations? |
This area will be used by the assessor to leave comments related to this criterion. |
5.0 pts |
|
Total Points: 60.0 |
|
|
Already member? Sign In