Fill This Form To Receive Instant Help

Help in Homework
trustpilot ratings
google ratings


Homework answers / question archive / 1) Business Report: In this, you need to submit all the answers to all the questions in a sequential manner

1) Business Report: In this, you need to submit all the answers to all the questions in a sequential manner

Statistics

1) Business Report: In this, you need to submit all the answers to all the questions in a sequential manner. Your answer should include detailed explanations & inferences to all the questions. Your report should not be filled with codes. You will be evaluated based on the business report.

Note: In the business report, three should be a proper interpretation of all the tasks performed to be eligible for full marks in each of the criteria mentioned in the rubric. Marks will be deducted wherever inferences are not clearly mentioned.

6 marks are allotted for the “Quality of Business Report”.

2) Jupyter Notebook file: This is a must and will be used for reference while evaluating.

Problem 1: Linear Regression

You are hired by a company Gem Stones co itd, which is a cubic zirconia manufacture. You provided with the dataset containing the prices and other attributes of almost 27,000 cubic zirconia (which is an inexpensive diamond alternative with many of the same qualities as a diamond). The company is earning different profits on different prize slots. You have to help the company in predicting the price for the stone on the bases of the details given in the dataset so it can distinguish between higher profitable stones and lower profitable stones so as to have better profit share. Also. Provide them with the best 5 attributes that are most important.

Data Dictionary:

Variable Name:

Description

Carat

Carat weight of the cubic zirconia.

Cut

Describe the cut quality of the cubic zirconia. Quality is increasing order Fair, Good, very Good, Premium, Ideal.

Color

Colour of the cubic zirconia. With D being the worst and J the best.

Clarity

cubic zirconia Clarity refers to the absence of the Inclusions and Blemishes. (In order from Worst to Best) IF, VVS1, VVS2, VS1, VS2, Sl1, Sl2, l1

Depth

The Height of cubic zirconia, measured from the Culet to the table, divided by its average Girdle Diameter.

Table

The Width of the cubic zirconia's Table expressed as a Percentage of its Average Diameter.

Price

the Price of the cubic zirconia.

X

Length of the cubic zirconia in mm.

Y

Width of the cubic zirconia in mm.

Z

Height of the cubic zirconia in mm.

     

Dataset for Problem 1: cubic_zirconia.csv

Problem 2: Logistic Regression and LDA

You are hired by a tour and travel agency which deals in selling holiday packages. You are provided details of 872 employees of a company. Among these employees, some opted for the package and some didn't. You have to help the company in predicting whether an employee will opt for the package or not on the basis of the information given in the data set. Also, find out the important factors on the basis of which the company will focus on particular employees to sell their packages.

Dataset for Problem 2: Holiday_Package.csv

Data Dictionary:

Variable Name

Description

Holiday_Package

Opted for Holiday Package yes/no?

Salary

Employee salary

Age

Age in years

Edu

Years of formal education

no_young_children

The number of young children (younger than 7 years)

no_older_children

Number of older children

Foreign

foreigner Yes/No

 

Important Note: Please reflect on all that you have learned while working on this project. This step is critical in cementing all your concepts and closing the loop.

Scoring guide (Rubric) - Predictive Modelling Project

                          Criteria

1.1. Read the data and do exploratory data analysis. Describe the data briefly. (Check the null values, Data types, shape, EDA, duplicate values). Perform Univariate and Bivariate Analysis.

1.2 Impute null values if present, also check for the values which are equal to zero. Do they have any meaning or do we need to change them or drop them? Check for the possibility of combining the sub levels of a ordinal variables and take actions accordingly. Explain why you are combining these sub levels with appropriate reasoning.

1.3 Encode the data (having string values) for Modelling. Split the data into train and test (70:30). Apply Linear regression using scikit learn. Perform checks for significant variables using appropriate method from statsmodel. Create multiple models and check the performance of Predictions on Train and Test sets using Rsquare, RMSE & Adj Rsquare. Compare these models and select the best one with appropriate reasoning.

1.4 Inference: Basis on these predictions, what are the business insights and recommendations.

Please explain and summarise the various steps performed in this project. There should be proper business interpretation and actionable insights present.

2.1 Data Ingestion: Read the dataset. Do the descriptive statistics and do null value condition check, write an inference on it. Perform Univariate and Bivariate Analysis. Do exploratory data analysis.

2.2 Do not scale the data. Encode the data (having string values) for Modelling. Data Split: Split the data into train and test (70:30). Apply Logistic Regression and LDA (linear discriminant analysis).

2.3 Performance Metrics: Check the performance of Predictions on Train and Test sets using Accuracy, Confusion Matrix, Plot ROC curve and get ROC_AUC score for each model Final Model: Compare Both the models and write inference which model is best/optimized.

2.4 Inference: Basis on these predictions, what are the insights and recommendations.

Please explain and summarise the various steps performed in this project.

There should be proper business interpretation and actionable insights present.

Quality of Business Report

Please reflect on all that you learnt

 

 

 

Purchase A New Answer

Custom new solution created by our subject matter experts

GET A QUOTE