Trusted by Students Everywhere
Why Choose Us?
0% AI Guarantee

Human-written only.

24/7 Support

Anytime, anywhere.

Plagiarism Free

100% Original.

Expert Tutors

Masters & PhDs.

100% Confidential

Your privacy matters.

On-Time Delivery

Never miss a deadline.

PleaseLoad the Ames housing dataset

Math Oct 08, 2020

PleaseLoad the Ames housing dataset.

download the attached AmesHousing dataset. Then implement the skills you have learned to fit, interpret and evaluate a regression model. Please show the following steps through RStudio.

  1.  
  2. Perform Exploratory Data Analysis and use descriptive statistics to describe the data.
  3. Prepare the dataset for modeling by imputing missing values with the variable's mean value or any other value that you prefer.
  4. Use the "cor()" function to produce a correlation matrix of the numeric values.
  5. Produce a plot of the correlation matrix, and explain how to interpret it. (hint - check the corrplot or ggcorrplot plot libraries)
  6. Make a scatter plot for the X continuous variable with the highest correlation with SalePrice. Do the same for the X variable that has the lowest correlation with SalePrice. Finally, make a scatter plot between X and SalePrice with the correlation closest to 0.5. Interpret the scatter plots and describe how the patterns differ.
  7. Using at least 3 continuous variables, fit a regression model in R.
  8. Report the model in equation form and interpret each coefficient of the model in the context of this problem.
  9. Use the "plot()" function to plot your regression model. Interpret the four graphs that are produced.
  10. Check your model for multicollinearity and report your findings. What steps would you take to correct multicollinearity if it exists?
  11. Check your model for outliers and report your findings. Should these observations be removed from the model?
  12. Check your model for outliers.
  13. Attempt to correct any issues that you have discovered in your model. Did your changes improve the model, why or why not?
  14. Use the all subsets regression method to identify the "best" model. State the preferred model in equation form.
  15. Compare the preferred model from step 13 with your model from step 12. How do they differ? Which model do you prefer and why?

Expert Solution

For detailed step-by-step solution, place custom order now.
Need this Answer?

This solution is not in the archive yet. Hire an expert to solve it for you.

Get a Quote
Secure Payment