Unit 4 Core Task 3: Show me
Canvas: U4: Core Task 3 — Show Me
Type: Core Task (1 pt, complete/incomplete)
Copilot: Allowed for syntax lookup; disallow for answer generation.
Background
The clean air act of 1970 was the beginning of the end for the use of asbestos in home building. By 1976, the U.S. Environmental Protection Agency (EPA) was given authority to restrict the use of asbestos in paint. Homes built during and before this period are known to have materials with asbestos You can read more about this ban.
The state of Colorado has a large portion of their residential dwelling data that is missing the year built and they would like you to build a predictive model that can classify if a house is built pre 1980.
Colorado gave you home sales data for the city of Denver from 2013 on which to train your model. They said all the column names should be descriptive enough for your modeling and that they would like you to use the latest machine learning methods.
Client Request
The Client is a state agency in Colorado that is responsible for the health and safety of its residents. They have a large portion of their residential dwelling data that is missing the year built and they would like you to build a predictive model that can classify if a house is built pre 1980.
Data
URL: dwellings_ml.csv (ml ready)
Optional URL: dwellings_neighborhoods_ml.csv (ml ready)
Informational URL: dwellings_denver.csv (not cleansed)
Information: Data description
Readings
A scan of these articles can be helpful, or find your own resources on the web.
Core Questions
Skills: model comparison, permutation importance
Repeat the classification exercise using 2 different algorithms/models. Explain the differences between the 3 models (your original one, and the 2 new ones for this task).
For each model (your original one, and the 2 new ones for this task), display the classification report and the permutation importances for the most important features. Discuss which model you would recommend to the client and why.
Submission / Deliverables:
Use this unit4_task3_template to create your Client Report. Answer the questions. Each answer should include a written description of your results, code cells with comments, charts and/or tables.
Your instructor will advise you — or it will be evident in Canvas — whether to submit a rendered .html file, or a link to the rendered file on GitHub Pages (gh-pages). Do not submit the URL to the GitHub .qmd file.
When you have completed the report and are ready to submit, render the project into HTML and publish it to GitHub Pages. Follow these steps:
- Have this assignment’s template/quarto file open in VS Code and nothing else
- Click the
Previewbutton in VS Code (top right of the screen)- This renders the project so you can review it
- Confirm everything displays as you would like it to
- How you see it is how it is viewed for grading
- If there is an error in any cell, the rendering stops and you will need to fix the error before rendering again (if you get stuck post your error in Slack)
- Once the report is confirmed, close the preview and open the
GitHub Desktopapplication - Confirm you are in the correct repository (top left corner)
- Confirm you are on the
Mainbranch (top left corner — never change offMain) - Type a summary of the changes in the
Summarybox - Click
Commit to main(blue button, bottom left) - Click
Push origin(blue button, middle right)- This pushes your changes to GitHub
- The
publish.ymlworkflow renders the project into HTML files - The HTML files are published to the
gh-pagesbranch - The URL of the published project is in the deployment section on GitHub
- In
GitHub Desktop, clickOpen in GitHubto navigate to the repository - Click the
Actionstab and confirm there were no errors in rendering - Open the
deploymentsection on the main repo page to find the URL - Navigate to the URL and confirm it displays as you intended
- Copy the URL and submit it in Canvas
- In