# 06 Decision Trees and Random Forests

In this block we cover:

- Decision Trees
    - The Classification and Regression Tree (CART) approach
    - Decision loss functions: ID3 vs Gini impurity
    - Pruning trees to reduce overfitting
    - Regression trees

- Random Forests
    - Ensembles of trees
    - Bagging features
    - Forests vs Boosted Decision Trees
    - Feature importance

## Lectures:

- Decision Trees and Random Forests:
    - 6.1.1 Decisions, Trees, Forests, (Part 1, Trees) (39:25)
    - 6.1.2 Decisions, Trees, Forests, (Part 2, Forests) (17:22)
    - Reference R code

## Worksheets:

## Workshop:

The workshop is split into two sections. The first is in R and **generates the data**, so ideally you should run it first. The second is in Python and compares its results to the R content. Note that the generated content is exported to the DST GitHub and the Python code grabs it from there, so it is also possible to run the sections out of order.

- 6.2.1 Workshop on Random Forests (R content) (10:52)
- 6.2.2 Workshop on Random Forests (Python content) (32:58)

