Data Analytics Tutorials in Forestry

Tutorial                    Order  Link
Data Wrangling              1      View
Descriptive Analytics       2      View
Geospatial Data             3      View
Lasso and Ridge Regression  4      View

Background

These tutorials make use of the publicly available Petawawa Research Forest (PRF) dataset... View Dataset

Using Tutorials

These tutorials are designed to be run on Google Colab...

Data Description

Individual Tree Measurements (trees.csv)

Individual tree measurements were taken at permanent sample plots (PSPs) across the PRF in 2018. The data dictionary below summarizes trees.csv. In this file, each row is a tree and each column is an attribute (e.g., height).

Column      Definition
PlotName    Unique plot identifier
TreeID      Unique tree identifier
species     Common species name of the tree
Origin      N = natural, P = planted
Status      L = live, D = dead (DecayClass 1 and 2)
DBH         Diameter at breast height (cm)
CrownClass  Crown class (D, C, I, OS, A, E)
DecayClass  Decay class (1, 2, >3)
height      Tree top height (m)
baha        Basal area per hectare
codom       Codominant (Y/N)
mvol        Gross merchantable volume (m³/ha)
tvol        Gross total volume (m³/ha)
biomass     Aboveground biomass (kg/ha)
size        Sawlog size category
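
As a rough illustration of how the data dictionary maps onto code, the sketch below reads rows shaped like trees.csv and summarizes live trees per plot. It uses only the Python standard library. The sample rows (plot names, species, and measurements) are invented for the example and are not values from the PRF dataset; the function name summarize_live_trees is also hypothetical.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample using a subset of the documented columns.
# All values below are invented for illustration only.
SAMPLE_CSV = """PlotName,TreeID,species,Status,DBH,height
PRF001,1,White pine,L,32.5,24.1
PRF001,2,Red pine,L,28.0,22.3
PRF001,3,White pine,D,15.2,10.0
PRF002,1,Sugar maple,L,20.4,18.7
"""


def summarize_live_trees(csv_file):
    """Return {PlotName: (live tree count, mean DBH in cm)}."""
    dbh_by_plot = defaultdict(list)
    for row in csv.DictReader(csv_file):
        # Status "L" marks live trees according to the data dictionary.
        if row["Status"] == "L":
            dbh_by_plot[row["PlotName"]].append(float(row["DBH"]))
    return {
        plot: (len(values), sum(values) / len(values))
        for plot, values in dbh_by_plot.items()
    }


summary = summarize_live_trees(io.StringIO(SAMPLE_CSV))
for plot, (count, mean_dbh) in sorted(summary.items()):
    print(plot, count, mean_dbh)
```

To run the same function on the real file, pass an open file handle instead, for example `open("trees.csv", newline="")`, assuming the file sits in the working directory.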

Plot Locations

File: plots.gpkg

Field plots in the PRF containing the trees in trees.csv are georeferenced, and their locations are provided in the plots.gpkg file. This is a spatial point dataset stored in a GeoPackage (.gpkg) file.

Each field plot is circular, with a radius of 14.1 m (an area of approximately 625 m²). Note that this dataset is in point format: it stores only the XY coordinates of the plot centers, not the plot outlines.
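
The stated plot area follows from the radius, which can be checked with a short calculation. The per-hectare expansion factor at the end is an added illustration (an assumption about how plot-level values might be scaled), not something the dataset description specifies.

```python
import math

PLOT_RADIUS_M = 14.1  # plot radius from the description above

# Area of a circle: pi * r^2
plot_area_m2 = math.pi * PLOT_RADIUS_M ** 2

# One hectare is 10,000 square meters.
plot_area_ha = plot_area_m2 / 10_000

# Assumption for illustration: the number of plots of this size
# that fit in one hectare, often used to scale per-plot tallies
# to per-hectare values.
expansion_factor = 1 / plot_area_ha

print(f"Plot area: {plot_area_m2:.1f} m^2")
print(f"Expansion factor: {expansion_factor:.2f} plots per hectare")
```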

Column    Definition
Plot      Same as "PlotName" in trees.csv
Date      Date of plot visit
Northing  Y coordinate
Easting   X coordinate
Source    Device used to collect coordinates
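
Because Plot in plots.gpkg matches PlotName in trees.csv, tree records can be linked to plot coordinates on that key. The sketch below shows the idea with plain dictionaries. The records, coordinate values, and the function name attach_plot_coordinates are hypothetical; reading the actual GeoPackage would need a geospatial library, which is not shown here.

```python
# Hypothetical records shaped like the two tables; all values are invented.
plots = [
    {"Plot": "PRF001", "Easting": 314000.0, "Northing": 5095000.0},
    {"Plot": "PRF002", "Easting": 314500.0, "Northing": 5095400.0},
]
trees = [
    {"PlotName": "PRF001", "TreeID": "1", "DBH": 32.5},
    {"PlotName": "PRF001", "TreeID": "2", "DBH": 28.0},
    {"PlotName": "PRF003", "TreeID": "1", "DBH": 19.8},
]


def attach_plot_coordinates(trees, plots):
    """Join tree rows to plot coordinates on PlotName == Plot.

    Returns (joined, unmatched), where unmatched holds tree rows
    whose plot has no entry in the plot table.
    """
    coords = {p["Plot"]: (p["Easting"], p["Northing"]) for p in plots}
    joined, unmatched = [], []
    for tree in trees:
        xy = coords.get(tree["PlotName"])
        if xy is None:
            unmatched.append(tree)
        else:
            joined.append({**tree, "Easting": xy[0], "Northing": xy[1]})
    return joined, unmatched


joined, unmatched = attach_plot_coordinates(trees, plots)
print(len(joined), "trees matched;", len(unmatched), "unmatched")
```

Keeping the unmatched rows separate makes key mismatches (for example, differently formatted plot names) visible instead of silently dropping trees.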

PRF Boundary

File: boundary.gpkg

Polygon dataset of the PRF boundary, in the WGS 84 / UTM zone 18N coordinate reference system.
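
WGS 84 / UTM zone 18N corresponds to EPSG:32618. The sketch below shows how a UTM zone and its EPSG code follow from a longitude, using the standard 6-degree zone layout (the special zones around Norway and Svalbard are ignored). The longitude used for the PRF is an approximate value assumed for illustration, and both function names are hypothetical.

```python
import math


def utm_zone_from_longitude(lon_deg):
    """Return the standard UTM zone number for a longitude in [-180, 180)."""
    return int(math.floor((lon_deg + 180) / 6)) + 1


def wgs84_utm_epsg(zone, northern=True):
    """Return the EPSG code for WGS 84 / UTM in the given zone and hemisphere."""
    return (32600 if northern else 32700) + zone


# Assumption: an approximate longitude for the PRF area, for illustration only.
APPROX_PRF_LONGITUDE = -77.4

zone = utm_zone_from_longitude(APPROX_PRF_LONGITUDE)
epsg = wgs84_utm_epsg(zone, northern=True)
print(f"UTM zone {zone}N -> EPSG:{epsg}")
```

Knowing the EPSG code helps when checking that the boundary, plots, and rasters share a coordinate reference system before overlaying them.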

PRF Water

File: water.gpkg

Water bodies (lakes, rivers, etc.) in polygon format.

LiDAR-derived 99th Height Percentile (P99)

File: p99.tif

Raster of the LiDAR-derived 99th height percentile (P99), used as a proxy for forest canopy height.
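
To make the idea of a 99th height percentile concrete, the sketch below computes one from a list of heights using the nearest-rank method. This is only an illustration of the statistic: the method actually used to produce p99.tif is not described here, and the heights are invented values standing in for the LiDAR heights within one raster cell.

```python
import math


def percentile_nearest_rank(values, pct):
    """Return the pct-th percentile of values using the nearest-rank method."""
    if not values:
        raise ValueError("values must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in (0, 100]")
    ordered = sorted(values)
    # Nearest rank: the smallest rank covering at least pct percent of values.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


# Hypothetical heights in meters (0.25, 0.50, ..., 25.00) for one cell.
heights = [i * 0.25 for i in range(1, 101)]

p99 = percentile_nearest_rank(heights, 99)
print("P99:", p99, "m; max:", max(heights), "m")
```

A high percentile such as P99 is less sensitive to a single stray high return than the maximum, which is one reason it is commonly used as a canopy height proxy.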

Sentinel-2 Imagery

Files:

Multispectral imagery with 12 bands, all resampled to 10 m resolution.

Band  Wavelength (S2A / S2B)   Description
B1    443.9 nm / 442.3 nm      Aerosols
B2    496.6 nm / 492.1 nm      Blue
B3    560 nm / 559 nm          Green
B4    664.5 nm / 665 nm        Red
B5    703.9 nm / 703.8 nm      Red Edge 1
B6    740.2 nm / 739.1 nm      Red Edge 2
B7    782.5 nm / 779.7 nm      Red Edge 3
B8    835.1 nm / 833 nm        NIR
B8A   864.8 nm / 864 nm        Red Edge 4
B9    945 nm / 943.2 nm        Water vapor
B11   1613.7 nm / 1610.4 nm    SWIR 1
B12   2202.4 nm / 2185.7 nm    SWIR 2
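
One common use of the band table is combining bands into a spectral index. As an example that is not part of the dataset description, the sketch below computes the Normalized Difference Vegetation Index (NDVI) from the NIR band (B8) and the red band (B4). The reflectance values are hypothetical and the function operates on single pixel values; applying it to the actual imagery would require a raster library.

```python
def ndvi(nir, red):
    """Return NDVI = (NIR - Red) / (NIR + Red), or NaN if the sum is zero."""
    denominator = nir + red
    if denominator == 0:
        return float("nan")
    return (nir - red) / denominator


# Hypothetical surface reflectance values for one pixel.
b8_nir = 0.45  # B8, NIR
b4_red = 0.05  # B4, Red

pixel_ndvi = ndvi(b8_nir, b4_red)
print(f"NDVI: {pixel_ndvi:.2f}")
```

NDVI ranges from -1 to 1, with dense green vegetation typically giving higher values than bare ground or water.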

Processed using this script: Earth Engine Script

More info: Dataset Overview

LiDAR Point Cloud Subset

Files:

LiDAR point cloud with XYZ coordinates, stored in LAS format. The coverage of the subset is defined by a footprint stored in a GeoPackage file.

Suggested Use of AI

AI tools are helpful, but caution is needed...

Useful References