Using Machine Learning to Identify and Map Controls of Growing-Season Carbon Dioxide and Methane Fluxes in the Mackenzie Delta Region

June Skeeter
UBC Geography Department
Presentation in defense of my PhD thesis

Territorial Acknowledgement

This work was conducted in Indigenous territories.

  • The study area was in the Inuvialuit Settlement Region
  • I have been living in unceded Coast Salish territory through my PhD

Overview

Chapter 1: Introduction

Chapter 2: Vegetation Influence and Environmental Controls on Greenhouse Gas Fluxes from a Drained Thermokarst Lake in the Western Canadian Arctic

Chapter 3: Controls on Carbon Dioxide and Methane Fluxes from a Low-Center Polygonal Peatland in the Mackenzie River Delta

Chapter 4: Modeling Interannual Variability of Carbon Fluxes at a Low-Center Polygon Ecosystem in the Mackenzie River Delta

Chapter 5: Conclusions

Chapter 1: Framing the Problem

Arctic amplification is accelerating climate change in the northern high latitudes, leading to:

  • Permafrost degradation, extending growing seasons, etc.
    • Changing ecosystem carbon (C) balances
    • Influencing the rate of warming globally

Chapter 1: Limited Observations

Eddy covariance (EC) is a method to measure carbon dioxide (CO2) and methane (CH4) fluxes.

  • Sparse coverage in the Arctic
    • Bias towards accessible sites
    • Canadian Arctic is under-represented

Chapter 1: Tundra Ecosystems

Harsh conditions stunt vegetation.

  • Summers are brief but productive
  • Winters are long, dark, and frigid

Reindeer grazing at Illisarvik

Chapter 1: Tundra Ecosystems

Permafrost inhibits decomposition;
tundra soils are often C rich.

  • Active layer thaws seasonally
    • Allows root growth and microbial activity
  • Water tables are often elevated
    • Suppressed respiration
    • CH4 production

Chapter 1: Tundra Carbon Fluxes

NEE of CO2 is the primary component of the C balance.

  • GPP: Gross Primary Productivity
    • Photosynthetic uptake of CO2
  • ER: Ecosystem Respiration
    • Autotrophic Respiration (RA)
    • Heterotrophic Respiration (RH)
  • NEE = ER - GPP
    • -NEE = uptake
    • +NEE = emission

Chapter 1: Tundra Carbon Fluxes

NME of CH4 is an important secondary component in tundra wetlands.

  • Methanogenesis
    • CH4 production in anoxic soils
  • Methanotropy
    • CH4 consumption in aerobic soils
    • CH4 converted to CO2
  • NME = Methanogenesis - Methanotropy
    • -NME = uptake
    • +NME = emission

Chapter 1: Machine Learning

Algorithms that build predictive models from experience.

  • Neural Network (NN) are universal approximators
    • Often treated as "black box models"
    • Single layer with H "hidden" nodes and M independent inputs:
    • $f(X,w)= \sum_{h=1}^Hβ_hg(\sum_{m=0}^M\gamma_{hm}x_m)$

  • Gap filling is essential for calculating C budgets from EC observations
    • Flux partitioning is the standard approach for NEE, no standard for NME
      • Not practical during polar summer
    • NN methods offer a flexible alternative

Chapter 1: Study Area

Two ecosystems in the Mackenzie Delta Region we studied.

  • Arctic's 2nd largest river delta
    • No prior studies of NEE
    • Kohnert et al. (2018) measured summer NME with aircraft
      • [31.8 - 58.4 mg] CH4 m−2 d−1 north of the tree line

Chapter 1: Study Area

Two ecosystems in the Mackenzie Delta Region we studied.

  1. Illisarvik in 2016
    • Drained Thermokarst Lake Basin (DTLB)
    • Tuktoyaktuk Costal Plain
      • Just outside the Delta
  2. Fish Island in 2017
    • Low Center Polygon (LCP) tundra
    • Big Lake Delta Plain

Chapter 1: Research Objectives

Advance our understanding of the growing season C exchange in the Mackenzie Delta Region.

  1. Measure CO2 and CH4 fluxes and use gap-filling to estimate NEE and NME at Illisarvik and Fish Island
  2.  
  3. Use NN models to study controls of ecosystem scale NEE and NME
    1. Identify flux drivers and map functional relationships
    2. Investigate the influence of spatial heterogeneity
     
  4. Conduct a temporal upscaling experiment to study the possible influence of climate variability on growing season NEE and NME at Fish Island

Chapter 2: Data & Methods

EC system measured peak growing season CO2 and CH4 fluxes.

  • July 8th to August 7th, 2016
    • CSAT3, LI-7500, and LI-7700
    • Collocated climate sensors
    • Two satellite soil stations
  • Fluxes calculated in EddyPro
    • Post processing:
      • QC filtering
      • u* filtering
      • Spike removal

EC station at Illisarvik

Fish eye view of EC system

Chapter 2: Data & Methods

Kljun et al. (2015) flux footprints intersected with vegetation map.

ClassDominant SpeciesLandscape FractionMedian Fclim [min - max]
ShrubSalix spp. & Alnus viridis48%36%[0–79]
SedgeCarex aquatilis & Arctophila fulva29%39%[1–78]
GrassPocacea spp. & Eriophorum angustifolium12%11%[0–56]
SparseSparse veg. / bare ground8%2%[0–34]
WaterHippuris vulgaris3%0%[0–4]
UplandSalix spp. & Betula nana0%6%[0–15]

Chapter 2: Data & Methods

Neural Network (NN) models were used to:

  1. Identify dominant drivers of CO2 and CH4 fluxes:
    • Iteratively trained models, adding one new input at a time
      • Climate & soil observation
      • Source area fractions
  2. Gap-fill CO2 and CH4 fluxes to estimate NEE and NME

Chapter 2: Results

Conditions over the peak growing season in 2016.

  • NEE
    • -1.56 CI95% ±0.17 μmol m-2s-1
  • NME
    • 8.67 CI95% ±0.43 nmol m-2s-1
    • Decreasing trend

Chapter 2: Flux Drivers

NEE had four key drivers:
PPFD, VPD, VWC, and FShrub

  • Responds to PPFD as expected
    • Highlights influence of VPD
    • Not a true "light response curve"
      • GPP is not isolated
  • VWC influences ER
    • Night (PPFD = 0) vs. day (PPFD = 625)
    • Depends on source area

Chapter 2: Flux Drivers

NME had five key drivers:
FSedge, VWC, Ts, FShrub & U

  • NME responds to VWC as expected
    • Highlights influence of source area
      • Projection to 100% FSedge
      • Supported by chamber results
  • Inverse association with TS
    • Increased methanotrophy + drying
    • Artifact of short sampling period?

Chapter 2: Discussion

Long-term observations are needed to better contextualize results.

  1. Compared to young DTLB (< 50 yrs) studied in Alaska:
    • Greater shrub cover and higher CO2 uptake
    • Drier soil conditions and lower CH4 emission
     
  2. Illisarvik will continue to evolve, following one of two trajectories:
    • Sedge dominated > NME will increase significantly
    • Shrub dominated > NME will remain low

Chapter 3: Data & Methods

Similar Setup as Illisarvik.

  • June 23rd to Sept. 13th, 2017
    • CSAT3, LI-7200, and LI-7700
    • Collocated climate sensors
    • Three satellite soil stations
  • Fluxes calculated in EddyPro
    • Post processing:
      • QC filtering
      • u* filtering
      • Spike removal
Fish Island EC Station July 2017

Chapter 3: Data & Methods

Kljun et al. (2015) flux footprints intersected with landscape classification.

ClassDominant SpeciesLandscape FractionMedian Fclim [min - max]
Polygon CentersSphagnum spp., Equisetum spp., & Carex spp.66%63% [33–78]
Polygon RimsSalix spp.29%22% [5–56]
Polygon TroughsCarex spp. & Eriophorum angustifolium5%3% [0–9]

Chapter 3: Data & Methods

Similar approach to Chapter 2, but a more robust NN was used to identify flux drivers. Weights Method (Gevery et al. 2003):

  1. Start by training over-parametrized models
    • 21 climate, soil, and source area inputs
  2. Calculate Relative Importance (RI) from the sums the squared partial derivatives
    • Pruned inputs with RI < 2.5%
    • Limited to one pruning iteration for expedience
  3. Retrain model and gap-fill CO2 and CH4 fluxes to estimate NEE and NME
    • Derivatives were inspected to ensure response functions were plausible

Chapter 3: Results

Conditions over the 2017 growing season.

  • Snowmelt: June 1st
  • Vegetation and temperatures peaked in mid-July
  • Above average precipitation
  • Senescence in late August

Chapter 3: Results

Net growing season C sink.

  • NEE
    • -0.60 CI95% ±0.04 μmol m-2s-1
  • NME
    • 27.7 CI95% ±0.35 nmol m-2s-1
  • Data gaps
    • Power supply
    • Low signal strength (NME)
  • Diel cycles
    • CO2 and CH4 were negatively correlated (r2 = 0.40)

Chapter 3: Flux Drivers

Net Ecosystem Exchange

FactorRISign
1Photon Flux Density (PPFD)64%-
2Vapor Pressure Deficit (VPD)8%-
3Polygon Center Temp. 5cm (TCnt5)7%+
4Thaw Depth (TD)7%-
5Day/Night (Daytime)6%-
6Polygon Rim Temp. 5cm (TRim5)5%-
7Polygon Rim Temp. 15cm (TRim15)3%+
8Wind (U)1%-

Flux response functions

Chapter 3: Flux Drivers

Net Methane Exchange

FactorRISign
1Net Radiation (Rn)34%+
2Friction Velocity (u*)20%+
3Wind (U)17%-
4Thaw Depth (TD)12%-
5Polygon Center Temp. 15cm (TCnt15)6%-
6Water Table (WTD)5%-
7Rim Fraction (FRim)4%+
8Center Fraction (FCnt)2%-

Flux response functions

Chapter 3: Discussion

Long-term observations are needed.

  1. NEE was more negative than LCP ecosystems in Alaska and Siberia
    • Growing season CO2 sink
  2. NME had a minor impact on net C uptake (-49.5 g C m-2)
    • Matched aircraft fluxes Kohnert et al. (2018)
  3. Spatial heterogeneity
    • Location bias had minor influence over NME
      • NN projections suggest landscape scale NME was ~1 nmol m-2 s-1 lower

Chapter 4: Objectives

Chapter 3 fills a knowledge gap, but more context is needed.

  • Automated weather station (AWS)
    • 11 years of RN, Ta, U, and Rainfall
  • Reanalysis and satellite data supplements observations

Fish Island Landscape Classification

ClassCoverage
Low Center Polygons53%
Shrub Tundra42%
Open Water5%

Chapter 4: Temporal Upscaling

A Flux Driver Time Series (FDTS) was created spanning 2009-2019 snow-free seasons.

  • OLS regression models were trained to estimate 2017 drivers
    • AWS and ECMWF reanalysis data were inputs
  • Estimated NEE and NME using NN models described in Chapter 3

Chapter 4: Results

Substantial inter-annual variability.

Chapter 4: Results

It is not clear whether Fish Island is a net carbon sink.

  • Generally growing season CO2 sink
    • Significant inter-annual variability
    • Shoulder season emissions may offset growing season uptake

Chapter 4: Interannual Variability

Spearman Correlation Analysis

FactorNEENMENet C
Snowmelt Date--+0.65--
Flood Date--+0.79--
Max NDVI-0.71---0.73
Snowfall-0.57----
Snow-free Season Length+0.64----
Mean July Ta+0.68--+0.64

Chapter 4: Discussion

Temporal upscaling indicated significant interannual variability in growing season C exchange.

  1. Climate warming could reduce the CO2 sink strength
    • Highlights need for long-term monitoring
     
  2. Compared to two similar LCP ecosystems with long-term records:
    • Weaker CO2 sink and stronger CH4 source than LCP ecosystem near Utqiagvik, AK.
    • Stronger CO2 sink and CH4 source than LCP ecosystem at Samoylov Island, RU.

Chapter 5: Summary of Findings

IllisarvikFish Island
NEEGreater uptake than DTLB studied elsewhereMixed finding
NMELower emission than DTLB studied elsewhereHigher emission than LCP ecosystems studied elsewhere
Spatial HeterogeneitySubstantial variability, diverse vegetation communitiesSmall scale variability, repeating features ≈ heterogeneous at larger scale
Long-term OutlookDepends on trajectory (shrub/sedge)Potentially reduced C uptake with climate warming

Chapter 5: Methodological Contributions

Neural Networks are not "black-box" models when applied carefully.

  1. The weights method helps us understand what the models are doing
    • Ability to map response functions is a key advantage over other common methods (i.e.random forests)
    • To date, had not been applied to EC data
     
  2. Can be paired with footprint modelling to account for spatial heterogeneity
  3.  
  4. Meticulous application is the key to success
    • Bootstrapping, cross validation and early stopping

Chapter 5: Future Research

  1. Long-term flux data needed:
    • Interannual variability can have a significant impact on net fluxes
     
  2. Cold season dynamics:
    • Extended senescent periods could offset growing season uptake
     
  3. Methane fluxes:
    • Pressure pumping
     
  4. Network architecture
    • Multi-layer models to simulate GPP and ER separately
    • Time-lagged models to account for CH4 dynamics

Chapter 5: Final Reflections

If our priority is just extracting information, the research only serves to perpetuate colonialism (Liboiron 2021).

  • Future work in the Arctic should focus on long-term flux monitoring
    • Short, resource intensive campaigns are not sustainable
  • Prioritize involvement of local stakeholders
    • Technical training, employment, & education
    • Community support = longer-term viability

Thank you! Questions?

I would like to thank:

  • My supervisors and committee members
  • Friends and family
  • Funding agencies
    • NSERC
    • CFI

Appendix

Extra content that could not be discussed in the time allotted.

Machine Learning Methods

Random Forests (RF):

  • Ensembles of bootstrapped regression trees
    • Easy to train and implement
    • Resistant to over fitting
  • Limitations
    • Incapable of extrapolation
    • No interpretable response functions

Chapter 2

  • NEE: good agreement with FCO2 observations (r2 = 0.91)
    • PPFD was the primary driver of GPP
      • Modulated by VPD
    • VWC influenced ER
    • FShrub had weak influence
  • NME: reasonable agreement with FCH4 observations (r2 = 0.62)
    • FSedge was had strongest influence
      • Along with FShrub (strong correlation)
    • VWC and Ts drove fluxes
    • U had a weak influence
      • Possibly influenced transport

Chapter 3

Impact of sensor location bias at Fish Island.

  • Projecting over full range of source area fractions shows how the model maps these relationships
  • Troughs were not explicitly selected, but were an important feature

Chapter 3

Estimating ER by projecting to "nighttime" conditions:

  • The NN outperformed two common partitioning methods
  • NNr2 = 0.58RMSE = 0.23 μmol m-2 s-1
    Q10r2 = 0.32RMSE = 0.28 μmol m-2 s-1
    Logisticr2 = 0.45RMSE = 0.24 μmol m-2 s-1
  • Mean ER estimated to be:
    • 1.54 CI95% ± 0.87 μmol m-2 s-1
  • Limited chamber samples (2 days) suggest rim/center difference

Chapter 4

Wide range of possible conditions.