landsat-lai
Employing a data-driven approach to generate Leaf Area Index (LAI) maps from Landsat images over CONUS
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 19 DOI reference(s) in README -
✓Academic publication links
Links to: sciencedirect.com -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.1%) to scientific vocabulary
Repository
Employing a data-driven approach to generate Leaf Area Index (LAI) maps from Landsat images over CONUS
Basic Info
Statistics
- Stars: 71
- Watchers: 3
- Forks: 31
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
Landsat LAI project
This repository provides code and data to generate Leaf Area Index (LAI) images at 30-meter resolutions from Landsat data using Google Earth Engine. Using machine learning and a comprehensive training set derived from MODIS LAI/Fpar products (MCD15A2H v006), Landsat surface reflectance surface reflectance, and National Land Cover Dataset (NLCD), this approach allows fast generation of LAI maps for any Landsat 5, 7, 8, 9 image using Google Earth Engine. The Landsat LAI estimation is highly consistent with the state-of-the-art MODIS product, enabling downstream applications that operate across scales.
Journal Article:
Kang, Y., Ozdogan, M., Gao, F., Anderson, M.C., White, W.A., Yang, Yun, Yang, Yang, Erickson, T.A., 2021. A data-driven approach to estimate leaf area index for Landsat images over the contiguous US. Remote Sens. Environ. 258, 112383. doi:10.1016/j.rse.2021.112383
Training Dataset:
Kang, Yanghui; Ozdogan, Mutlu; Gao, Feng; Anderson, Martha C.; White, William A.; Yang, Yun; Yang, Yang; Erickson, Tyler A.. (2021). A dataset of spatiotemporally sampled MODIS Leaf Area Index with corresponding Landsat surface reflectance over the contiguous US. Ag Data Commons. doi:10.15482/USDA.ADC/1521097.
Updates
Jan 2025: the code is updated to use Landsat Collection 2 Level 2 surface reflectance data for Landsat 5, 6, 8, 9.
Methodology
The methodology has two general steps: 1) sample generation; 2) model implementation. In the first step, we extracted millions of samples from the MODIS LAI, Landsat surface reflectance, and NLCD data from 2006 to 2018 within CONUS. The samples were stratified by Landsat sensor (Landsat 5 TM, Landsat 7 ETM, and Landsat 8 OLI), biome types (8 biomes), and saturation status from the MODIS LAI algorithm. Each sample has a MODIS LAI value, corresponding surface reflectance values from Landsat (aggregated within the MODIS pixel), a biome type from the NLCD, and other auxiliary information (like geographic location and solar illumination angles). To avoid scale effects, we use only pixels that are spatially homogeneous, which means that the MODIS pixel area (500-m) has a single biome or land cover type from NLCD and the Landsat surface reflectance (30-m) variation is very small. There are additional outlier removal and balancing processes to ensure sample quality.
In the second step, we train seperate random forest (RF) models for each biome type and each Landsat sensor. This allows us to overcome the memory limit of Earth Engine and use more samples to ensure accuracy. The RF models are trained to predict LAI using surface reflectance and other information from the Landsat images. The features include surface reflectance from red, green, nir, and swir1 bands, Normalized Difference Vegetation Index (NDVI), Normalized Difference Water Index (NDWI), solar zenith angle, solar azimuth angle, and geographic cooridantes. When running on Google Earth Engine, the code will first load the training dataset stored as an EE asset, train random forest models, and then apply to every pixel of the Landsat surface reflectance image.

Validation
We validated this approach in 19 NEON sites and 8 independent study sites. The overall RMSE across the 19 NEON sites is 0.8, and R2 is 0.88. The uncertainty in LAI varies by biome and site. The RMSE ranges between 0.5 and 0.9 for the 8 independent sites. Below is the result from the NEON sites. Please be cautious that both satellite estimated and ground measured LAI are always associated with random and bias errors.

Training data
The training set with 1.6 million training samples is available in the "data" folder, along with the convex hull data used to generate the the QA flags.
A full version of the dataset with more auxilliary information is available at Ag Data Commons.
Code
Python and javascript scripts to export LAI maps for chosen Landsat scene (path/row) over certain time periods. Please find the scripts in the "scripts" folder. The python script is good for batch export of LAI images to a EE asset directory.
Output format
The output image contains two bands: a "LAI" band in signed int16 scaled by 100 and a "QA" band with quality flags of input and output range.
QA is coded in a byte-size band using the least significant 3 bits
Bit 0 - Input
0: Input within range
1: Input out-of-range
Bit 1 - Output (LAI)
0: LAI within range (0-8)
1: LAI out-of-range
Bit 2 - Biome
0: Vegetation (from NLCD scheme)
1: Non-vegetation (from NLCD scheme)
Aknowledgement
This approach was built upon previous works by Gao et al. (2012) and Gao et al. (2014), who proposed the original method to train machine learning models for Landsat LAI estimation based on homogeneous MODIS LAI pixels.
The LAI estimation GEE code was written with the help from Dr. Charles Morton (website).
This research was partly supported by the OpenET project. OpenET is dedicated to provide easily accessible high-resolution evapotranspiration (ET) data for improved water management.
The authors sincerely thank Dr. Yijing Liu for sharing their code for adapting the original algorithm to Landsat Collection 2.
_ | Citation ------------ | ------------- Kang et al. (2021) | Kang, Y., Ozdogan, M., Gao, F., Anderson, M.C., White, W.A., Yang, Yun, Yang, Yang, Erickson, T.A., 2021. A data-driven approach to estimate leaf area index for Landsat images over the contiguous US. Remote Sens. Environ. 258, 112383. doi:10.1016/j.rse.2021.112383 Gao et al. (2012) | Gao, F., Anderson, M.C., Kustas, W.P., Wang, Y., 2012. Simple method for retrieving leaf area index from Landsat using MODIS leaf area index products as reference. J. Appl. Remote Sens. 6, 063554. doi:10.1117/1.JRS.6.063554 Gao et al. (2014)| Gao, F., Anderson, M.C., Kustas, W.P., Houborg, R., 2014. Retrieving Leaf Area Index From Landsat Using MODIS LAI Products and Field Measurements. IEEE Geosci. Remote Sens. Lett. 11, 773777. doi:10.1109/LGRS.2013.2278782
Owner
- Name: yhkang
- Login: yanghuikang
- Kind: user
- Company: UC Berkeley
- Repositories: 3
- Profile: https://github.com/yanghuikang
GitHub Events
Total
- Release event: 2
- Watch event: 11
- Delete event: 1
- Push event: 6
- Pull request event: 4
- Fork event: 1
- Create event: 2
Last Year
- Release event: 2
- Watch event: 11
- Delete event: 1
- Push event: 6
- Pull request event: 4
- Fork event: 1
- Create event: 2
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 0
- Total pull requests: 2
- Average time to close issues: N/A
- Average time to close pull requests: about 8 hours
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 2
- Average time to close issues: N/A
- Average time to close pull requests: about 8 hours
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- yanghuikang (2)