Research Data--KGS
Archived
This content is available here strictly for research, reference, and/or recordkeeping and as such it may not be fully accessible. If you work or study at University of Kentucky and would like to request an accessible version, please use the SensusAccess Document Converter.
Researcher ORCID Identifier
Matthew Massey: https://orcid.org/0000-0003-3316-9550
Abdullah-Al-Zubaer Imran: https://orcid.org/0000-0001-5215-339X
Files
Dataset Creation Date
5/1/2025
Release Date
5-2025
Publisher
University of Kentucky Libraries
Description
EarthScape v1.1 Dataset Download Links:
esv1p1_smokeset.zip (197 MB)
esv1p1_splits.zip (2.5 MB)
esv1p1_usa_ky_data.zip (1 MB)
esv1p1_usa_ky_hardin_howevalley.zip (26.3 GB)
esv1p1_usa_ky_hardin_sonora.zip (26.3 GB)
esv1p1_usa_ky_warren.zip (166 GB)
Surficial geologic mapping is essential for understanding Earth surface processes and supporting applications in engineering, hazard assessment, and resource management. However, traditional mapping approaches are labor-intensive, limiting spatial coverage and introducing subjectivity. To address these challenges, we introduce EarthScape, a multimodal, AI-ready geospatial dataset designed for surficial geologic mapping and Earth surface analysis. EarthScape integrates high-resolution aerial RGB and near-infrared (NIR) imagery, digital elevation models (DEM), multi-scale terrain features derived from DEMs, and hydrologic and infrastructure vector data into a unified, co-registered framework. The dataset provides segmentation masks and labels for seven surficial geologic classes representing diverse geomorphic processes. It is structured as a multilabel, multi-scale dataset with inherent class imbalance and spatial complexity, reflecting real-world mapping conditions.
EarthScape is designed to support both multilabel classification and semantic segmentation tasks, and includes geographically disjoint regions with a shared label space to enable controlled evaluation of domain shift. As a living dataset, it is intended to support ongoing development of multimodal learning methods and robust geospatial models. Additional details are available in the current manuscript and GitHub repository for additional details.Users are strongly encouraged to use the GitHub repository, which provides the primary interface for accessing the dataset, reproducing benchmarks, and running experiments.
Digital Object Identifier (DOI)
https://doi.org/10.13023/kgs.data.05.01.2025
Rights
© 2025 University of Kentucky. This dataset is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided that the dataset creators and source are credited and that changes (if any) are clearly indicated.
Supporting Information
The reader is referred to the manuscript and GitHub repository for additional details.
File Format
ZIP, GeoTIFF, CSV, GeoJSON, TEXT
Version
1.1
Archival?
Archival
Language
English
Funding Information
Kentucky Geological Survey
Related Content
Massey, Matthew, and Abdullah-Al-Zubaer Imran, 2025, EarthScape: A Multimodal Dataset for Surficial Geologic Mapping and Earth Surface Analysis, arXiv preprint, https://doi.org/10.48550/arXiv.2503.15625
Recommended Citation
Massey, M.A., and Imran, A., 2025, EarthScape AI Dataset [ver. 1.1, 2026-04]: Kentucky Geological Survey, ser. 14, research data, https://doi.org/10.13023/kgs.data.05.01.2025

Notes
Point of Contact: Kentucky Geological Survey
Originator: Matthew Massey
Publisher: Kentucky Geological Survey
Distributor: UKnowledge