Research Data--KGS

Archived

This content is available here strictly for research, reference, and/or recordkeeping and as such it may not be fully accessible. If you work or study at University of Kentucky and would like to request an accessible version, please use the SensusAccess Document Converter.

EarthScape AI Dataset v1.1

Researcher ORCID Identifier

Matthew Massey: https://orcid.org/0000-0003-3316-9550

Abdullah-Al-Zubaer Imran: https://orcid.org/0000-0001-5215-339X

Files

Download

Download (13 KB)

Download Data Dictionary v1.1 (11 KB)

Download README v1.1 (13 KB)

Download 256SUMS (736 B)

Dataset Creation Date

5/1/2025

Release Date

5-2025

Publisher

University of Kentucky Libraries

Description

EarthScape v1.1 Dataset Download Links:

esv1p1_smokeset.zip (197 MB)

esv1p1_splits.zip (2.5 MB)

esv1p1_usa_ky_data.zip (1 MB)

esv1p1_usa_ky_hardin_howevalley.zip (26.3 GB)

esv1p1_usa_ky_hardin_sonora.zip (26.3 GB)

esv1p1_usa_ky_warren.zip (166 GB)

Surficial geologic mapping is essential for understanding Earth surface processes and supporting applications in engineering, hazard assessment, and resource management. However, traditional mapping approaches are labor-intensive, limiting spatial coverage and introducing subjectivity. To address these challenges, we introduce EarthScape, a multimodal, AI-ready geospatial dataset designed for surficial geologic mapping and Earth surface analysis. EarthScape integrates high-resolution aerial RGB and near-infrared (NIR) imagery, digital elevation models (DEM), multi-scale terrain features derived from DEMs, and hydrologic and infrastructure vector data into a unified, co-registered framework. The dataset provides segmentation masks and labels for seven surficial geologic classes representing diverse geomorphic processes. It is structured as a multilabel, multi-scale dataset with inherent class imbalance and spatial complexity, reflecting real-world mapping conditions.

EarthScape is designed to support both multilabel classification and semantic segmentation tasks, and includes geographically disjoint regions with a shared label space to enable controlled evaluation of domain shift. As a living dataset, it is intended to support ongoing development of multimodal learning methods and robust geospatial models. Additional details are available in the current manuscript and GitHub repository for additional details.Users are strongly encouraged to use the GitHub repository, which provides the primary interface for accessing the dataset, reproducing benchmarks, and running experiments.

Digital Object Identifier (DOI)

https://doi.org/10.13023/kgs.data.05.01.2025

Rights

© 2025 University of Kentucky. This dataset is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided that the dataset creators and source are credited and that changes (if any) are clearly indicated.

Supporting Information

The reader is referred to the manuscript and GitHub repository for additional details.

File Format

ZIP, GeoTIFF, CSV, GeoJSON, TEXT

Version

1.1

Archival?

Archival

Language

English

Funding Information

Kentucky Geological Survey

Related Content

Massey, Matthew, and Abdullah-Al-Zubaer Imran, 2025, EarthScape: A Multimodal Dataset for Surficial Geologic Mapping and Earth Surface Analysis, arXiv preprint, https://doi.org/10.48550/arXiv.2503.15625

Notes

Point of Contact: Kentucky Geological Survey

Originator: Matthew Massey

Publisher: Kentucky Geological Survey

Distributor: UKnowledge

EarthScape AI Dataset v1.1

Included in

Geology Commons

Share

COinS