Research Data--KGS

The Kentucky Microbiome Dataset

Researcher ORCID Identifier

D. Curl (0000-0002-7290-1030)

R. Washburn (0000-0002-7489-5092)

Files

Download

Download (119 KB)

Download README (6 KB)

Download Data Dictionary (18 KB)

Dataset Creation Date

5/1/2025

Release Date

7-11-2025

Publisher

University of Kentucky Libraries

Description

The Kentucky Microbiome Dataset is a continuously updated geolocated dataset which describes the distribution and diversity of microorganisms across the Commonwealth of Kentucky. Each geolocated point represents a location reported in the peer-reviewed literature where microorganisms were sampled from soil, water, geologic substrates, or clinical settings. Environmental studies include only non-host-associated, non-clinical microorganisms sampled from natural environments. Clinical studies are mapped separately to highlight locations where investigations into host-associated or human health-related microbes have occurred.

For each location, associated metadata include study information, taxonomic classifications, sample type, sampling method, latitude and longitude, sampling date, publication reference, and a direct link to the source publication. The data also incorporates Kentucky Geological Survey (KGS) research on microbial diversity in geologic environments.

Additionally, the data contains processed geolocated microbial surveillance data from the Kentucky Watershed Watch program, indicating sites where Escherichia coli (E. coli) has been sampled from streams, rivers, and other fluvial systems. A geometric mean for E. coli from each site was calculated and the results for each site are provided in this dataset.

The Kentucky Microbiome Dataset serves as a versatile dataset to apply to a range of applications, including pathogen surveillance, microbial ecology studies, preliminary environmental assessments, emerging pathogen prediction, bioprospecting, bioremediation targeting, epidemiological investigations, and public health monitoring. This dataset will be updated regularly as new studies and sampling results become available.

Digital Object Identifier (DOI)

https://doi.org/10.13023/kgs.data.07.11.2025

Rights

© 2025 University of Kentucky. This dataset is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided that the dataset creators and source are credited and that changes (if any) are clearly indicated.

Supporting Information

Each geolocated point represents a location reported in the peer-reviewed literature where microorganisms were sampled from soil, water, geologic substrates, or clinical settings. Environmental studies include only non-host-associated, non-clinical microorganisms sampled from natural environments. Clinical studies are mapped separately to highlight locations where investigations into host-associated or human health-related microbes have occurred.

File Format

.xlsx

File Size

120 KB

Version

1

Spatial Coverage

Kentucky

Temporal Coverage

Watershed Watch of Kentucky E. Coli data: 1998 - 2023

Language

English

Funding Information

Kentucky Geological Survey

Notes

  • Point of Contact: Kentucky Geological Survey

  • Originator: Rachel Washburn

  • Publisher: Kentucky Geological Survey

  • Distributor: UKnowledge

The Kentucky Microbiome Dataset

Included in

Geology Commons

Share

COinS