Treffer: Reproducibility Starts at the Source: R, Python, and Julia Packages for Retrieving USGS Hydrologic Data

Title:
Reproducibility Starts at the Source: R, Python, and Julia Packages for Retrieving USGS Hydrologic Data
Contributors:
MDPI AG
Source:
Civil and Environmental Engineering Faculty Publications
Publisher Information:
Hosted by Utah State University Libraries
Publication Year:
2023
Collection:
Utah State University: DigitalCommons@USU
Document Type:
Fachzeitschrift text
File Description:
application/pdf
Language:
unknown
DOI:
10.3390/w15244236
Rights:
Copyright for this work is held by the author. Transmission or reproduction of materials protected by copyright beyond that allowed by fair use requires the written permission of the copyright owners. Works not in the public domain cannot be commercially exploited without permission of the copyright owner. Responsibility for any use rests exclusively with the user. For more information contact the Institutional Repository Librarian at digitalcommons@usu.edu. ; http://creativecommons.org/licenses/by/4.0/
Accession Number:
edsbas.F81D09BC
Database:
BASE

Weitere Informationen

Much of modern science takes place in a computational environment, and, increasingly, that environment is programmed using R, Python, or Julia. Furthermore, most scientific data now live on the cloud, so the first step in many workflows is to query a cloud database and load the response into a computational environment for further analysis. Thus, tools that facilitate programmatic data retrieval represent a critical component in reproducible scientific workflows. Earth science is no different in this regard. To fulfill that basic need, we developed R, Python, and Julia packages providing programmatic access to the U.S. Geological Survey’s National Water Information System database and the multi-agency Water Quality Portal. Together, these packages create a common interface for retrieving hydrologic data in the Jupyter ecosystem, which is widely used in water research, operations, and teaching. Source code, documentation, and tutorials for the packages are available on GitHub. Users can go there to learn, raise issues, or contribute improvements within a single platform, which helps foster better engagement and collaboration between data providers and their users.