Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6962134 | Environmental Modelling & Software | 2018 | 8 Pages |
Abstract
HydroShare (https://www.hydroshare.org) is an online collaborative system to support the open sharing of hydrologic data, analytical tools, and computer models. Hydrologic data and models are often large, extending to multi-gigabyte or terabyte scale, and as a result, the scalability of centralized data management poses challenges for a system such as HydroShare. A distributed data management framework that enables distributed physical data storage and management in multiple locations thus becomes a necessity. We use the iRODS (Integrated Rule-Oriented Data System) data grid middleware as the distributed data storage and management back end in HydroShare. iRODS provides a unified virtual file system for distributed physical storages in multiple locations and enables data federation across geographically dispersed institutions around the world. In this paper, we describe the iRODS-based distributed data management approaches implemented in HydroShare to provide a practical demonstration of a production system for supporting big data in the environmental sciences.
Related Topics
Physical Sciences and Engineering
Computer Science
Software
Authors
Hong Yi, Ray Idaszak, Michael Stealey, Chris Calloway, Alva L. Couch, David G. Tarboton,