GSA Connects 2024 Meeting in Anaheim, California

Paper No. 53-3
Presentation Time: 2:05 PM

UTILIZING ONTOLOGY STRUCTURES TO CURATE THE DOE-NETL CARBON STORAGE OPEN DATABASE


MARTIN, Abigail1, MORKNER, Paige2, SABBATINO, Michael1, BAUER, Jennifer2 and ROSE, Kelly2, (1)Department of Energy, Support Contractor for the National Energy Technology Laboratory, 1450 SW Queen Ave, Albany, OR 97321, (2)US Department of Energy, National Energy Technology Laboratory, 1450 Queen Ave SW, Albany, OR 97321

Increased interest in geologic carbon sequestration to aid in meeting global climate change mitigation goals, has subsequently increased the demand for high-quality, geologic carbon storage data. To support ongoing risk modeling, site evaluations, and field data curation, the U.S. Department of Energy’s National Energy Technology Laboratory (NETL) continues to assess and optimize current carbon storage data curation practices. In the past 6 years, over 15 terabytes of data resources have been aggregated, cataloged, and published as part of NETL's Carbon Storage Open Database (accessible on NETL's Energy Data eXchange (EDX)).

With the large volume of publicly available data continuing to grow, it has become necessary to efficiently integrate, curate, and display data by enhancing existing workflows and developing new organizational capabilities. To support the scale of this data curation challenge, there has been an effort to create a hierarchical taxonomy for geologic concepts with rules and relations specific to carbon storage data, a structure typically referred to as an ontology. Through the development of ontological structures compatible with the Carbon Storage Open Database, it will be possible to translate the topical, temporal, and spatial metadata in a comprehensive manner compatible with programmatic and machine learning use cases.

Ensuring that an ontology structure maintains flexibility for future additions, supports interconnectivity with other specialized ontologies, and can reconcile the evolution of geologic lexicon without the risk of being reductive or exclusionary through simplification is critical. To balance the needs for flexibility, interconnectivity, and maintenance of dataset-specific lexicon, both a top-down and bottom-up approach to building an ontology are being explored. The specialized ontology for the Carbon Storage Open Database will enable more rapid assignment of appropriate symbology standards for visualization improvements, optimize topical and spatial tagging within keywords, and improve flexibility for utilization in existing data repositories such as EDX. This effort also aims to establish a foundation for utilization of ontologies for organization of other data related to geologic carbon storage in the future.