Geoinformatics 2007 Conference (17–18 May 2007)

Paper No. 6
Presentation Time: 9:30 AM

QUAKEML - XML FOR A SEISMOLOGICAL DATA EXCHANGE INFRASTUCTURE


SCHORLEMMER, Danijel, Earth Sciences, University of Southern California, 3651 Trousdale Parkway, Los Angeles, CA 90089 and EUCHNER, Fabian, Swiss Seismological Service, ETH Zurich, Schafmattstrasse 30, ETH Hoenggerberg, Zurich, 8093, Switzerland, ds@usc.edu

We report on the progress of the development of QuakeML. QuakeML is a flexible, extensible and modular XML representation of seismological data which is intended to cover a broad range of fields of application in modern seismology. QuakeML is an open standard and is developed by a distributed team in a transparent collaborative manner. The first part of the standard, QuakeML - Basic Event Description, will be subjected to a Request for Comments process. The standardization process for inventory information and resource metadata is also under way. The flexible approach of QuakeML allows further extensions of the standard in order to represent waveform data, macroseismic information, location probability density functions, moment tensors, slip distributions, shake maps, and others.

QuakeML is developed in parallel with a UML representation of its data model. This allows an elaborate software development strategy which uses the UML class model together with a custom UML profile as the basis for automated code generation. With this technique, a library of C++ classes is generated which can be serialized either to XML (QuakeML) or to SQL for persistent storage in a relational database. The XML Schema description is created automatically from the UML model with the help of tagged values, which describe the mapping from UML class attributes to XML representation. The library approach makes it easy for application developers to include QuakeML support in their products, since no own source code has to be written. Serialization of objects to and from QuakeML format will be supported by the API. It is possible to use the QuakeML library from other object-oriented programming languages, e.g., Java and Python, using wrappers.

The QuakeML language defintion is supplemented by a concept to provide resource metadata and facilitate metadata exchange between distributed data providers. For that purpose, we propose a URI-based format for unique, location-independent identifiers of seismological resources which are assigned by approved naming authorities. QuakeML - Resource Metadata defines a RDF vocabulary for resource metadata description, covering the resource's identity, curation, content, temporal availability, data quality, and associated services. We propose to set up a network of registry institutions which offer web services for resolving resource identifiers into corresponding RDF/XML metadata descriptions, and additionally provide means for resource discovery by offering services for searches against resource metadata.

Currently, the QuakeML development team is bringing together people from USC, ETH Zurich, GFZ Potsdam, USGS, and IRIS. QuakeML will be used in the NERIES framework in Europe, by the SeisComp3 software, by the EMSC, by the SCEC Collaboratory for the Study of Earthquake Predictability, and is under consideration by ANSS. We are confident that-in combination with further standardization efforts-the concept of QuakeML can contribute to facilitate data exchange and interoperability of seismological data providers.