MIXTURE-MODEL CLUSTERING OF REGIONAL GEOCHEMICAL DATA
The clustering procedure is evaluated with soil geochemical data from a survey of the state of Colorado (United States of America). The data comprise 959 samples with 31 element concentrations for each sample. The chosen mixture model has 4 density functions, and the calculated conditional probabilities partition the 959 samples into 4 clusters. For each cluster, most samples are spatially close together and thus are related to specific geologic features such as surficial deposits or bedrock. The independently-known geochemical properties of these geologic features are consistent with the random sample concentrations, and the order statistics for the random sample concentrations are almost identical to the corresponding order statistics for the field data (i.e., the measured concentrations for those samples with high conditional probabilities). Both results suggest that the clustering procedure is accurate. Another benefit of mixture-model clustering is that the element concentrations for each cluster are approximately statistically stationary, making them suitable for additional statistical processing such as multivariate kriging.