FACTORS AFFECTING PRINCIPAL COMPONENT ANALYSIS (PCA) OF X-RAY ABSORPTION FINE STRUCTURE SPECTRAL DATASETS OF ARSENIC AND IRON COMPOUNDS
Performing PCA on XAFS datasets prior to evaluation by linear combination least-squares fitting (LSF) provides: a model-independent way to view the variance in a spectral dataset, constraints on the number of unique spectra needed for LSF, and a quantitative process for selecting the appropriate model spectra to be used in LSF. Previous studies have highlighted key limitations or considerations for PCA, but the effect on PCA of factors such as element, number of data points, type of data (near-edge vs. extended XAFS), spectral noise, and low abundance species have not been described. Investigations into the magnitude of these effects are needed to establish the appropriate level of confidence to have in PCA on real datasets.
To date, we find that the correct number of components is identified in PCA of all 3-component test data sets. However, if 2 or more species in a dataset do not vary in relative abundance, the number of significant components identified by the procedure can be artificially low. The 3 significant components identified in XANES and EXAFS model compound data sets described an average of 99% of the total set variance, but the relative variance accounted for by each component varied considerably. In XANES data sets, components 1, 2 and 3 described an average of 90%, 6%, and 3% of set variance, respectively. For EXAFS, components 1, 2 and 3 described an average of 66%, 24%, and 9% of total set variance, respectively. Preliminary PCA on As and Fe EXAFS data sets collected on samples from the Empire Mine HSP indicate 2-3 components (species) for As and 3-4 for iron which cumulatively account for 69-89% of the set variance. Fe compounds identified as reasonable species for LSF of Fe EXAFS are ferrihydrite, lepidocrocite, nontronite, hematite, goethite, and Fe-smectite. Arsenopyrite, arsenian pyrite, As(V) sorbed on goethite, and As(V)sorbed on ferrihydrite are identified as reasonable species to use in LSF of As EXAFS.