LEVERAGING LARGE LANGUAGE MODELS FOR SEDIMENTARY ENVIRONMENTAL INTERPRETATION: A NEW APPROACH TO ADDRESSING THE NON-UNIQUENESS PROBLEM IN PALEOGEOGRAPHY

Li, Haipeng

Paper No. 267-6

Presentation Time: 2:50 PM


LI, Haipeng, Department of Paleogeography, Deep-time Digital Earth Research Center of Excellence (Suzhou), Kunshan, Jiangsu 215347, China; WANG, Luoqi, School of the Earth Sciences, Zhejiang University, Hangzhou, Zhejiang 310027, China; YANG, Jie, China University of Geosciences (Beijing), Beijing 100190, China; and GUO, Yao, Institute of Sedimentary Geology, Chengdu University of Technology, Chengdu, Sichuan 610059, China

Interpreting the sedimentary rock record is fundamental to reconstructing paleogeography. However, such interpretations are inherently non-unique: the same set of observations can often be reconciled with more than one depositional environment. The exponential growth in scholarly publications compounds the challenge. Traditional expert systems offer a potential solution, but they demand substantial contributions from specialists across many sub-disciplines, which makes them labor-intensive and slow to incorporate newly published studies.

In light of these challenges, we propose a novel approach that uses Large Language Models (LLMs), such as GPT-4, together with LangChain to process and interpret the large corpus of publications. By integrating these technologies and fine-tuning them with domain-specific text, we have developed a system that returns multiple probabilistically weighted paleoenvironmental interpretations, each substantiated with specific references.
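
The abstract does not describe how the weights are computed, so the following is only a minimal illustrative sketch of what "multiple probabilistically weighted interpretations, each substantiated with specific references" could look like as a data structure. It assumes that some upstream step (for example, LLM-based retrieval over the literature) has already produced evidence records; the `Evidence` class, the `rank_interpretations` function, the scoring scheme, and the placeholder reference strings are all hypothetical and are not taken from the authors' system.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    """One literature-derived vote for an environment (hypothetical structure)."""
    environment: str  # candidate depositional environment, e.g. "tidal flat"
    reference: str    # citation string for the supporting publication
    score: float      # non-negative relevance score (assumed to come from retrieval)


def rank_interpretations(evidence):
    """Return (environment, weight, references) tuples, highest weight first.

    Each weight is the environment's summed score divided by the grand total,
    so the weights sum to 1 across all candidate environments.
    """
    totals = defaultdict(float)
    refs = defaultdict(list)
    for item in evidence:
        if item.score < 0:
            raise ValueError("scores must be non-negative")
        totals[item.environment] += item.score
        # Keep each supporting reference once per environment.
        if item.reference not in refs[item.environment]:
            refs[item.environment].append(item.reference)

    grand_total = sum(totals.values())
    if grand_total == 0:
        return []  # no usable evidence, so no interpretation is offered

    ranked = [(env, totals[env] / grand_total, refs[env]) for env in totals]
    ranked.sort(key=lambda row: row[1], reverse=True)
    return ranked


if __name__ == "__main__":
    # Placeholder inputs only; these are not real publications or results.
    demo = [
        Evidence("tidal flat", "placeholder ref A", 2.0),
        Evidence("tidal flat", "placeholder ref B", 1.0),
        Evidence("lagoon", "placeholder ref C", 1.0),
    ]
    for env, weight, sources in rank_interpretations(demo):
        print(f"{env}: {weight:.2f} ({'; '.join(sources)})")
```

Simple score normalization is used here only because it keeps every candidate interpretation visible alongside its sources instead of collapsing to a single answer; a real system might calibrate the weights quite differently.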

Our preliminary results suggest that this integrative approach has significant potential to aid the interpretation of sedimentary environments and to address the non-uniqueness problem by drawing on the knowledge embedded in the published literature. The methodology could pave the way for more accurate and comprehensive interpretations of the rock record, thereby improving our ability to reconstruct paleogeography. We invite the broader paleoenvironmental research community to collaborate in refining and applying this approach.

Session No. 267

T172. Time Machine: Earth’s Deep-Time Geography—Data, Reconstructions, Challenges

Wednesday, 18 October 2023: 1:30 PM-5:30 PM

320 (David L Lawrence Convention Center)

Geological Society of America Abstracts with Programs. Vol. 55, No. 6
doi: 10.1130/abs/2023AM-390624

© Copyright 2023 The Geological Society of America (GSA), all rights reserved. Permission is hereby granted to the author(s) of this abstract to reproduce and distribute it freely, for noncommercial purposes. Permission is hereby granted to any individual scientist to download a single copy of this electronic file and reproduce up to 20 paper copies for noncommercial purposes advancing science and education, including classroom use, providing all reproductions include the complete content shown here, including the author information. All other forms of reproduction and/or transmittal are prohibited without written permission from GSA Copyright Permissions.


GSA Connects 2023 Meeting in Pittsburgh, Pennsylvania