The Accuracy, Robustness, and Readability of LLM-Generated Sustainability-Related Word Definitions

dc.contributor.authorHeiman, Alice
dc.contributor.editorBasile, Valerio
dc.contributor.editorBosco, Cristina
dc.contributor.editorGrasso, Francesca
dc.contributor.editorIbrahim, Muhammad Okky
dc.contributor.editorSkeppstedt, Maria
dc.contributor.editorStede, Manfred
dc.coverage.spatialTallinn, Estonia
dc.date.accessioned2025-02-17T12:09:25Z
dc.date.available2025-02-17T12:09:25Z
dc.date.issued2025-03
dc.description.abstractA common language with shared standard definitions is essential for effective climate conversations. However, there is concern that LLMs may misrepresent and/or diversify climate-related terms. We compare 305 official IPCC glossary definitions with those generated by OpenAI's GPT-4o-mini and investigate their adherence, robustness, and readability using a combination of SBERT sentence embeddings and statistical measures. The LLM definitions received average adherence and robustness scores of $0.58 \pm 0.15$ and $0.96 \pm 0.02$, respectively. Both sustainability-related terminologies remain challenging to read, with model-generated definitions varying mainly among words with multiple or ambiguous definitions. Thus, the results highlight the potential of LLMs to support environmental discourse while emphasizing the need to align model outputs with established terminology for clarity and consistency.
dc.identifier.isbn978-9908-53-114-4
dc.identifier.urihttps://hdl.handle.net/10062/107188
dc.language.isoen
dc.publisherUniversity of Tartu Library
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleThe Accuracy, Robustness, and Readability of LLM-Generated Sustainability-Related Word Definitions
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
2025_nlp4ecology_1_21.pdf
Suurus:
302.18 KB
Formaat:
Adobe Portable Document Format