Large Language Models as Annotators of Named Entities in Climate Change and Biodiversity: A Preliminary Study

Volkanovska, Elena

Large Language Models as Annotators of Named Entities in Climate Change and Biodiversity: A Preliminary Study

Files

2025_nlp4ecology_1_7.pdf (262.18 KB)

Date

2025-03

Authors

Volkanovska, Elena

Publisher

University of Tartu Library

Abstract

This paper examines whether few-shot techniques for Named Entity Recognition (NER) utilising existing large language models (LLMs) as their backbone can be used to reliably annotate named entities (NEs) in scientific texts on climate change and biodiversity. A series of experiments aim to assess whether LLMs can be integrated into an end-to-end pipeline that could generate token- or sentence-level NE annotations; the former being an ideal-case scenario that allows for seamless integration of existing with new token-level features in a single annotation pipeline. Experiments are run on four LLMs, two NER datasets, two input and output data formats, and ten and nine prompt versions per dataset. The results show that few-shot methods are far from being a silver bullet for NER in highly specialised domains, although improvement in LLM performance is observed for some prompt designs and some NE classes. Few-shot methods would find better use in a human-in-the-loop scenario, where an LLM's output is verified by a domain expert.

URI

https://hdl.handle.net/10062/107179

Collections

Proceedings of the 1st Workshop on Ecology, Environment, and Natural Language Processing (NLP4Ecology2025)

Full item page

Large Language Models as Annotators of Named Entities in Climate Change and Biodiversity: A Preliminary Study

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections