Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection

Phelps, Dylan; Pickard, Thomas; Mi, Maggie; Gow-Smith, Edward; Villavicencio, Aline

Computer Science > Computation and Language

arXiv:2405.09279 (cs)

[Submitted on 15 May 2024]

Title:Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection

Authors:Dylan Phelps, Thomas Pickard, Maggie Mi, Edward Gow-Smith, Aline Villavicencio

View PDF HTML (experimental)

Abstract:Despite the recent ubiquity of large language models and their high zero-shot prompted performance across a wide range of tasks, it is still not known how well they perform on tasks which require processing of potentially idiomatic language. In particular, how well do such models perform in comparison to encoder-only models fine-tuned specifically for idiomaticity tasks? In this work, we attempt to answer this question by looking at the performance of a range of LLMs (both local and software-as-a-service models) on three idiomaticity datasets: SemEval 2022 Task 2a, FLUTE, and MAGPIE. Overall, we find that whilst these models do give competitive performance, they do not match the results of fine-tuned task-specific models, even at the largest scales (e.g. for GPT-4). Nevertheless, we do see consistent performance improvements across model scale. Additionally, we investigate prompting approaches to improve performance, and discuss the practicalities of using LLMs for these tasks.

Comments:	Presented at the MWE-UD Workshop at LREC-COLING 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.09279 [cs.CL]
	(or arXiv:2405.09279v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.09279

Submission history

From: Dylan Phelps [view email]
[v1] Wed, 15 May 2024 11:55:14 UTC (173 KB)

Computer Science > Computation and Language

Title:Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators