Skip to main navigation Skip to search Skip to main content

Word Definitions from Large Language Models

  • Bach Pham
  • , Jui Hsuan Wong
  • , Samuel Kim
  • , Yunting Yin
  • , Steven Skiena
  • Earlham College

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Dictionary definitions are historically the arbitrator of what words mean, but this primacy has come under threat by recent progress in NLP, including word embeddings and generative models like ChatGPT. We present an exploratory study of the degree of alignment between word definitions from classical dictionaries and these newer computational artifacts. Specifically, we compare definitions from three published dictionaries to those generated from variants of ChatGPT. We show that (i) definitions from different traditional dictionaries exhibit more surface form similarity than do model-generated definitions, (ii) that the ChatGPT definitions are highly accurate, comparable to traditional dictionaries, and (iii) ChatGPT-based embedding definitions retain their accuracy even on low frequency words, much better than GloVE and FastText word embeddings.

Original languageEnglish
Title of host publicationProceedings - 2025 19th International Conference on Semantic Computing, ICSC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages158-162
Number of pages5
ISBN (Electronic)9798331524265
DOIs
StatePublished - 2025
Event19th International Conference on Semantic Computing, ICSC 2025 - Hybrid, Laguna Hills, United States
Duration: Feb 3 2025Feb 5 2025

Publication series

NameProceedings - IEEE International Conference on Semantic Computing, ICSC
ISSN (Print)2325-6516
ISSN (Electronic)2472-9671

Conference

Conference19th International Conference on Semantic Computing, ICSC 2025
Country/TerritoryUnited States
CityHybrid, Laguna Hills
Period02/3/2502/5/25

Keywords

  • dictionary
  • large language model
  • word embedding

Fingerprint

Dive into the research topics of 'Word Definitions from Large Language Models'. Together they form a unique fingerprint.

Cite this