TY - GEN
T1 - Word Definitions from Large Language Models
AU - Pham, Bach
AU - Wong, Jui Hsuan
AU - Kim, Samuel
AU - Yin, Yunting
AU - Skiena, Steven
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Dictionary definitions are historically the arbitrator of what words mean, but this primacy has come under threat by recent progress in NLP, including word embeddings and generative models like ChatGPT. We present an exploratory study of the degree of alignment between word definitions from classical dictionaries and these newer computational artifacts. Specifically, we compare definitions from three published dictionaries to those generated from variants of ChatGPT. We show that (i) definitions from different traditional dictionaries exhibit more surface form similarity than do model-generated definitions, (ii) that the ChatGPT definitions are highly accurate, comparable to traditional dictionaries, and (iii) ChatGPT-based embedding definitions retain their accuracy even on low frequency words, much better than GloVE and FastText word embeddings.
AB - Dictionary definitions are historically the arbitrator of what words mean, but this primacy has come under threat by recent progress in NLP, including word embeddings and generative models like ChatGPT. We present an exploratory study of the degree of alignment between word definitions from classical dictionaries and these newer computational artifacts. Specifically, we compare definitions from three published dictionaries to those generated from variants of ChatGPT. We show that (i) definitions from different traditional dictionaries exhibit more surface form similarity than do model-generated definitions, (ii) that the ChatGPT definitions are highly accurate, comparable to traditional dictionaries, and (iii) ChatGPT-based embedding definitions retain their accuracy even on low frequency words, much better than GloVE and FastText word embeddings.
KW - dictionary
KW - large language model
KW - word embedding
UR - https://www.scopus.com/pages/publications/105009456846
U2 - 10.1109/ICSC64641.2025.00028
DO - 10.1109/ICSC64641.2025.00028
M3 - Conference contribution
AN - SCOPUS:105009456846
T3 - Proceedings - IEEE International Conference on Semantic Computing, ICSC
SP - 158
EP - 162
BT - Proceedings - 2025 19th International Conference on Semantic Computing, ICSC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 19th International Conference on Semantic Computing, ICSC 2025
Y2 - 3 February 2025 through 5 February 2025
ER -