Abstract
Large language models (LLMs) have the potential to revolutionize computational social science, particularly in automated textual analysis. In this paper, we conduct a systematic evaluation of the promises and risks associated with using LLMs for text classification tasks, using social movement studies as an example. We propose a framework for social scientists to incorporate LLMs into text annotation, either as the primary coding decision-maker or as a coding assistant. This framework offers researchers tools to develop the potential best-performing prompt, and to systematically examine and report the validity and reliability of LLMs as a methodological tool. Additionally, we evaluate and discuss its epistemic risks associated with validity, reliability, replicability, and transparency. We conclude with several practical guidelines for using LLMs in text annotation tasks and offer recommendations for more effectively communicating epistemic risks in research.
| Original language | English |
|---|---|
| Journal | Social Science Computer Review |
| DOIs | |
| State | Accepted/In press - 2025 |
Keywords
- automated textual analysis
- computational social science
- large language models (LLMs)
- social movement studies
- text annotation
- text classification
Fingerprint
Dive into the research topics of 'Navigating the Risks of Using Large Language Models for Text Annotation in Social Science Research'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver