Skip to main navigation Skip to search Skip to main content

SIGMORPHON-UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection

  • Jordan Kodner
  • , Salam Khalifa
  • , Khuyagbaatar Batsuren
  • , Hossep Dolatian
  • , Ryan Cotterell
  • , Faruk Akkuş
  • , Antonios Anastasopoulos
  • , Taras Andrushko
  • , Aryaman Arora
  • , Nona Atanelov Gábor Bella
  • , Elena Budianskaya
  • , Yustinus Ghanggo Ate
  • , Omer Goldman
  • , David Guriel
  • , Simon Guriel
  • , Silvia Guriel-Agiashvili
  • , Witold Kieraś
  • , Andrew Krizhanovsky
  • , Natalia Krizhanovsky
  • , Igor Marchenko
  • Magdalena Markowska, Polina Mashkovtseva, Maria Nepomniashchaya, Daria Rodionova, Karina Sheifer, Alexandra Serova, Anastasia Yemelina, Jeremiah Young, Ekaterina Vylomova
  • Stony Brook University
  • National University of Mongolia
  • Swiss Federal Institute of Technology Zurich
  • University of Massachusetts
  • George Mason University
  • Higher School of Economics
  • Georgetown University
  • University of Trento
  • Russian Academy of Sciences
  • STKIP Weetebula
  • Australian National University
  • Bar-Ilan University
  • Institute of Computer Science of the Polish Academy of Sciences
  • Karelian Research Centre, Russian Academy of Sciences
  • University of Oregon
  • University of Melbourne

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

33 Scopus citations

Abstract

The 2022 SIGMORPHON-UniMorph shared task on large scale morphological inflection generation included a wide range of typologically diverse languages: 33 languages from 11 top-level language families: Arabic (Modern Standard), Assamese, Braj, Chukchi, Eastern Armenian, Evenki, Georgian, Gothic, Gujarati, Hebrew, Hungarian, Itelmen, Karelian, Kazakh, Ket, Khalkha Mongolian, Kholosi, Korean, Lamahalot, Low German, Ludic, Magahi, Middle Low German, Old English, Old High German, Old Norse, Polish, Pomak, Slovak, Turkish, Upper Sorbian, Veps, and Xibe. We emphasize generalization along different dimensions this year by evaluating test items with unseen lemmas and unseen features separately under small and large training conditions. Across the six submitted systems and two baselines, the prediction of inflections with unseen features proved challenging, with average performance decreased substantially from last year. This was true even for languages for which the forms were in principle predictable, which suggests that further work is needed in designing systems that capture the various types of generalization required for the world's languages.

Original languageEnglish
Title of host publicationSIGMORPHON 2022 - 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Proceedings of the Workshop
EditorsGarrett Nicolai, Eleanor Chodroff
PublisherAssociation for Computational Linguistics (ACL)
Pages176-203
Number of pages28
ISBN (Electronic)9781955917827
StatePublished - 2022
Event19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, SIGMORPHON 2022 - Seattle, United States
Duration: Jul 14 2022 → …

Publication series

NameSIGMORPHON 2022 - 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Proceedings of the Workshop

Conference

Conference19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, SIGMORPHON 2022
Country/TerritoryUnited States
CitySeattle
Period07/14/22 → …

Fingerprint

Dive into the research topics of 'SIGMORPHON-UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection'. Together they form a unique fingerprint.

Cite this