Skip to main navigation Skip to search Skip to main content

Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop

  • Columbia University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

325 Scopus citations

Abstract

We present an approach to using a morphological analyzer for tokenizing andmorphologically tagging (including partof- speech tagging) Arabic words in one process. We learn classifiers for individual morphological features, as well as ways of using these classifiers to choose among entries from the output of the analyzer. We obtain accuracy rates on all tasks in the high nineties.

Original languageEnglish
Title of host publicationACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
PublisherAssociation for Computational Linguistics (ACL)
Pages573-580
Number of pages8
ISBN (Print)1932432515, 9781932432510
DOIs
StatePublished - 2005
Event43rd Annual Meeting of the Association for Computational Linguistics, ACL-05 - Ann Arbor, MI, United States
Duration: Jun 25 2005Jun 30 2005

Publication series

NameACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference

Conference

Conference43rd Annual Meeting of the Association for Computational Linguistics, ACL-05
Country/TerritoryUnited States
CityAnn Arbor, MI
Period06/25/0506/30/05

Fingerprint

Dive into the research topics of 'Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop'. Together they form a unique fingerprint.

Cite this