Skip to main navigation Skip to search Skip to main content

Linguistically rich vector representations of supertags for TAG parsing

  • Dan Friedman
  • , Jungo Kasai
  • , R. Thomas McCoy
  • , Robert Frank
  • , Forrest Davis
  • , Owen Rambow
  • Yale University
  • Columbia University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In this paper, we explore several techniques for producing vector representations of TAG supertags that can be used as inputs to a neural network-based TAG parser. In the simplest case, the supertag is encoded as a 1-hot vector that is projected to a dense vector. Secondly, we use a tree-recursive neural network that is given as input the structure of the elementary tree. Thirdly, we use hand-crafted feature vectors that describe the syntactic features of each supertag, and project these to a dense vector. These three representations are learned during the training of a neural network TAG parser with a layer that embeds supertags in a low-dimensional space. Finally, we consider an embedding that is trained only on patterns of linear co-occurrence among supertags. By testing the resulting vector representations on the task of completing syntactic analogies, we show that these vector representations capture syntactically relevant information. While our linguistically-informed embeddings outperform atomic embeddings on the syntactic analogy task, we find that the same embeddings lead to only a slight improvement on the task of TAG parsing, indicating that the neural parser is able to induce useful representations of supertags from the data alone.

Original languageEnglish
Title of host publicationTAG+ 2017 - 13th International Workshop on Tree Adjoining Grammars and Related Formalisms, Proceedings
PublisherAssociation for Computational Linguistics (ACL)
Pages122-131
Number of pages10
ISBN (Electronic)9781945626982
StatePublished - 2017
Event13th International Workshop on Tree Adjoining Grammars and Related Formalisms, TAG+ 2017 - Umea, Sweden
Duration: Sep 4 2017Sep 6 2017

Publication series

NameTAG+ 2017 - 13th International Workshop on Tree Adjoining Grammars and Related Formalisms, Proceedings

Conference

Conference13th International Workshop on Tree Adjoining Grammars and Related Formalisms, TAG+ 2017
Country/TerritorySweden
CityUmea
Period09/4/1709/6/17

Fingerprint

Dive into the research topics of 'Linguistically rich vector representations of supertags for TAG parsing'. Together they form a unique fingerprint.

Cite this