Voice and Touch Based Error-tolerant Multimodal Text Editing and Correction for Smartphones

  • Stony Brook University
  • Alphabet Inc.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

16 Scopus citations

Abstract

Editing operations such as cut, copy, and paste, as well as correcting errors in typed text, are often tedious and challenging to perform on smartphones. In this paper, we present VT, a voice and touch-based multimodal text editing and correction method for smartphones. To edit text with VT, the user glides a finger over a text fragment and dictates a command, such as "bold" to change the format of the fragment, or taps inside a text area and speaks a command such as "highlight this paragraph" to edit the text. To correct text, the user taps approximately on the erroneous text fragment and dictates the new content for substitution or insertion. VT combines touch and voice inputs with language context, such as a language model and phrase similarity, to infer the user's editing intention, which allows it to handle ambiguities and noisy input signals. This is a significant advantage over existing error correction methods (e.g., iOS's Voice Control), which require precise cursor control or text selection. Our evaluation shows that VT significantly improves the efficiency of text editing and text correction on smartphones over the touch-only method and iOS's Voice Control. In our user studies, VT reduced text editing time by 30.80% and text correction time by 29.97% relative to the touch-only method, and reduced text editing time by 30.81% and text correction time by 47.96% relative to iOS's Voice Control.

Original language: English
Title of host publication: UIST 2021 - Proceedings of the 34th Annual ACM Symposium on User Interface Software and Technology
Publisher: Association for Computing Machinery, Inc
Pages: 162-178
Number of pages: 17
ISBN (Electronic): 9781450386357
DOIs
State: Published - Oct 10 2021
Event: 34th Annual ACM Symposium on User Interface Software and Technology, UIST 2021 - Virtual, Online, United States
Duration: Oct 10 2021 → Oct 14 2021

Publication series

Name: UIST 2021 - Proceedings of the 34th Annual ACM Symposium on User Interface Software and Technology

Conference

Conference: 34th Annual ACM Symposium on User Interface Software and Technology, UIST 2021
Country/Territory: United States
City: Virtual, Online
Period: 10/10/21 → 10/14/21

Keywords

  • Multimodal interaction
  • smartphones
  • text correction
  • text editing
  • touch input
