Skip to main navigation Skip to search Skip to main content

Natural language watermarking and tamperproofing

  • Mikhail J. Atallah
  • , Victor Raskin
  • , Christian F. Hempelmann
  • , Mercan Karahan
  • , Radu Sion
  • , Umut Topkara
  • , Katrina E. Triezenberg
  • Purdue University

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

129 Scopus citations

Abstract

Two main results in the area of information hiding in natural language text are presented. A semantically-based scheme dramatically improves the information-hiding capacity of any text through two techniques: (i) modifying the granularity of meaning of individual sentences, whereas our own previous scheme kept the granularity fixed, and (ii) halving the number of sentences affected by the watermark. No longer a "long text, short watermark" approach, it now makes it possible to watermark short texts, like wire agency reports. Using both the abovementioned semantic marking scheme and our previous syntactically-based method hides information in a way that reveals any non-trivial tampering with the text (while re-formatting is not considered to be tampering - the problem would be solved trivially otherwise by hiding a hash of the text) with a probability 1-2-β(n+1), n being its number of sentences and β a small positive integer based on the extent of co-referencing.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsFabien A. P. Petitcolas
PublisherSpringer Verlag
Pages196-212
Number of pages17
ISBN (Print)3540004211
DOIs
StatePublished - 2003

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2578
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fingerprint

Dive into the research topics of 'Natural language watermarking and tamperproofing'. Together they form a unique fingerprint.

Cite this