TY - GEN
T1 - Parsing without a grammar
T2 - 3rd IEEE International Conference on Data Mining, ICDM '03
AU - Lloyd, Levon
AU - Skiena, Steven
PY - 2003
Y1 - 2003
AB - The thousands of specialized structured file formats in use today present a substantial barrier to freely exchanging information between applications programs. We consider the problem of deducing such basic features as the whitespace characters, bracketing delimiter symbols, and self-delimiter characters of a given file format from one or more example files. We demonstrate that for sufficiently large example files, we can typically identify the basic features of interest.
UR - https://www.scopus.com/pages/publications/78149320586
M3 - Conference contribution
AN - SCOPUS:78149320586
SN - 0769519784
SN - 9780769519784
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 195
EP - 202
BT - Proceedings - 3rd IEEE International Conference on Data Mining, ICDM 2003
Y2 - 19 November 2003 through 22 November 2003
ER -