EasyEnglishAnalyzer: Taking Controlled Language from Sentence to Discourse Level

Controlled Language checking has traditionally been applied largely on a sentence level by placing restrictions on permissible vocabulary and permissible syntactic constructions, including proper punctuation. Only little attention has been paid to document or discourse level checking. In this paper, we report on work on EasyEnglishAnalyzer to handle certain discourse and document level checks. This work helps take Controlled Language checkers from the sentence level to the discourse and document level.

Deep semantic analysis provided by a discourse understanding system assists with the semantically more challenging tasks such as proper paragraph structure. For checks related to overall style and organization, document structure is recognized by enhanced interpretation and use of document structure tags so that appropriate checks can be applied in a context-sensitive manner. Parsing of combined segments is applied in checking correctness of list environments.

By: Arendse Bernth

Published in: RC23994 in 2006

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

rc23994.pdf

Questions about this service can be mailed to reports@us.ibm.com .