Text Chunking Using Regularized Winnow

Many machine learning methods have recently been applied to natural language processing
tasks. Among them, the Winnow algorithm has been argued to be particularly suitable for NLP problems, due to its robustness to irrelevant features. However in theory, Winnow may not converge for non-separable data. To remedy this problem, a modification called regularized Winnow has
been proposed. In this paper, we apply this new method to text chunking. We show that
this method achieves state of the art performance with significantly less computation
than previous approaches.

By: Tong Zhang

Published in: RC22011 in 2001

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

RC22011.pdf

Questions about this service can be mailed to reports@us.ibm.com .