Pattern Classification using the Principle of Parsimony

The principle of parsimony, or Occam's razor, is a key principle for improving generalization performance in pattern classification. It is essentially equivalent to reducing the variance of a classifier at the cost of increased boundary bias. In this paper, we provide a quantitative way to express this principle in terms of the outcome of a classifier, instead of explicitly regularizing model complexity in terms of the model parameters. We then use this principle to design a new kernel machine and a modified $k$-nearest neighbor algorithm. We experimentally validate the performance of these two classifiers on real-life datasets, and we discuss the relationship of the proposed kernel machine to several other existing kernel machines.
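The modified $k$-nearest neighbor rule itself is defined in the body of the report; as a rough, hypothetical illustration of the bias-variance tradeoff named above, the sketch below (plain NumPy; the function `knn_predict` and the synthetic data are ours, not the paper's) shows how the neighborhood size $k$ acts as a parsimony control: a larger $k$ averages the classifier's outcome over more neighbors, lowering its variance while smoothing, and hence biasing, the decision boundary.

```python
import numpy as np

def knn_predict(X_train, y_train, X_test, k):
    """Majority-vote k-NN. Larger k gives a smoother (more parsimonious)
    decision rule: lower variance, higher boundary bias."""
    preds = np.empty(len(X_test), dtype=y_train.dtype)
    for i, x in enumerate(X_test):
        dist = np.linalg.norm(X_train - x, axis=1)         # Euclidean distances
        nearest = np.argsort(dist)[:k]                     # indices of k nearest points
        preds[i] = np.bincount(y_train[nearest]).argmax()  # majority label among them
    return preds

# Synthetic two-class problem: compare a complex (k=1) classifier
# with a parsimonious (k=15) one on held-out data.
rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=300)
X = rng.normal(size=(300, 2)) + 1.5 * y[:, None]  # two overlapping Gaussians
X_tr, y_tr, X_te, y_te = X[:200], y[:200], X[200:], y[200:]
for k in (1, 15):
    acc = (knn_predict(X_tr, y_tr, X_te, k) == y_te).mean()
    print(f"k={k:2d}  test accuracy={acc:.2f}")
```

This is only a toy instance of the variance-reduction idea; the paper's contribution is to express the parsimony criterion quantitatively through the classifier's outcome rather than through a fixed choice of $k$ or an explicit penalty on model parameters.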

By: Jayanta Basak

Published as: IBM Research Report RI09003, 2009

LIMITED DISTRIBUTION NOTICE:

This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties).