A New Loss Function with “Markov Property”for Information Extraction

We propose a new loss function for the discriminative
learning of Markov random fields, which is an intermediate
loss function between sequential loss and pointwise loss.
We show this loss function has “Markov property”, that is,
the importance of correct labeling for a particular position
depends on the numbers of the correct labels around there.
This property works to keep local consistencies and is useful
for optimizing systems identifying structural segments, such
as information extraction systems.

By: Yuta Tsuboi and Hisashi Kashima

Published in: RT0660 in 2007


This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.


Questions about this service can be mailed to reports@us.ibm.com .