Discriminative training methods for hidden Markov models

1 January 2002

conference paper
conference paper
Published by Association for Computational Linguistics (ACL)

p. 1-8
https://doi.org/10.3115/1118693.1118694

Abstract

We describe new algorithms for training tagging models, as an alternative to maximum-entropy models or conditional random fields (CRFs). The algorithms rely on Viterbi decoding of training examples, combined with simple additive updates. We describe theory justifying the algorithms through a modification of the proof of convergence of the perceptron algorithm for classification problems. We give experimental results on part-of-speech tagging and base noun phrase chunking, in both cases showing improvements over results for a maximum-entropy tagger.

Keywords

TRAINING EXAMPLE
CONDITIONAL RANDOM FIELD
HIDDEN MARKOV MODEL
VITERBI DECODING
PERCEPTRON ALGORITHM
NEW ALGORITHM
CLASSIFICATION PROBLEM
PART-OF-SPEECH TAGGING
MAXIMUM-ENTROPY TAGGER
BASE NOUN PHRASE CHUNK
DISCRIMINATIVE TRAINING METHOD
RANDOM FIELD
MAXIMUM ENTROPY MODEL
MAXIMUM ENTROPY
NOUN PHRASE
VITERBI DECODER

Cited by 511 articles