Open Access

Discriminative Feature Selection via Multiclass Variable Memory Markov Model

EURASIP Journal on Advances in Signal Processing 2003, 2003:850172

DOI: 10.1155/S111086570321115X

Received: 18 April 2002

Published: 25 February 2003

Abstract

We propose a novel feature selection method based on a variable memory Markov (VMM) model. The VMM was originally proposed as a generative model that tries to preserve the original source statistics from training data. We extend this technique to handle several sources simultaneously, and further apply a new criterion to prune nondiscriminative features out of the model. This results in a multiclass discriminative VMM (DVMM), which is highly efficient, scaling linearly with data size. Moreover, we suggest a natural scheme to sort the remaining features by their discriminative power with respect to the sources at hand. We demonstrate the utility of our method for text and protein classification tasks.

Keywords

variable memory Markov (VMM) model; feature selection; multiclass discriminative analysis

Authors’ Affiliations

(1) School of Engineering and Computer Science and Interdisciplinary Center for Neural Computation, The Hebrew University of Jerusalem
(2) School of Engineering and Computer Science, The Hebrew University of Jerusalem
(3) IBM Research Laboratory in Haifa, Haifa University

Copyright

© 2003 Hindawi Publishing Corporation