Conditional Neural Fields

Jian Peng, Liefeng Bo and Jinbo Xu

Toyota Technological Institute at Chicago, USA

Abstract: Conditional random fields (CRF) [2] are widely used for sequence labeling such as natural language processing and biological sequence analysis. Most CRF models use a linear potential function to represent the relationship between input features and output. However, in many real-world applications such as protein structure prediction and handwriting recognition, the relationship between input features and output is highly complex and nonlinear, which cannot be accurately modeled by a linear function. To model the nonlinear relationship between input and output we propose a new conditional probabilistic graphical model, Conditional Neural Fields (CNF) [1], for sequence labeling. CNF extends CRF by adding one (or possibly more) middle layer between input and output. The middle layer consists of a number of gate functions, each acting as a local neuron or feature extractor to capture the nonlinear relationship between input and output. Therefore, conceptually CNF is much more expressive than CRF. Experiments on two widely-used benchmarks indicate that CNF performs significantly better than a number of popular methods. In particular, CNF is the best among approximately 10 machine learning methods for protein secondary structure prediction and also among a few of the best methods for handwriting recognition.

References

  1. Jian Peng, Liefeng Bo, and Jinbo Xu, Conditional Neural Fields, Advances in Neural Information Processing Systems (NIPS), December, 2009. [PDF] [BIB]

  2. J. Lafferty, A. McCallum, and F. Pereira, Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In ICML 2001.


C++ Source Code

Description: CNF is a package for Conditional Neural Fields.

Requirement: mpiCC.

Download: [code]. This package is free for academic usage. You can run it at your own risk.