Artificial Intelligence Research Laboratory
Department of Computer Science
Iowa State University


Constructive Neural Network Learning Algorithms for Pattern Classification
   Personnel   Project Summary   Funding   Publications   Additional Information   Projects   AI Lab  

Personnel

Project Summary

Induction of pattern classifiers from data is an important area of research in machine learning which finds applications in diverse areas including automated diagnosis, bioinformatics, design of customizable information assistants, intrusion detection in computer systems, among others. Artificial neural networks, because of their potential for massive parallelism and fault and noise tolerance, offer an attractive approach to the design of trainable pattern classifiers.

Constructive learning algorithms, which avoid the guesswork involved in deciding a suitable network architectures for different pattern classification problems by growing a network by recruiting neurons as needed can be effectively trained to solve complex pattern classification problems. Furthermore, it is possible to generalize (and provide convergence guarantees for) a large family of such algorithms designed for 2-class binary pattern classification problems to handle classification problemsinvolving real-valued patterns and an arbitrary number of classes.

Different constructive neural network algorithms as well as the algorithms used to train the neuron weights have different inductive and representational biases making their performance sensitive to specific characteristics of the datasets. Thus, it is useful to experiment with a toolkit of multiple algorithms for practical applications. It is also possible to design hybrid algorithms that exploit the synergy between different algorithms. Visualization of decision boundaries constructed by the different algorithms can often provide useful insights into their behavior. This argues for systems that combine machine learning algorithms with visualization techniques in high dimensions to facilitate interactive knowledge discovery.

A simple inter-pattern distance based, provably convergent, polynomial time constructive neural network algorithm compares very favorably with computationally far more expensive algorithms in terms of generalization accuracy. The generalization accuracy of this algorithm (as expected), is sensitive to the choice of attributes used to represent the patterns and can be improved by using feature subset selection.

Fairly simple algorithms for incorporating prior knowledge can be used to enhance the performance of constructive algorithms for designing pattern classifiers. This raises the possibility of using constructive algorithms for knowledge transfer across similar tasks to facilitate multitask learning in complex enviroments.

A constructive learning procedure, augmented with a Kalman filter mechanism for estimation, can be effectively used for place learning and localization in a-priori unknown environments in the presence of sensor and motion uncertainties. The resulting computational model successfully accounts for a large body of behavioral and neurobiological data from animal experiments and offers several testable predictions.

Preliminary results indicate that constructive learning algorithms and related approaches to machine learning can be adapted for data-driven knowledge discoveryfrom distributed, dynamic data sources in a number of domains including bioinformatics, monitoring and control of complex distributed systems (computer systems, communication networks, power systems) and decision support systems.

Funding

Publications

Additional Information

To appear.


Dr. Vasant Honavar
Artificial Intelligence Research Laboratory
Department of Computer Science
Iowa State University
Atanasoff Hall, Ames, IA 50011-1040 USA
phone: +1-515-294-1098, +1-515-294-4377; fax: +1-515-294-0258

© Vasant Honavar, 1999.