ECE Course Outline


Methods of Pattern Recognition with Application to Voice (3-0-3)

ECE 4270
Catalog Description
Theory and application of pattern recognition with a special application section for automatic speech recognition and related signal processing.
Theodorous, Sergios and Koutroumbas, Konstantinos, Pattern Recognition (4th edition), Academic Press, 2008. ISBN 9781597492720 (required)

Topical Outline
Review of probablilty with an emphasis on random vectors
Linear transformations, diagonalizations, rotations, projections
Distance measures
Clustering (unsupervised pattern recognition).
        Interset distances
        Sums of distances
        Intraset distances (distortion measures)
        Performance measures
        Hierarchical clustering
Parametric Modeling
        MAP classification, Bayesian analysis
        Minimum risk criteria, Neyman-Pearson criteria
        Gaussian assumptions
        Gaussian mixture densities, EM algorithm
        Non-Gaussian: training of densities using basis functions
Linear discriminant functions
        Single layer perceptron
        Gradient descent algorithms
        Widrow-Hoff algorithm
        Nonlinear transformations prior to LDFs (potential functions).
Neural Networks
        Feedforward (MLPs)
        Back Propagation.
        Radial Basis function NNs (RBFs)
        Self-organizing feature maps
Data (Dimensionality) Reduction
Intro to sequence comparisons: time warps and stochastic grammars.
Intro to acoustic phonetics
Front ends (feature acquisition) for speech
Filter banks and LPC
Auditory models
Development of Mel-Cepstra from both a PR and
        DSP point of view (Karhunen-Loeve transformation)
Dynamic Time Warping (Deterministic  and Probabilistic)
Clustering for VQ-DTW, Training, Template Adaptation 
Discriminative Methods
Robust methods
Markov Processes, hidden and observed
Discrete HMM's, Recognition and Training
Continuous Observation HMM's
Semi-Markov Models
Model Adaptation 
Connected Words: Level Building
Large Vocabulary Systems 
Word Spotting 
Speaker ID