CLSWeb Main
Caltech Library System
Electronic Theses
                  About | Browse | Search | Caltech Student Instructions

Song, Xubo (1998-11-23) Contextual pattern recognition with applications to biomedical image identification. http://resolver.caltech.edu/CaltechETD:etd-09222005-111015


Type of Document Dissertation
Author Song, Xubo
Author's Email Address xubosong AT csee.ogi.edu
URN etd-09222005-111015
Persistent URL http://resolver.caltech.edu/CaltechETD:etd-09222005-111015
Title Contextual pattern recognition with applications to biomedical image identification
Degree PhD
Option Electrical Engineering
Advisory Committee
Advisor Name Title
Yaser S. Abu-Mostafa Committee Chair
Demetri Psaltis Committee Member
Harvey Kasdan Committee Member
Pietro Perona Committee Member
Keywords
  • context
  • contextual information
  • pattern recognition
  • image recognition
  • Compound Bayesian Theory
Date of Defense 1998-11-23
Availability unrestricted
Abstract
This thesis studies two rather distinct topics: one is the incorporation of contextual information in pattern recognition, with applications to biomedical image identification; and the other is the theoretical modeling of learning and generalization in the regime of machine learning.

In Part I of the thesis, we propose techniques to incorporate contextual information into object classification. In the real world there are cases where the identity of an object is ambiguous due to the noise in the measurements based on which the classification should be made. It is helpful to reduce the ambiguity by utilizing extra information referred to as context, which in our case is the identities of the accompanying objects. We investigate the incorporation of both full and partial context. Their error probabilities, in terms of both set-by-set error and element-by-element error, are established and compared to context-free approach. The computational cost is studied in detail for full context, partial context and context-free cases. The techniques are applied to toy problems as well as real world problems such as white blood cell image classification and microscopic urinalysis. It is demonstrated that superior classification performance is achieved by using context. In our particular application, it reduces overall classification error, as well as false positive and false negative diagnosis rates.

In Part II of the thesis, we propose a novel theoretical framework, called the Bin Model, for learning and generalization. Using the Bin Model, a closed form is derived for generalization that estimates the out-of-sample performance in terms of the in-sample performance. We address the problems of overfitting, and characterize conditions under which it does not appear. The effect of noise on generalization is studied, and the generalization of the Bin Model framework from classification problems to regression problems is discussed.

Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  Song_x_1999.pdf 4.24 Mb 00:19:38 00:10:06 00:08:50 00:04:25 00:00:22

Browse All Available ETDs by ( Author | Option )

If you have more questions or technical problems, please Contact the Caltech Library System.