CLSWeb Main
Caltech Library System
Electronic Theses
                  About | Browse | Search | Caltech Student Instructions

Greenspan, Hayit (1994-05-20) Multi-resolution image processing and learning for texture recognition and image enhancement. http://resolver.caltech.edu/CaltechETD:etd-10192005-104013


Type of Document Dissertation
Author Greenspan, Hayit
URN etd-10192005-104013
Persistent URL http://resolver.caltech.edu/CaltechETD:etd-10192005-104013
Title Multi-resolution image processing and learning for texture recognition and image enhancement
Degree PhD
Option Electrical Engineering
Advisory Committee
Advisor Name Title
Rodney Goodman Committee Chair
Charles Anderson Committee Member
Christof Koch Committee Member
Demetri Psaltis Committee Member
Pietro Perona Committee Member
Keywords
  • none
Date of Defense 1994-05-20
Availability unrestricted
Abstract
A general recognition framework is presented that consists of multi-resolution pyramidal feature-extraction and learning paradigms for classification. The system is presented in the context of the texture recognition task.

In the feature extraction part of the system, an oriented Laplacian pyramid is used as an efficient filtering scheme to transform the input image to a more robust representation in the frequency and orientation space. An optimal technique is presented for computing a steerable representation of the pyramid. Steerability is used to generate a rotation-invariant input representation.

In the learning stage of the system we focus on a rule-based probabilistic learning scheme. This information-theoretic technique is utilized to find the most informative correlations between the attributes and the output classes while producing probability estimates for the outputs. Both unsupervised and supervised learning are utilized. Apart from the rule-based approach we experiment with other non-parametric classifiers, such as the k-nearest neighbor classifier and the Backprop neural-network.

We demonstrate experimentally that our scheme improves significantly upon the state-of-the-art both in rotation-invariant classification and in orientation estimation. A variety of applications are presented, including autonomous navigation scenarios and remote-sensing, as possible extensions for the texture recognition system. A generalization of the system to face-recognition is discussed.

In the latter part of the thesis, a procedure for creating images with higher resolution than the sampling rate would allow is described. The enhancement algorithm augments the frequency content of the image by using a non-linearity that generates phase-coherent higher harmonics. The procedure utilizes the Laplacian pyramid image representation. Results are presented depicting the power-spectra augmentation and the visual enhancement of several images. Simplicity of computations and ease of implementation allow for real-time applications such as high-definition television (HDTV). An initial investigation is pursued to combine the enhancement scheme with pyramid coding schemes.

Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  Greenspan_h_1994.pdf 15.43 Mb 01:11:25 00:36:43 00:32:08 00:16:04 00:01:22

Browse All Available ETDs by ( Author | Option )

If you have more questions or technical problems, please Contact the Caltech Library System.