Bilkent University
Department of Computer Engineering
MS THESIS PRESENTATION

 

Fine-Grained Object Recognition in Remote Sensing Imagery

 

Gencer Sümbül
MS Student
(Supervisor: Assoc. Prof. Dr. Selim Aksoy)
Computer Engineering Department
Bilkent University

Fine-grained object recognition aims to determine the type of an object in domains with a large number of sub-categories. The steadily increase in spatial and spectral resolution entailing new details in remote sensing image data, and consequently more diversified target object classes having subtle differences makes it an emerging application. For the approaches using images from a single domain, widespread fully supervised algorithms do not completely fit into accomplishing this problem since target object classes tend to have low between-class variance and high within-class variance with small sample sizes. As an even more arduous task, a method for zero-shot learning (ZSL), in which identification of unseen sub-categories is tackled by associating them with previously learned seen sub-categories when there is no training example for some of the classes, is proposed. More specifically, our method learns a compatibility function between image representation obtained from a deep convolutional neural network and the semantics of target object sub-categories explained by auxiliary information gathered from complementary sources. Knowledge transfer for unseen classes is carried out by maximizing this function throughout the inference. Furthermore, benefitting from multiple image sensors can overcome the drawbacks of closely intertwined sub-categories that limits the object recognition performance. However, since multiple images may be acquired from different sensors under different conditions at different spatial and spectral resolutions, they may be geometrically unaligned correctly due to seasonal changes, different viewing geometry, acquisition noise, an imperfection of sensors, different atmospheric conditions etc. To address these challenges, a neural network model that aims to correctly align images acquired from different sources and to learn the classification rules in a unified framework simultaneously is proposed. In this network, one of the sources is used as the reference and the others are aligned with the reference image at representation level throughout a learned weighting mechanism. At the end, classification of sub-categories is carried out with a feature-level fusion of representations from the source region and estimated multiple target regions. Experimental analysis conducted on a newly proposed data set shows that both zero-shot learning algorithm and the multi-source fine-grained object recognition algorithm give promising results.

 

DATE: 07 June 2018, Thursday @ 13:30
PLACE: EA-409