Shape classification using invariant features and contextual information in the bag-of-words model

Pattern Recognition, Volume 48, Issue 3, March 2015, Pages 894-906.

Bharath Ramesh, Cheng Xiang, Tong Heng Lee.

Department of Electrical and Computer Engineering, National University of Singapore, Singapore, 117576.

 

Abstract

In this paper, we describe a classification framework for binary shapes that have scale, rotation and strong viewpoint variations. To this end, we develop several novel techniques. First, we employ the spectral magnitude of log-polar transform as a local feature in the bag-of-words model. Second, we incorporate contextual information in the bag-of-words model using a novel method to extract bi-grams from the spatial co-occurrence matrix. Third, a novel metric termed ‘weighted gain ratio’ is proposed to select a suitable codebook size in the bag-of-words model. The proposed metric is generic, and hence it can be used for any clustering quality evaluation task. Fourth, a joint learning framework is proposed to learn features in a data-driven manner, and thus avoid manual fine-tuning of the model parameters. We test our shape classification system on the animal shapes dataset and significantly outperform state-of-the-art methods in the literature.

Go To Journal

 

 

Check Also

Modular Hardware Paths for Scalable Quantum Information Processing

Significance  Image credit: Science. 2025 Dec 4;390(6777):1004-1010. doi: 10.1126/science.adz8659. Reference Awschalom DD, Bernien H, Hanson …