Please use this identifier to cite or link to this item:
http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/822
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Barathy, Mayurathan | |
dc.date.accessioned | 2016-01-08T11:53:43Z | |
dc.date.accessioned | 2022-06-28T04:51:43Z | - |
dc.date.available | 2016-01-08T11:53:43Z | |
dc.date.available | 2022-06-28T04:51:43Z | - |
dc.date.issued | 2013-10-04 | |
dc.identifier.uri | http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/822 | - |
dc.description.abstract | In well-known framework for visual scene recognition literature, clustering to construct a visual codebook is an important step, and is usually achieved by k-means clustering. This is known to be computational and performance bottleneck. On current benchmark tasks like the PASCAL VOC challenge, with only 20 classes and a few hundred images per class, the computational problem is one of clustering millions of 128 dimensional vectors into codebooks of a few thousand clusters. K-means being computationally hard on problems of such a scale implies that scaling up to even larger tasks such as the ImageNet challenge, with thousands of classes, becomes impossible. Additionally, there is an inherent compromise between constructing a large codebook, which can potentially retain noise in the data as cluster centres and have the undesirable effect of posing the subsequent classification problem in high dimensions, and a small codebook which loses resolution of the distribution of image features. We present a novel approach to the design of codebooks in patch-based, bag-of-feature visual scene recognition problems. The Sequential Input Space Carving (SISC) approach that we present achieves compact codebooks in a fraction of the computation time needed by the k-means clustering method usually employed in this setting. We demonstrate the performance of the SISC using several recognition tasks including the visual object recognition tasks: PASCAL VOC challenge, MPEG-7 Part-B silhouette image, Caltech-101 and Caltech-256 datasets, human action classification task: KTH and WEIZMANN datasets and texture classification tasks: UIUC and CUReT datasets. Hence, we compare and contrast the recognition performance of SISC evaluated with two different clustering techniques: k-means and Resource-Allocating Codebook (RAC).In all these, the SISC approach achieves classification performances comparable to those reported by other authors, and sometimes outperforms them, in a fraction of the computing time and at significantly smaller codebook sizes. | en_US |
dc.language.iso | en | en_US |
dc.publisher | BMVA | en_US |
dc.subject | Visual codebook, Sequential Input Space Carving, K-means, Mean-shift, Resource Allocating Codebook | en_US |
dc.title | Sequential Input Space Carving for visual codebook design | en_US |
dc.type | Article | en_US |
Appears in Collections: | Computer Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Sequential Input Space Carving for visual codebook design.pdf | 180.62 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.