Unsupervised Object Annotation through Context Analysis

A. M. Riad; Hamdy K. Elminir; Sameh Abd-elghany

Call for Paper

August Edition

IJAIS solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 28 July 2025

Submit your paper

Know more

The week's pick

Enhancing Financial Time Series Predictions with a Hybrid BNN-LSTM Approach

Anika Tahsin Biva A.B.M. Shahadat Hossain Md. Shafiul Alom Khan Iqbal Habib

Random Articles

Analysis and Performance Assessment of CPU Scheduling Algorithms in Cloud using CloudSim

July

2013

The Prospects of E-Learning in School Education

May

2016

Optimizing the Intelligent Generic Query Mode and its Interface for Relational Database Applications

December

2012

Semantic Similarity Measure for Pairs of Short Biological Texts

October

2012

Reseach Article

Unsupervised Object Annotation through Context Analysis

by A. M. Riad, Hamdy K. Elminir, Sameh Abd-elghany

International Journal of Applied Information Systems

Foundation of Computer Science (FCS), NY, USA

Volume 5 - Number 1

Year of Publication: 2013

Authors: A. M. Riad, Hamdy K. Elminir, Sameh Abd-elghany

10.5120/ijais12-450787

A. M. Riad, Hamdy K. Elminir, Sameh Abd-elghany . Unsupervised Object Annotation through Context Analysis. International Journal of Applied Information Systems. 5, 1 ( January 2013), 10-19. DOI=10.5120/ijais12-450787

@article{ 10.5120/ijais12-450787,

author = { A. M. Riad, Hamdy K. Elminir, Sameh Abd-elghany },

title = { Unsupervised Object Annotation through Context Analysis },

journal = { International Journal of Applied Information Systems },

issue_date = { January 2013 },

volume = { 5 },

number = { 1 },

month = { January },

year = { 2013 },

issn = { 2249-0868 },

pages = { 10-19 },

numpages = {9},

url = { https://www.ijais.org/archives/volume5/number1/405-0787/ },

doi = { 10.5120/ijais12-450787 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2023-07-05T16:00:39.584853+05:30

%A A. M. Riad

%A Hamdy K. Elminir

%A Sameh Abd-elghany

%T Unsupervised Object Annotation through Context Analysis

%J International Journal of Applied Information Systems

%@ 2249-0868

%V 5

%N 1

%P 10-19

%D 2013

%I Foundation of Computer Science (FCS), NY, USA

Abstract

The goal of object level annotation is to locate and identify instances of an object category within an image. Nowadays, Most of the current object level annotation systems annotate the object according to the visual appearance in the image. Recognizing an object in an image based visual appearance yield ambiguity in object detection due to appearance confusion for example "sky" object may be annotated as "water" according to similarity in visual appearance. As a result, these systems don't recognize the objects in an image accurately due to the lack of scene context. In the task of visual object recognition, scene context can play important role in resolving the ambiguities in object detection. In order to solve the ambiguity problem, this paper presents a new technique for a context based object level annotation that considers both the semantic context and spatial context analysis to reduce ambiguous in object annotation.

References

Sevil, S. G. ; Kucuktunc, O. ; Duygulu, P. & Can, F. (2010), 'Automatic tag expansion using visual similarity for photo sharing websites. ', Multimedia Tools Appl. 49 (1) , 81-99 .
Weinberger, K. Q. ; Slaney, M. & van Zwol, R. (2008), Resolving tag ambiguity. , in Abdulmotaleb El-Saddik; Son Vuong; Carsten Griwodz; Alberto Del Bimbo; K. Selçuk Candan & Alejandro Jaimes, ed. , 'ACM Multimedia' , ACM, , pp. 111-120 .
arneiro, G. ; Chan, A. B. ; Moreno, P. J. & Vasconcelos, N. (2007), 'Supervised Learning of Semantic Classes for Image Annotation and Retrieval. ', IEEE Trans. Pattern Anal. Mach. Intell. 29 (3) , 394-410.
Zhang, L. & Ma, J. (2011), 'Image annotation by incorporating word correlations into multi-class SVM. ', Soft Comput. 15 (5) , 917-927 .
S. Zhang, B. Li, and X. Xue, "Semi-automatic dynamic auxiliary-tag-aided image annotation", presented at Pattern Recognition, 2010, pp. 470-477.
Ding, G. ; 0001, J. W. ; Xu, N. & 0014, L. Z. (2009), Automatic Image Annotations by Mining Web Image Data. , in, 'ICDM Workshops' , IEEE Computer Society, , pp. 152-157 .
Wang, X. -J. ; 0001, L. Z. ; Jing, F. & Ma, W. -Y. (2006), AnnoSearch: Image Auto-Annotation by Search. , in 'CVPR (2)' , IEEE Computer Society, , pp. 1483-1490 .
Llorente, A. ; Motta, E. & Rüger, S. M. (2009), Image Annotation Refinement Using Web-Based Keyword Correlation. , in 'SAMT' , Springer, , pp. 188-191 .
Weston, J. ; Bengio, S. & Usunier, N. (2010), 'Large scale image annotation: learning to rank with joint word-image embeddings', Machine Learning 81 , 21-35 .
Liu, D. ; Hua, X. -S. & Zhang, H. -J. (2011), 'Content-based tag processing for Internet social images. ', Multimedia Tools Appl. 51 (2) , 723-738 .
Liu, D. ; Hua, X. -S. ; Yang, L. & Zhang, H. -J. (2009), Multiple-Instance Active Learning for Image Categorization. , in Benoit Huet; Alan F. Smeaton; Ketan Mayer-Patel & Yannis S. Avrithis, ed. , 'MMM' , Springer, , pp. 239-249 .
Liu, J. ; Wang, B. ; Lu, H. & Ma, S. (2008), 'A graph-based image annotation framework. ', Pattern Recognition Letters 29 (4) , 407-415
Wang, X. -J. ; Ma, W. -Y. ; 0001, L. Z. & Li, X. (2005), Multi-graph enabled active learning for multimodal web image retrieval. , in HongJiang Zhang; John R. Smith & Qi Tian, ed. , 'Multimedia Information Retrieval' , ACM, , pp. 65-72
Jing, Y. & Baluja, S. (2008), 'VisualRank: Applying PageRank to Large-Scale Image Search. ', IEEE Trans. Pattern Anal. Mach. Intell. 30 (11) , 1877-1890 .
Wu F, Han YH, Zhuang YT, "Multiple hypergraph clustering of Web images by mining Word2Image correlations", presented at COMPUTER SCIENCE AND TECHNOLOGY , 2010, pp 750-760
Liu, D. ; Yan, S. ; Rui, Y. & Zhang, H. -J. (2010), Unified tag analysis with multi-edge graph. , in 'ACM Multimedia' , ACM, , pp. 25-34 .
Fergus, R. ; 0002, F. -F. L. ; Perona, P. & Zisserman, A. (2010), 'Learning Object Categories From Internet Image Searches. ', Proceedings of the IEEE 98 (8) , 1453-1466 .
Viola, P. & Jones, M. ( 2001), ' Rapid Object Detection using a Boosted Cascade of Simple Features' ' Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition' , Hawaii .
Li, Y. & Shapiro, L. G. (2002), Consistent Line Clusters for Building Recognition in CBIR. , in 'ICPR (3)' , pp. 952-956 .
Leibe, B. ; Leonardis, A. & Schiele, B. (2006), An Implicit Shape Model for Combined Object Categorization and Segmentation. , in Jean Ponce; Martial Hebert; Cordelia Schmid & Andrew Zisserman, ed. , 'Toward Category-Level Object Recognition' , Springer, , pp. 508-524 .
Hsieh, L. -C. & Hsu, W. H. (2010), Search-Based Automatic Image Annotation via Flickr Photos Using Tag Expansion. , in 'ICASSP' , IEEE, , pp. 2398-2401 .
Chen, Y. ; Zhu, L. ; Yuille, A. L. & Zhang, H. (2008), Unsupervised learning of probabilistic object models (POMs) for object classification, segmentation and recognition. , in 'CVPR' , IEEE Computer Society, .
He, R. ; Xiong, N. ; Yang, L. T. & Park, J. H. (2011), 'Using Multi-Modal Semantic Association Rules to fuse keywords and visual features automatically for Web image retrieval. ', Information Fusion 12 (3) , 223-230 .
hatzilari, E. ; Nikolopoulos, S. ; Papadopoulos, S. ; Zigkolis, C. & Kompatsiaris, Y. (2011), Semi-supervised object recognition using flickr images. , in José M. Martinez, ed. , 'CBMI' , IEEE, , pp. 229-234 .
Barrat, S. & Tabbone, S. (2010), 'Modeling, classifying and annotating weakly annotated images using Bayesian network. ', J. Visual Communication and Image Representation 21 (4) , 355-363 .
Liu, D. ; Hua, X. -S. ; Wang, M. & Zhang, H. -J. (2010), Image retagging. , in, 'ACM Multimedia' , ACM, , pp. 491-500 .
Chang, C. -Y. ; Wang, H. -J. & Li, C. -F. (2009), 'Semantic analysis of real-world images using support vector machine. ', Expert Syst. Appl. 36 (7) , 10560-10569 .
Rahman, M. M. ; Bhattacharya, P. & Desai, B. C. (2009), 'A unified image retrieval framework on local visual and semantic concept-based feature spaces. ', J. Visual Communication and Image Representation 20 (7) , 450-462 .
Wong, R. C. F. & Leung, C. H. C. (2008), 'Automatic Semantic Annotation of Real-World Web Images. ', IEEE Trans. Pattern Anal. Mach. Intell. 30 (11) , 1933-1944 .
Salman, N. (2006), 'Image Segmentation Based on Watershed and Edge Detection Techniques. ', Int. Arab J. Inf. Technol. 3 (2) , 104-110 .
Lowe, D. G. (2004), 'Distinctive Image Features from Scale-Invariant Keypoints', Int. J. Comput. Vision 60 (2) , 91--110 .
Z. Wang, Y. Mei, F. Yan, "A New Web Image Searching Engine by Using SIFT Algorithm", in Proc. WISM, 2009,pp 366-370
Quack, T. ; Mönich, U. ; Thiele, L. & Manjunath, B. S. (2004), Cortina: a system for large-scale, content-based web image retrieval. , in, 'ACM Multimedia' , ACM, , pp. 508-511
Deng, J. ; Dong, W. ; Socher, R. ; Li, L. -J. ; Li, K. & 0002, F. -F. L. (2009), ImageNet: A large-scale hierarchical image database. , in 'CVPR' , IEEE, , pp. 248-255 .
Liu, D. ; Hua, X. -S. ; Wang, M. & Zhang, H. -J. (2010), Retagging social images based on visual and semantic consistency. , in Michael Rappa; Paul Jones; Juliana Freire & Soumen Chakrabarti, ed. , 'WWW' , ACM, , pp. 1149-1150 .
Kilinç, D. & Alpkocak, A. (2011), 'An expansion and reranking approach for annotation-based image retrieval from Web. ', Expert Syst. Appl. 38 (10) , 13121-13127 .
Yang, C. ; Dong, M. & Fotouhi, F. (2005), I2A: an interactive image annotation system. , in 'ICME' , IEEE, , pp. 948-951 .
Jin, Y. ; 0021, L. W. & Khan, L. (2005), Improving Image Annotations Using WordNet. , in K. Selçuk Candan & Augusto Celentano, ed. , 'Multimedia Information Systems' , Springer, , pp. 115-130 .
Z. Wang, K. Jia, P. Liu," A Novel Image Retrieval Algorithm Based on ROI by Using SIFT Feature Matching" in Proc. MultiMedia and Information Technology ,2008,pp 338-341
The Wordnet website. [Online]. Available: http://wordnet. princeton. edu
Verb Semantics and Lexical Selection
G. Qi, X. Hua, and H. Zhang, "Learning semantic distance from community-tagged media collection", in Proc. ACM Multimedia, 2009, pp. 243-252.
Wang, Y. & Gong, S. (2007), Refining image annotation using contextual relations between words. , in Nicu Sebe & Marcel Worring, ed. , 'CIVR' , ACM, , pp. 425-432 .
Li, X. ; Snoek, C. G. M. & Worring, M. (2009), 'Learning Social Tag Relevance by Neighbor Voting. ', IEEE Transactions on Multimedia 11 (7) , 1310-1322 .
Agrawal ,R. and Srikant, R(1994). . Fast algorithms for mining association rules. VLDB'94.

Index Terms

Computer Science

Information Sciences

Keywords

Image Annotation Semantic Context Objects Recognition